Published January 1, 2021 | Version v1
Conference paper Open

Increasing Flexibility of FPGA-based CNN Accelerators with Dynamic Partial Reconfiguration

  • 1. Univ Twente, Enschede, Netherlands
  • 2. Tech Univ Ilmenau, Ilmenau, Germany

Description

Convolutional Neural Networks (CNNs) are widely used for image classification and have achieved high accuracy over the last decade. However, their computationally intensive operations make them demanding for embedded applications. In recent years, FPGA-based CNN accelerators have been proposed to improve energy efficiency and throughput. While dynamic partial reconfiguration (DPR) is increasingly used in CNN accelerators, the performance of dynamically reconfigurable accelerators is usually lower than that of purely static FPGA designs. This work presents a dynamically reconfigurable CNN accelerator architecture that does not sacrifice throughput or classification accuracy. The proposed accelerator is composed of reconfigurable macroblocks and dynamically utilizes the device resources according to the model parameters. Moreover, we devise what is, to the best of our knowledge, a novel approach to hide the computations of the pooling layers inside the convolutional layers, thereby further improving throughput. Using the proposed architecture and DPR, different CNN architectures can be realized on the same FPGA with optimized throughput and accuracy. The proposed architecture is evaluated by implementing two different LeNet CNN models, trained on different datasets and classifying different classes. Experimental results show that the implemented design achieves higher throughput than current LeNet FPGA accelerators.
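
The description does not say how the pooling computations are hidden inside the convolutional layers. As a purely illustrative software sketch, and not the paper's hardware design, one common way to fold 2x2 max pooling into a convolution is to update a running maximum as each convolution output is produced, so that no separate pooling pass over the feature map is needed. All function names, the 2x2 window, and the stride of 2 below are assumptions made for this example.

```python
def conv2d_valid(image, kernel):
    """Plain 'valid' 2-D convolution (cross-correlation form, no padding)."""
    kh, kw = len(kernel), len(kernel[0])
    oh, ow = len(image) - kh + 1, len(image[0]) - kw + 1
    return [[sum(image[r + i][c + j] * kernel[i][j]
                 for i in range(kh) for j in range(kw))
             for c in range(ow)]
            for r in range(oh)]


def max_pool2x2(fmap):
    """Reference 2x2 max pooling with stride 2 (odd trailing row/column dropped)."""
    return [[max(fmap[r][c], fmap[r][c + 1], fmap[r + 1][c], fmap[r + 1][c + 1])
             for c in range(0, len(fmap[0]) - 1, 2)]
            for r in range(0, len(fmap) - 1, 2)]


def fused_conv_pool(image, kernel):
    """Convolution with 2x2 max pooling folded in (hypothetical sketch).

    Each convolution output is compared against the running maximum of its
    pooling window as soon as it is computed, so the full-resolution feature
    map is never stored and no second pass is required.
    """
    kh, kw = len(kernel), len(kernel[0])
    oh, ow = len(image) - kh + 1, len(image[0]) - kw + 1
    ph, pw = oh // 2, ow // 2
    pooled = [[float("-inf")] * pw for _ in range(ph)]
    for r in range(ph * 2):
        for c in range(pw * 2):
            acc = sum(image[r + i][c + j] * kernel[i][j]
                      for i in range(kh) for j in range(kw))
            if acc > pooled[r // 2][c // 2]:
                pooled[r // 2][c // 2] = acc
    return pooled
```

For any input, `fused_conv_pool(image, kernel)` should match `max_pool2x2(conv2d_valid(image, kernel))`; the difference is only in when the comparison happens, which is the property a hardware pipeline could exploit.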

Files

bib-bf646509-21ed-435f-a9c8-3a0db9187e1d.txt (229 Bytes)
md5:fff53f9223bc0b9cbd37f465685ee263