2019
DOI: 10.3390/electronics8010065
|View full text |Cite
|
Sign up to set email alerts
|

A Uniform Architecture Design for Accelerating 2D and 3D CNNs on FPGAs

Abstract: Three-dimensional convolutional neural networks (3D CNNs) have gained popularity in many complicated computer vision applications. Many customized accelerators based on FPGAs are proposed for 2D CNNs, while very few are for 3D CNNs. Three-D CNNs are far more computationally intensive and the design space for 3D CNN acceleration has been further expanded since one more dimension is introduced, making it a big challenge to accelerate 3D CNNs on FPGAs. Motivated by the finding that the computation patterns of 2D … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
29
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 52 publications
(29 citation statements)
references
References 15 publications
0
29
0
Order By: Relevance
“…It is obvious that the error dynamics in Equations (18) and 22are driven by acceleration, which is manifested by velocity change. If we define acceleration a k = ∆v a k /T s that is always bounded in practice such that |a k | ≤ a max , then by considering 0 ≤ δt k < T s , we can evaluate the r.h.s.…”
Section: The Mt-type Division-less Algorithm Of the Second Ordermentioning
confidence: 99%
See 2 more Smart Citations
“…It is obvious that the error dynamics in Equations (18) and 22are driven by acceleration, which is manifested by velocity change. If we define acceleration a k = ∆v a k /T s that is always bounded in practice such that |a k | ≤ a max , then by considering 0 ≤ δt k < T s , we can evaluate the r.h.s.…”
Section: The Mt-type Division-less Algorithm Of the Second Ordermentioning
confidence: 99%
“…In the case of the sinusoidal variation of δt, we can observe relatively good tracking for both algorithms; however, a more smooth response with slightly better accuracy was achieved by the DLMT1 algorithm (as can be observed by comparison of Figure 8c,d. The tracking performance of the DLMT1 algorithm is better than the DLMT2 algorithm simply because the velocity error in the first case is affected by the difference ∆δt only (see Equation (38)), whereas in the second case, the δt may have a significant impact as well (see Equation (18)). In the case of the sawtooth variation of δt, we can observe high-velocity peaks when the sawtooth generates abrupt changes from zero to one.…”
Section: The Experimental Systemmentioning
confidence: 99%
See 1 more Smart Citation
“…Meng develops a deep model, termed the Gabor CNN, to address the computing-resource-saving problem [13]. Liu proposes a uniform architecture design by mapping convolutions to matrix multiplications for accelerating both two dimensional (2D) and three dimensional (3D) CNNs [14]. Despite the progress made, those accelerators take the algorithm as a black box, only focusing on hardware architecture optimization.…”
Section: Introductionmentioning
confidence: 99%
“…The current rise in the complexity of the applications and the increment of the capabilities of silicon technologies, as well as the so called time to market constrain, make HLS methodologies and tools of mandatory use in the near future [1]. Due to the multiple commercial solutions that can be found in the market for multiprocessor system-on-chips (MPSoCs) nowadays, it is strictly necessary to improve its techniques and methodologies [2] so that the technology is able to deal with the multiple implementation possibilities by using high-level design [3,4].…”
Section: Introductionmentioning
confidence: 99%