2022
DOI: 10.3390/electronics11071029
|View full text |Cite
|
Sign up to set email alerts
|

A Survey of Network-Based Hardware Accelerators

Abstract: Many practical data-processing algorithms fail to execute efficiently on general-purpose CPUs (Central Processing Units) due to the sequential matter of their operations and memory bandwidth limitations. To achieve desired performance levels, reconfigurable (FPGA (Field-Programmable Gate Array)-based) hardware accelerators are frequently explored that permit the processing units’ architectures to be better adapted to the specific problem/algorithm requirements. In particular, network-based data-processing algo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
9
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(10 citation statements)
references
References 48 publications
1
9
0
Order By: Relevance
“…However, the communication overheads (related to supplying the input data to the circuit) will always limit the ideal theoretical throughput. To solve this problem, designs have been proposed that allow the processing to be overlapped in time with data transfers [15].…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…However, the communication overheads (related to supplying the input data to the circuit) will always limit the ideal theoretical throughput. To solve this problem, designs have been proposed that allow the processing to be overlapped in time with data transfers [15].…”
Section: Methodsmentioning
confidence: 99%
“…This is because many processing elements can easily be instantiated, synthesized, and implemented according to the required network structure, and modern FPGAs contain plenty of distributed storage elements that can be used for effective pipelining. The principal characteristics, benefits, and limitations of the different approaches to implementing network-based hardware accelerators in FPGA and PSoC are reviewed in [15]. As indicated in [15], the majority of the analyzed implementations recur to low-level hardware designs (usually in VHDL/Verilog or in a specially developed language whose specifications are later translated to standard HDL (Hardware Description Language) RTL (Register-Transfer Level) descriptions).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The S1 receives ECON-T data, unpacks and calibrates it, and routes and sorts TCs in energy into projective 2 ϕ vs. 42 R/z bins per 120 • sector. The sorting uses batcher odd-even sorting networks [10][11][12], where on-the-fly truncation reduces the total number of firmware comparators required. Modules sums are here partially summed into module towers, and time multiplexing [13] with a 18 bunch-crossing period is applied before sending the data to S2.…”
Section: The Dataflow Of Hgcal Trigger Primitivesmentioning
confidence: 99%
“…The computation is executed until the array elements are sorted and in each iteration two phases occurs, Odd and Even Phases. In the odd phase, we perform a bubble sort on odd, and we perform a bubble sort on even indexed elements (11) . This process continues for the alternating index (odd-even pairs, even-odd pairs) until no swapping operation is performed and finally the array is in sorted form.…”
Section: Introductionmentioning
confidence: 99%