2018
DOI: 10.48550/arxiv.1805.03648
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Parallel Programming for FPGAs

Abstract: Many people have contributed to making this book happen. Probably first and foremost are the many people who have done research in the area of High-Level Synthesis. Underlying each of the applications in this book are many individual synthesis and mapping technologies which combine to result in high-quality implementations.Many people have worked directly on the Vivado R HLS tool over the years. From the beginning in Jason Cong's research group at UCLA as the AutoPilot tool, to making a commercial product at A… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(21 citation statements)
references
References 31 publications
0
21
0
Order By: Relevance
“…At a high level, an FPGA can be described as an array of programmable logic blocks and programmable interconnect between those blocks that can be configured after fabrication to implement arbitrary program logic. 88 An example of an Intel FPGA (formerly Altera) architecture is shown in Figure 6.The Adaptive Logic Modules (ALMs) are the fundamental compute units of an FPGA. ALMs consist of a Lookup table (LUT) which is a memory that maps address signals as inputs and the outputs are stored in the corresponding memory entries.…”
Section: Reconfigurable Architectures For MDmentioning
confidence: 99%
See 1 more Smart Citation
“…At a high level, an FPGA can be described as an array of programmable logic blocks and programmable interconnect between those blocks that can be configured after fabrication to implement arbitrary program logic. 88 An example of an Intel FPGA (formerly Altera) architecture is shown in Figure 6.The Adaptive Logic Modules (ALMs) are the fundamental compute units of an FPGA. ALMs consist of a Lookup table (LUT) which is a memory that maps address signals as inputs and the outputs are stored in the corresponding memory entries.…”
Section: Reconfigurable Architectures For MDmentioning
confidence: 99%
“…ALMs consist of a Lookup table (LUT) which is a memory that maps address signals as inputs and the outputs are stored in the corresponding memory entries. 88 LUTs can be programmed to compute any n-input Boolean function. Full Adders perform addition and subtraction of binary inputs.…”
Section: Reconfigurable Architectures For MDmentioning
confidence: 99%
“…In order to build the Huffman tree, we must sort the symbols based on their frequencies after we filter out symbols with a frequency of zero. Previous works [26,51] use radix sort to reduce the utilization of hardware resources. However, we find that radix sort typically takes more than 30% of the total time when we break down the execution time of generating codewords.…”
Section: Efficient and Adaptive Huffman Codermentioning
confidence: 99%
“…Therefore, to realize a high degree of parallelism required in protocol sized LDPC codes, many BRAMs must be used in parallel to access the BP LLRs. Furthermore to meet FPGA device timing constraints, today's dualported support for BRAMs limits the size of a single data access to 2,048 bits and the number of BRAMs accessible in a single clock cycle to 1,024 [37,68,74]. This limitation results in the maximum degree of achievable parallelization in current top-end Xilinx FP-GAs, which corresponds to a 2,048 (1,024 × 2) LDPC code block length.…”
Section: Classical Bp Decoder Limitationsmentioning
confidence: 99%