2009 International Conference on Field Programmable Logic and Applications 2009
DOI: 10.1109/fpl.2009.5272553
|View full text |Cite
|
Sign up to set email alerts
|

Generating high-performance custom floating-point pipelines

Abstract: Custom operators, working at custom precisions, are a key ingredient to fully exploit the FPGA flexibility advantage for high-performance computing. Unfortunately, such operators are costly to design, and application designers tend to rely on less efficient off-the-shelf operators. To address this issue, an open-source architecture generator framework is introduced. Its salient features are an easy learning curve from VHDL, the ability to embed arbitrary synthesizable VHDL code, portability to mainstream FPGA … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
42
0

Year Published

2010
2010
2023
2023

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 39 publications
(42 citation statements)
references
References 12 publications
0
42
0
Order By: Relevance
“…We applied the SMI tool flow to three benchmark circuits: a single precision floating point adder, multiplier and divider, generated by FloPoCo [5]. The resource usage of the benchmarks and the model-predicted maximum operating frequency (F max ) are given in Table 1.…”
Section: Resultsmentioning
confidence: 99%
“…We applied the SMI tool flow to three benchmark circuits: a single precision floating point adder, multiplier and divider, generated by FloPoCo [5]. The resource usage of the benchmarks and the model-predicted maximum operating frequency (F max ) are given in Table 1.…”
Section: Resultsmentioning
confidence: 99%
“…RELATED WORK Tenca [6] introduced a 3-input floating-point adder that achieves reduced error compared to two IEEE floating-point adders; however, this approach requires the precision of the effective operation to increase from f to 2f+5; this approach is unlikely to scale for more than three or four operands. The FloPoCo compiler [7] can also generate effective 3-input operations when necessary, and can eliminate effective subtraction when all input operands are known to be positive, e.g., when computing x 2 + y 2 + z 2 . Our approach, in contrast, takes the same approach as Altera's floating-point datapath compiler [1,2], and scales up to 16 operands, while improving upon both area and delay.…”
Section: Resultsmentioning
confidence: 99%
“…This work was also an in-depth case study for the pipeline generator framework of FloPoCo [15]. This framework enables easy composition and synchronization of components that pipeline themselves to run at a user-provided frequency.…”
Section: F Pipeline Tuningmentioning
confidence: 99%
“…It comes with test vector generation [15]. In general, it should be immediately usable for application designers.…”
Section: B Contributionsmentioning
confidence: 99%
See 1 more Smart Citation