2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC) 2018
DOI: 10.1109/dac.2018.8465781
|View full text |Cite
|
Sign up to set email alerts
|

Area-Optimized Low-Latency Approximate Multipliers for FPGA-based Hardware Accelerators

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0

Year Published

2020
2020
2025
2025

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 32 publications
(26 citation statements)
references
References 5 publications
0
26
0
Order By: Relevance
“…We have evaluated SIMDive against five SIMD and SISD accurate and approximate cutting-edge multipliers and dividers: performance-optimized accurate IPs of multiplier [36] and divider [37], provided by Xilinx Vivado, Mitchell [22], SoAs MBM [28], INZeD [29] and AAXD dividers [29] as they have the best resource-error trade-off when compared to the rest of designs ( [9,13,20,33]), CA [30] (based on approximate 4x4 multipliers) customized for FPGAs, and truncated multiplier (with 7x7 or 15x7 as the basic multiplier, the more accurate one is also exploited in SIMD structure). Note, hierarchical SIMD divider is not mathematically feasible by decomposing large one to small instances.…”
Section: Results and Discussion 41 Experimental Setupmentioning
confidence: 99%
See 3 more Smart Citations
“…We have evaluated SIMDive against five SIMD and SISD accurate and approximate cutting-edge multipliers and dividers: performance-optimized accurate IPs of multiplier [36] and divider [37], provided by Xilinx Vivado, Mitchell [22], SoAs MBM [28], INZeD [29] and AAXD dividers [29] as they have the best resource-error trade-off when compared to the rest of designs ( [9,13,20,33]), CA [30] (based on approximate 4x4 multipliers) customized for FPGAs, and truncated multiplier (with 7x7 or 15x7 as the basic multiplier, the more accurate one is also exploited in SIMD structure). Note, hierarchical SIMD divider is not mathematically feasible by decomposing large one to small instances.…”
Section: Results and Discussion 41 Experimental Setupmentioning
confidence: 99%
“…In particular, delay and energy are improved by 4× and 4.6×, respectively, in our proposed divider in SISD mode, as compared to accurate counterpart. In contrast, CA [30] with hierarchical implementation approach dissipates even more energy with lower throughput than accurate multiplier.…”
Section: Simulation and Synthesis Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…On FPGAs, approximate multipliers have to be implemented in logic and are not as fast as dedicated multiplication circuits for many input sizes. In state-of-the-art libraries such as [26,34,35], even small 8×8 multipliers show delays between 5ns and 12.5ns, which corresponds to a maximum operating frequency between 80MHz and 200MHz. The targeted color space conversion employs larger multiplications of size 12×x, likely limiting performance even further.…”
Section: Color Space Conversionmentioning
confidence: 99%