2017 Devices for Integrated Circuit (DevIC) 2017
DOI: 10.1109/devic.2017.8073975
|View full text |Cite
|
Sign up to set email alerts
|

A fast FPGA based architecture for computation of square root and Inverse Square Root

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 28 publications
(23 citation statements)
references
References 15 publications
0
23
0
Order By: Relevance
“…The maximum throughput of this design exceeds the throughput of the designs presented in [7, 11], which used Xilinx Virtex‐5 and Virtex‐6 devices. Additionally, the design presented by Hasnat et al that used magic numbers required 12% of the high‐end Xilinx Virtex‐5 flip‐flop logic.…”
Section: Resultsmentioning
confidence: 91%
See 1 more Smart Citation
“…The maximum throughput of this design exceeds the throughput of the designs presented in [7, 11], which used Xilinx Virtex‐5 and Virtex‐6 devices. Additionally, the design presented by Hasnat et al that used magic numbers required 12% of the high‐end Xilinx Virtex‐5 flip‐flop logic.…”
Section: Resultsmentioning
confidence: 91%
“…Recent work into fast FPGA architectures for square‐root and inverse square‐roots is presented by Hasnat et al [11]. A methodology is presented that uses seven ‘magic’ numbers, found experimentally.…”
Section: Introductionmentioning
confidence: 99%
“…Hasnat et al [24] uses 199 LUTs and 24 FFs which are quite low. However, we assume they use 14 DSPs which is above our numbers.…”
Section: Resultsmentioning
confidence: 99%
“…Hasnat et al [24] develop an FPGA implementation that calculates single-precision floating-point square root and inverse square root simultaneously with Quake's algorithm [25] modified using Newton Raphson method. They achieve quite low numbers for resource usage on Virtex 5, however, they do not support pipelining and do not provide the number for DSP usage.…”
Section: Related Workmentioning
confidence: 99%
“…In particular, reducing the number of calls for the square root operations proves to be significant. This is because, in modern processors, the square root operation is not computed directly but through the calculation of the inverse square root [35], which requires calling a division operation at the end of the calculation in order to obtain the sought result [36,37,38]. Therefore, the total impact of the arithmetic reduction in the proposed algorithm is not just the ratio of the counting of arithmetic operations of the Cholesky based approach and the proposed fast algorithm.…”
Section: Discussionmentioning
confidence: 99%