2006
DOI: 10.1007/11752578_63
|View full text |Cite
|
Sign up to set email alerts
|

FPGA Implementation of the Conjugate Gradient Method

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2008
2008
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(3 citation statements)
references
References 6 publications
0
3
0
Order By: Relevance
“…Several FPGA implementations of CG have been proposed in the past. Given that the convergence of conjugate gradient method is sensitive to precision used, in [6], a fractional number system is proposed. In [7], a high performance architecture for CG, in which data blocking to partition large sparse matrices into square blocks, was proposed.…”
Section: Introduction and Previous Workmentioning
confidence: 99%
“…Several FPGA implementations of CG have been proposed in the past. Given that the convergence of conjugate gradient method is sensitive to precision used, in [6], a fractional number system is proposed. In [7], a high performance architecture for CG, in which data blocking to partition large sparse matrices into square blocks, was proposed.…”
Section: Introduction and Previous Workmentioning
confidence: 99%
“…These implementations include a Cholesky approach that achieved a performance increase of 50% over software on a APEX EP20K15000E FPGA [1]; a Jacobi solver implementation on a Xilinx VirtexII Pro XC2VP50 which achieved a speedup of 1.3 to 36.8 relative to a highend processor, depending on the matrix structure [2]; and two CG implementations. One of these implementations used the Logarithmic Number System (LNS) and reached up to 0.94 GFLOPS on a VirtexII-6000 [3], while the other used a rational number system representation and achieved 0.27 GFLOPS on a VirtexII Pro XC2VP4 [4]. Table 1 summarizes FPGA implementations of Conjugate Gradient method in terms of year of publication, number system, input problem structure, device and GFLOPS achieved.…”
Section: Introductionmentioning
confidence: 99%
“…One uses a Logarithmic Number System (LNS) and achieves up to 1.1 GFLOPS on a VirtexII-6000 [30]. The other uses a rational number representation and achieves 0.27 GFLOPS using a VirtexII Pro XC2VP4 [31] and projects that it will be able to sustain 15 GFLOPS on a Virtex4-55. In contrast, we present a widely-parallelised and deeply-pipelined Conjugate Gradient method using the IEEE 754 [32] single precision floating point number representation.…”
Section: Previous Fpga Implementationsmentioning
confidence: 99%