2006 14th Annual IEEE Symposium on Field-Programmable Custom Computing Machines 2006
DOI: 10.1109/fccm.2006.8
|View full text |Cite
|
Sign up to set email alerts
|

A Hybrid Approach for Mapping Conjugate Gradient onto an FPGA-Augmented Reconfigurable Supercomputer

Abstract: Supercomputer companies such as Cray, Silicon Graphics, and SRC Computers now offer reconfigurable computer (RC)

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
28
0

Year Published

2006
2006
2015
2015

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 41 publications
(28 citation statements)
references
References 12 publications
0
28
0
Order By: Relevance
“…The three most notable examples are a molecular dynamics application [Kindratenko and Pointer 2006;Scrofano et al 2006], a full CG solver [Morris et al 2006], and a traffic simulation engine [Tripp et al 2005]. None of these efforts have considered requirements for future systems attempting to leverage FPGAs.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The three most notable examples are a molecular dynamics application [Kindratenko and Pointer 2006;Scrofano et al 2006], a full CG solver [Morris et al 2006], and a traffic simulation engine [Tripp et al 2005]. None of these efforts have considered requirements for future systems attempting to leverage FPGAs.…”
Section: Related Workmentioning
confidence: 99%
“…This led researchers to begin by focusing on kernel operations that are used in HPC and can be provided through a standard library interface. Operations from BLAS [Underwood and Hemmert 2004;Zhuo and Prasanna 2004;Dou et al 2005;Zhuo and Prasanna 2005a;Zhuo and Prasanna 2005b] to FFTs [Hemmert and Underwood 2005] to the sparse matrix operations at the core of an iterative solver [deLorimier and DeHon 2005;Zhuo and Prasanna 2005c] and even a full CG solver [Morris et al 2006] have been studied. The fundamental challenge for each of these efforts is the communications with the host.…”
Section: Introductionmentioning
confidence: 99%
“…A profile of CG 4 shows that it spends more than 95 percent of the execution time in SMVM (line 6 of the CG algorithm), so we targeted this module for the FPGA. Figure 3 shows our high-level CG design.…”
Section: High-level Cg Designmentioning
confidence: 99%
“…754 floating-point units, 1 molecular dynamics kernels, 2 linearalgebra routines, 3 and sparse matrix solvers. 4 Mapping two sparse matrix solvers onto an FPGA-augmented reconfigurable computers (RC) demonstrated more than a twofold speedup over software.…”
mentioning
confidence: 99%
“…One such property, the time invariance of A (A is not updated during CG iterations) leads to an obvious optimisation used in [6]: copy A to the hardware accelerator only once for multiple iterations. We use this property to statically schedule the entire algorithm (Section IV).…”
mentioning
confidence: 99%