2011 21st International Conference on Field Programmable Logic and Applications 2011
DOI: 10.1109/fpl.2011.91
|View full text |Cite
|
Sign up to set email alerts
|

Synthesizing Tiled Matrix Decomposition on FPGAs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
12
0

Year Published

2012
2012
2014
2014

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(12 citation statements)
references
References 16 publications
0
12
0
Order By: Relevance
“…Similar concepts as in this paper for FPGA implementation and design of a unified, area-efficient unit that can perform the necessary computations (division, square root and inverse square root operations that will be discussed later) for calculating Householder QR factorization is presented in [11]. Finally, a tiled matrix decomposition based on blocking principles is presented in [12].…”
Section: Related Workmentioning
confidence: 99%
“…Similar concepts as in this paper for FPGA implementation and design of a unified, area-efficient unit that can perform the necessary computations (division, square root and inverse square root operations that will be discussed later) for calculating Householder QR factorization is presented in [11]. Finally, a tiled matrix decomposition based on blocking principles is presented in [12].…”
Section: Related Workmentioning
confidence: 99%
“…We survey recent work on tall-skinny QR factorization on parallel architectures like multi-cores [7], GPUs [8] and FPGAs [9]. While the performance of multi-cores is good for square matrices (90 GFLOPs with 58.8% efficiency), it decreases signficantly for tall-skinny matrices (2 GFLOPs with 1.2% efficiency) .…”
Section: Related Workmentioning
confidence: 99%
“…Recently, Tai et.al. [9] presented an architecture comprising linear array of processing elements (PEs) for the Householder QR targeting large square matrices. Each PE is responsible for computing an iteration of the outer loop in Algorithm 1.…”
Section: Parallel Architecture For Householder Qrmentioning
confidence: 99%
See 2 more Smart Citations