2009 International Conference on Computational Science and Engineering 2009
DOI: 10.1109/cse.2009.223
|View full text |Cite
|
Sign up to set email alerts
|

A Comparative Study of Blocking Storage Methods for Sparse Matrices on Multicore Architectures

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
17
0
1

Year Published

2012
2012
2019
2019

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 28 publications
(18 citation statements)
references
References 9 publications
0
17
0
1
Order By: Relevance
“…Table 1 lists sparse matrices used in our performance evaluation. These are all the matrices used in previous papers [26,21,9] that are larger than the 30 MB aggregate L2 cache of KNC (using 60 cores). A dense matrix stored in sparse format is also included.…”
Section: Understanding the Performance Of Spmv On Kncmentioning
confidence: 98%
“…Table 1 lists sparse matrices used in our performance evaluation. These are all the matrices used in previous papers [26,21,9] that are larger than the 30 MB aggregate L2 cache of KNC (using 60 cores). A dense matrix stored in sparse format is also included.…”
Section: Understanding the Performance Of Spmv On Kncmentioning
confidence: 98%
“…In this case, we can state that there are three nonzeros of the matrix in the positions (OE0, 1, 23), (2,12), and (2,19). In this case, we can state that there are three nonzeros of the matrix in the positions (OE0, 1, 23), (2,12), and (2,19).…”
Section: Performance Evaluation Using Hardware Counters For Samplingmentioning
confidence: 96%
“…In [10], a performance model for the blocked SpMV, which allows to pick in nearly all cases the actual optimal block size, was presented. In a recent work [12], a comparative study of different blocking storage techniques for sparse matrices on several multicore platforms was performed. Vuduc et al [11] extended the notion of blocking in order to exploit variable block shapes by decomposing the original matrix to a proper sum of submatrices, storing each submatrix in a variation of the blocked CSR format.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…But, in [10] we opt for balancing the number of columns per block. While in [4,5,9] authors try to balance the number of nonzero elements. The disadvantage of this type of decomposition is that it does not consider the phenomenon of "over loop ".…”
Section: Related Workmentioning
confidence: 99%