2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis 2010
DOI: 10.1109/sc.2010.29
|View full text |Cite
|
Sign up to set email alerts
|

Hierarchical Diagonal Blocking and Precision Reduction Applied to Combinatorial Multigrid

Abstract: Abstract-Memory bandwidth is a major limiting factor in the scalability of parallel iterative algorithms that rely on sparse matrix-vector multiplication (SpMV). This paper introduces Hierarchical Diagonal Blocking (HDB), an approach which we believe captures many of the existing optimization techniques for SpMV in a common representation. Using this representation in conjuction with precision-reduction techniques, we develop and evaluate high-performance SpMV kernels. We also study the implications of using o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0

Year Published

2011
2011
2023
2023

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 26 publications
(6 citation statements)
references
References 35 publications
0
6
0
Order By: Relevance
“…Indeed, Koutis et al have an algorithm and implementation based on related techniques that they refer to as "combinatorial multigrid" [BKMT10].Even if one does not demand provable running time bounds, we are not aware of any algorithm whose running time empirically scales nearly linearly on large classes of input graphs that does not roughly follow this general structure.…”
Section: Previous Nearly Linear Time Algorithmsmentioning
confidence: 99%
“…Indeed, Koutis et al have an algorithm and implementation based on related techniques that they refer to as "combinatorial multigrid" [BKMT10].Even if one does not demand provable running time bounds, we are not aware of any algorithm whose running time empirically scales nearly linearly on large classes of input graphs that does not roughly follow this general structure.…”
Section: Previous Nearly Linear Time Algorithmsmentioning
confidence: 99%
“…Recursive matrix layouts, similar to our QTS format, have inspired recent papers [4,17]. In their recursive sparse blocks (RSB) format [17], Martone et al, use a quad-tree with ZMorton ordering to store pointers to sparse submatrices that are sized to balance the work partitions.…”
Section: Non-symmetric Spmvmentioning
confidence: 99%
“…Tangwongsan et al [4], introduce a hierarchical storage format that points to the same memory region for off diagonal blocks if the matrix is symmetric (similar to our QTS format). These authors also point out that this kind of matrix storage enables lock and synchronization free concurrent symmetric SpMV.…”
Section: Concurrent Symmetric Spmvmentioning
confidence: 99%
“…Running algorithms on compressed inputs has been previously explored in the setting of sparse matrix-vector (spMV) multiplication [4,9,17,18,25]. Like graph algorithms, spMV is also a memory-bound computation, and so better improvements are observed in parallel.…”
Section: Introductionmentioning
confidence: 99%