Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis 2013
DOI: 10.1145/2503210.2503234
|View full text |Cite
|
Sign up to set email alerts
|

Accelerating sparse matrix-vector multiplication on GPUs using bit-representation-optimized schemes

Abstract: The sparse matrix-vector (SpMV) multiplication routine is an important building block used in many iterative algorithms for solving scientific and engineering problems. One of the main challenges of SpMV is its memory-boundedness. Although compression has been proposed previously to improve SpMV performance on CPUs, its use has not been demonstrated on the GPU because of the serial nature of many compression and decompression schemes. In this paper, we introduce a family of bit-representation-optimized (BRO) c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
17
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
3
3
1

Relationship

1
6

Authors

Journals

citations
Cited by 45 publications
(17 citation statements)
references
References 24 publications
0
17
0
Order By: Relevance
“…Since then, GPU‐accelerated SpMVs have attracted considerable attention. Many new efficient SpMV kernels are proposed for GPUs using the variants of the CSR, ELL, and COO storage formats . On the multi‐GPU platform, researchers also develop many efficient SpMV algorithms .…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Since then, GPU‐accelerated SpMVs have attracted considerable attention. Many new efficient SpMV kernels are proposed for GPUs using the variants of the CSR, ELL, and COO storage formats . On the multi‐GPU platform, researchers also develop many efficient SpMV algorithms .…”
Section: Related Workmentioning
confidence: 99%
“…Many new efficient SpMV kernels are proposed for GPUs using the variants of the CSR, ELL, and COO storage formats. [7][8][9][10][11][12][13][14][15] On the multi-GPU platform, researchers also develop many efficient SpMV algorithms. 9,20,21 However, they only use a single storage format for all GPUs in these algorithms.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…BRO-COO reduces information redundancy in the row index array. Details of the BRO-ELL and BRO-COO storage formats can be found in [17].…”
Section: Bro-ell and Bro-coo Formatsmentioning
confidence: 99%
“…The problem with the reordering technique is that it changes the inherent locality of the original matrix. A recent work by Tang et al [20] studies bit-representations to compress index arrays. Similar to our work, a difference function is applied to index arrays.…”
Section: Related Workmentioning
confidence: 99%