2014
DOI: 10.1137/120900216
|View full text |Cite
|
Sign up to set email alerts
|

Compressed Multirow Storage Format for Sparse Matrices on Graphics Processing Units

Abstract: A new format for storing sparse matrices is proposed for efficient sparse matrix-vector (SpMV) product calculation on modern graphics processing units (GPUs). This format extends the standard compressed row storage (CRS) format and can be quickly converted to and from it. Computational performance of two SpMV kernels for the new format is determined for over 130 sparse matrices on Fermi-class and Kepler-class GPUs and compared with that of five existing generic algorithms and industrial implementations, includ… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 24 publications
(10 citation statements)
references
References 20 publications
0
10
0
Order By: Relevance
“…Reference source not found.. This transition matrix is a compressed row format (CSR) [43,44] based on index of row ⟶ column delimited by commas generating the dictionary of the model which defines the transitions between nodes in the state-space and mapping of states. Through , we can know nothing more than the neighbours (child nodes) of the current node (states reachable through a single reaction), then we consider these neighbours (child nodes) as our only goal states and there can be many in numbers.…”
Section: Letmentioning
confidence: 99%
“…Reference source not found.. This transition matrix is a compressed row format (CSR) [43,44] based on index of row ⟶ column delimited by commas generating the dictionary of the model which defines the transitions between nodes in the state-space and mapping of states. Through , we can know nothing more than the neighbours (child nodes) of the current node (states reachable through a single reaction), then we consider these neighbours (child nodes) as our only goal states and there can be many in numbers.…”
Section: Letmentioning
confidence: 99%
“…However, the storage format considers only single precision. A similar idea of processing a sparse matrix in chunks larger than individual rows has been also exploited in the Compressed Multi-Row Storage (CMRS) format presented in [55]. CMRS extends the CSR format by dividing the matrix rows into strips; a warp is then assigned to a strip for processing.…”
Section: New Formats Based On Csrmentioning
confidence: 99%
“…The first step towards the implementation of this technique was the ability to use collaborative threads to process a single row (as opposed to the row-based parallelization just described). Advanced sparse matrix formats such as ELL-T [20], SIC [21], CMRS [22] or RgCSR [23] use slightly different approaches to map a row to one or more threads in a warp. Unfortunately, this thread mapping is not flexible (i.e.…”
Section: Related Workmentioning
confidence: 99%