SC18: International Conference for High Performance Computing, Networking, Storage and Analysis 2018
DOI: 10.1109/sc.2018.00065
|View full text |Cite
|
Sign up to set email alerts
|

ParSy: Inspection and Transformation of Sparse Matrix Computations for Parallelism

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
19
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
4

Relationship

2
6

Authors

Journals

citations
Cited by 19 publications
(19 citation statements)
references
References 46 publications
0
19
0
Order By: Relevance
“…Wavefronts of the joint DAG can be aggregated to reduce the number of synchronizations. DAG partitioners such as Load-Balanced Level Coarsening (LBC) [8] and DAGP [20] apply aggregation, however, when applied to the joint DAG because they aggregate iterations from consecutive wavefronts, load imbalance might still occur. Also, by aggregating iterations from wavefronts in the joint DAG, DAG partitioning methods potentially improve the temporal locality between the two kernels but, this can disturb spatial locality within each kernel.…”
Section: Unfusedmentioning
confidence: 99%
See 2 more Smart Citations
“…Wavefronts of the joint DAG can be aggregated to reduce the number of synchronizations. DAG partitioners such as Load-Balanced Level Coarsening (LBC) [8] and DAGP [20] apply aggregation, however, when applied to the joint DAG because they aggregate iterations from consecutive wavefronts, load imbalance might still occur. Also, by aggregating iterations from wavefronts in the joint DAG, DAG partitioning methods potentially improve the temporal locality between the two kernels but, this can disturb spatial locality within each kernel.…”
Section: Unfusedmentioning
confidence: 99%
“…𝑐 (𝑣 𝑖 ) is the computational load of a vertex and is defined as the total number of nonzeros touched to complete its computation. Because sparse matrix computations are generally memory bandwidth-bound, 𝑐 (𝑣 𝑖 ) is a good metric to evaluate load balance in the algorithm [8]. 𝐹 is stored in the compressed sparse row (CSR) format and 𝐹 𝑖 is used to extract the set of vertices in 𝐺 1 that 𝑣 𝑖 ∈ 𝑉 2 depends on.…”
Section: Inputs and Output To Mspmentioning
confidence: 99%
See 1 more Smart Citation
“…Several sparse tensor compilers target linear algebra operations (e.g. taco [Chou et al 2018;Kjolstad et al 2017], ParSy [Cheshmi et al 2017[Cheshmi et al , 2018). They focus on constructing efficient iteration spaces between different sparse matrices under linear algebra operations.…”
Section: Related Work 81 Array Compilersmentioning
confidence: 99%
“…Cheshmi et al [6,7] developed Sympiler, an I/E compiler to optimize sparse computations by exploiting properties of the sparsity structure. This leads to executor code which leverages the sparsity pattern within the computation.…”
Section: Related Workmentioning
confidence: 99%