2020
DOI: 10.1007/s11227-020-03186-1
Performance modeling of the sparse matrix–vector product via convolutional neural networks

Abstract: Modeling the execution time of the Sparse Matrix-Vector multiplication (SpMV) on a current CPU architecture is especially complex due to i) irregular memory accesses; ii) indirect memory referencing; and iii) low arithmetic intensity. While analytical models may yield accurate estimates for the total number of cache hits/misses, they often fail to accurately predict the total execution time. In this paper, we depart from the analytic approach and instead leverage Convolutional Neural Networks (CNNs) in order to…

Cited by 9 publications (10 citation statements) · References 22 publications
“…Similarly, 80% of the training dataset was utilized only for training, and the remaining 20% was used for validation, to guide the training process. In Barreda et al (2020c) we showed that the trained models for the time metric using this dataset provide an appropriate generalization power. Using too much training data could lead to a model that overfits the problem.…”
Section: Obtaining the Dataset
confidence: 86%
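The 80/20 training/validation split described in the statement above can be sketched as follows. This is an illustrative reconstruction, not the paper's code; the function name `split_dataset` and the fixed seed are assumptions for the example.

```python
import random

def split_dataset(samples, train_fraction=0.8, seed=0):
    """Shuffle and split samples into training and validation subsets.

    Illustrative sketch of an 80/20 split; the original work's exact
    shuffling and partitioning procedure is not specified here.
    """
    rng = random.Random(seed)  # fixed seed for a reproducible split
    shuffled = samples[:]
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]

samples = list(range(100))
train, val = split_dataset(samples)
print(len(train), len(val))  # 80 20
```

Holding out the 20% validation subset during training is what allows it to guide the process (e.g. early stopping) while still signaling overfitting.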
“…The strategy of partitioning the vpos0 array into blocks forces us to implement a blockwise version of the classic CSR-based SpMV Algorithm 1 to generate the training dataset for the CNNs, in which each block of vpos0 is labeled with its corresponding ratios of execution time and energy consumption (total, package, and DRAM) per nonzero element. In a previous work (Barreda et al, 2020c), we analyzed the impact of the block size on the time predictions and concluded that the proposed network architectures deliver accurate results for small block sizes. The reason is that, in general, small blocks reflect a small set of sparsity patterns which, in turn, can be better captured by the CNN filters.…”
Section: Methods
confidence: 96%
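For reference, the classic CSR-based SpMV kernel that the blockwise variant above builds on can be sketched like this. The array names `row_ptr`, `col_idx`, and `values` are the conventional CSR fields, used here as assumptions; the cited work's own notation (e.g. `vpos0`) and its blockwise labeling are not reproduced.

```python
def spmv_csr(row_ptr, col_idx, values, x):
    """Compute y = A @ x for a sparse matrix A in CSR format.

    row_ptr[i]..row_ptr[i+1] delimit the nonzeros of row i;
    col_idx[k] and values[k] give the column and value of nonzero k.
    """
    n = len(row_ptr) - 1
    y = [0.0] * n
    for i in range(n):
        # Accumulate the dot product of row i with x; the indirect
        # access x[col_idx[k]] is the source of the irregular memory
        # behavior the abstract mentions.
        for k in range(row_ptr[i], row_ptr[i + 1]):
            y[i] += values[k] * x[col_idx[k]]
    return y

# 2x2 example: A = [[1, 2], [0, 3]], x = [1, 1]  ->  y = [3, 3]
y = spmv_csr([0, 2, 3], [0, 1, 1], [1.0, 2.0, 3.0], [1.0, 1.0])
```

The blockwise version described in the statement would run this kernel per block of rows and record per-block time/energy ratios as CNN training labels.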
“…Other interesting efforts related to this topic are the article by Williams et al (2007), where the authors study several optimization techniques for the SpMV kernel over several hardware platforms; the proposal by Erguiz et al (2017), with advances in the automatic selection of different sparse triangular linear solvers on GPU; and the work by Barreda et al (2020), which offers a performance model of the SpMV kernel via convolutional neural networks with ARM as the target hardware platform.…”
Section: Automatic Tuning and Performance Models for the SpMV in GPUs
confidence: 99%