Run-time optimization of sparse matrix-vector multiplication on SIMD machines

Ziantz, Louis H.; Özturan, Can; Szymański, Bolesław K.

doi:10.1007/3-540-58184-7_111

Cited by 15 publications

(15 citation statements)

References 15 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…All programs are written in C + MPI (Message Passing Interface) [21] codes. The sparse ratio is set to 0.1 for all test three-dimensional sparse arrays used as test samples.…”

Section: Resultsmentioning

confidence: 99%

“…Ziantz et al [21] proposed a run-time technique that was applied to sparse arrays for array distributions and off-processor data fetching to reduce the communication and computation time. They used the block data distribution scheme with a bin-packing algorithm to distribute a global sparse array to processors.…”

Section: Related Workmentioning

confidence: 99%

“…However, it is a challenging problem to provide an efficient data distribution for irregular problems [18] on distributed memory multicomputers. In the literature [3,20,21], many methods have been proposed and were all performed in the following order, the data partition phase, then the data distribution phase, followed by the data compression phase, and called the Send Followed Compress (SFC). These methods are all focused on sparse arrays based on the traditional matrix representation (TMR) [12].…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Performance evaluation of data distributions with load-balancing for sparse arrays

Lin

Chung

Liu

2004

7th International Symposium on Parallel Architectures, Algorithms and Networks, 2004. Proceedings.

View full text Add to dashboard Cite

show abstract

“…All programs are written in C + MPI (Message Passing Interface) [21] codes. The sparse ratio is set to 0.1 for all test three-dimensional sparse arrays used as test samples.…”

Section: Resultsmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Performance evaluation of data distributions with load-balancing for sparse arrays

Lin

Chung

Liu

2004

7th International Symposium on Parallel Architectures, Algorithms and Networks, 2004. Proceedings.

View full text Add to dashboard Cite

show abstract

“…Ziantz et al [32] proposed a run-time optimization technique that was applied to sparse arrays for array distributions and off-processor data fetching to reduce the communication and the computation time. In their technique, they used the Block partition method with a binpacking algorithm to distribute a global sparse array to processors.…”

Section: Related Workmentioning

confidence: 99%

Efficient Data Distribution Schemes for EKMR-Based Sparse Arrays on Distributed Memory Multicomputers

2005

View full text Add to dashboard Cite

Abstract. Multi-dimensional sparse array operations can be used in the atmosphere and ocean sciences, the image processing, and etc., and have been an extensively investigated problem. Therefore, it becomes an important issue to propose efficient data distribution schemes for multi-dimensional sparse arrays. In our previous work, we have proposed two data distribution schemes Compress Followed Send (CFS) and Encoding-Decoding (ED) for sparse arrays based on the traditional matrix representation (TMR) scheme. We have proposed another scheme, called extended Karnaugh map representation (EKMR), to represent sparse arrays. The EKMR scheme can obtain better performance than the TMR scheme for some sparse array operations. Hence, in this paper, we want to propose efficient data distribution schemes for EKMR-based sparse arrays. We extend the CFS and the ED schemes for TMR-based sparse arrays to EKMR-based sparse arrays first. Then, we compare the performance of these two schemes with that of the Send Followed Compress (SFC), which is an intuitive data distribution scheme for sparse arrays. Finally, we compare these three schemes for EKMR-based sparse arrays with those of TMR-based sparse arrays, respectively. Both the theoretical analysis and the experimental tests were conducted. From the theoretical analysis and the experimental results, we can see that the ED scheme is superior to the CFS scheme that is superior to the SFC scheme for most of testing EKMR-based sparse arrays; the performance of these three schemes for EKMR-based sparse arrays is better than that of TMR-based sparse arrays for all of testing cases, respectively.

show abstract

“…In the data distribution phase, these local sparse arrays are distributed to processors. In the data compression phase, a local sparse array is compressed by data compression methods in order to obtain better performance for sparse array operations [7,15,16,18,21,23,26,30]. A data distribution scheme with this order is called the Send Followed Compress (SFC) scheme.…”

Section: Introductionmentioning

confidence: 99%