2006
DOI: 10.1007/11758501_30
Performance Improvement of Sparse Matrix Vector Product on Vector Machines

Abstract: Many applications based on finite element and finite difference methods include the solution of large sparse linear systems using preconditioned iterative methods. Matrix vector multiplication is one of the key operations that have a significant impact on the performance of any iterative solver. In this paper, recent developments in sparse storage formats on vector machines are reviewed. Then, several improvements to memory access in the sparse matrix vector product are suggested. Particularly, algori…
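For context, the kernel in question computes y = A·x with the sparse matrix A held in a compressed storage format. Below is a minimal sketch in the widely used CRS (compressed row storage) layout; the paper targets the memory access pattern of exactly this kind of loop. Function and parameter names are illustrative, not taken from the paper.

```c
#include <stddef.h>

/* Minimal sketch of sparse MVP y = A*x in CRS (compressed row storage).
 * Identifiers are illustrative, not from the paper. */
void spmv_crs(size_t n,              /* number of rows               */
              const double *val,     /* nonzero values               */
              const size_t *col_idx, /* column index per nonzero     */
              const size_t *row_ptr, /* start of each row in val     */
              const double *x,       /* input vector                 */
              double *y)             /* result vector                */
{
    for (size_t i = 0; i < n; i++) {
        double sum = 0.0;
        /* The indirect load x[col_idx[j]] is the costly memory access
         * that sparse storage formats try to mitigate. */
        for (size_t j = row_ptr[i]; j < row_ptr[i + 1]; j++)
            sum += val[j] * x[col_idx[j]];
        y[i] = sum;
    }
}
```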

Cited by 2 publications (2 citation statements). References 5 publications.
“…Using vector registers to reduce memory operations for loading and storing the result vector further improves the performance of JAD based sparse MVP to 2.2 GFlop/s. Further optimizations result in a maximum performance of 20% vector peak (which is 16 GFlop/s) for sparse MVP on NEC SX-8 [6].…”
Section: Average Vector Length
confidence: 99%
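To make the JAD (jagged diagonal) kernel referenced above concrete, here is a minimal C sketch, assuming rows are pre-sorted by decreasing nonzero count and that perm maps sorted positions back to original rows; all identifiers are illustrative, not from the paper. The work array w is loaded and stored once per jagged diagonal, which is precisely the memory traffic that keeping partial sums in vector registers eliminates.

```c
#include <stddef.h>

/* Minimal sketch of sparse MVP in JAD (jagged diagonal) storage.
 * Rows are assumed pre-sorted by decreasing nonzero count. */
void spmv_jad(size_t n,             /* number of rows                   */
              size_t njd,           /* number of jagged diagonals       */
              const double *val,    /* nonzeros, diagonal by diagonal   */
              const size_t *col,    /* column index per nonzero         */
              const size_t *jd_ptr, /* start of each jagged diagonal    */
              const size_t *perm,   /* sorted position -> original row  */
              const double *x,
              double *y,
              double *w)            /* work array of length n           */
{
    for (size_t i = 0; i < n; i++)
        w[i] = 0.0;
    for (size_t d = 0; d < njd; d++) {
        size_t len = jd_ptr[d + 1] - jd_ptr[d]; /* rows in this diagonal */
        const double *v = val + jd_ptr[d];
        const size_t *c = col + jd_ptr[d];
        /* Long stride-1 loop over many rows: good average vector length.
         * w is re-loaded and re-stored for every diagonal -- the traffic
         * the cited register optimization removes. */
        for (size_t i = 0; i < len; i++)
            w[i] += v[i] * x[c[i]];
    }
    /* Scatter the results back to the original row ordering. */
    for (size_t i = 0; i < n; i++)
        y[perm[i]] = w[i];
}
```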
“…Thus, small blocks can be formed by grouping the equations at each grid point. Operating on such dense blocks considerably reduces the amount of indirect addressing required for sparse MVP [6]. This improves the performance of the kernel dramatically on vector machines [9] and also remarkably on superscalar architectures [10,11].…”
Section: Block-Based Linear Iterative Solver (BLIS)
confidence: 99%
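A minimal sketch of this block-based idea follows, assuming a fixed b x b dense block per grid point stored in a block CRS layout; the block size and all identifiers are illustrative, not from the paper. Only one column index is fetched per block rather than one per nonzero, which is where the reduction in indirect addressing comes from.

```c
#include <stddef.h>

#define B 4 /* assumed block size: equations per grid point */

/* Minimal sketch of sparse MVP with small dense B x B blocks
 * (block CRS). Identifiers are illustrative, not from the paper. */
void spmv_bcrs(size_t nb,         /* number of block rows           */
               const double *val, /* dense blocks, row-major, B*B each */
               const size_t *bcol,/* block column index per block   */
               const size_t *brow,/* start of each block row        */
               const double *x,
               double *y)
{
    for (size_t ib = 0; ib < nb; ib++) {
        double sum[B] = {0.0};
        for (size_t jb = brow[ib]; jb < brow[ib + 1]; jb++) {
            const double *blk = val + jb * B * B;
            /* One indirect lookup per B x B block instead of per entry. */
            const double *xs = x + bcol[jb] * B;
            for (size_t r = 0; r < B; r++)
                for (size_t c = 0; c < B; c++)
                    sum[r] += blk[r * B + c] * xs[c];
        }
        for (size_t r = 0; r < B; r++)
            y[ib * B + r] = sum[r];
    }
}
```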