Pratik Nayak scite author profile

Pratik Nayak

4 Publications

73 Citation Statements Received

11 Citation Statements Given

Affiliations

Institute for Plasma Research, Karlsruhe Institute of Technology, Kerntechnische Entsorgung Karlsruhe (Germany)

Publications

Load-balancing Sparse Matrix Vector Product Kernels on GPUs

Anzt, Cojean, Chen et al. 2020

ACM Trans. Parallel Comput.

Efficient processing of irregular matrices on Single Instruction, Multiple Data (SIMD)-type architectures is a persistent challenge. Resolving it requires innovations in the development of data formats, computational techniques, and implementations that strike a balance between thread divergence, which is inherent for irregular matrices, and padding, which alleviates the performance-detrimental thread divergence but introduces artificial overheads. To this end, in this article, we address the challenge of designing high performance sparse matrix-vector product (SpMV) kernels for NVIDIA Graphics Processing Units (GPUs). We present a compressed sparse row (CSR) format suitable for unbalanced matrices. We also provide a load-balancing kernel for the coordinate (COO) matrix format and extend it to a hybrid algorithm that stores part of the matrix in the SIMD-friendly Ellpack (ELL) format. The ratio between the ELL part and the COO part is determined using a theoretical analysis of the nonzeros-per-row distribution. For the over 2,800 test matrices available in the SuiteSparse matrix collection, we compare the performance against SpMV kernels provided by NVIDIA’s cuSPARSE library and a heavily tuned sliced ELL (SELL-P) kernel that prevents unnecessary padding by considering the irregular matrices as a combination of matrix blocks stored in ELL format.
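The hybrid ELL/COO idea in this abstract can be illustrated with a minimal CPU-side sketch in Python. It is not the paper's GPU kernel: the function names (`pick_ell_width`, `split_hybrid`, `hybrid_spmv`) are hypothetical, and the quantile rule used to choose the ELL width is an assumed stand-in for the paper's theoretical analysis of the nonzeros-per-row distribution, which the abstract does not spell out.

```python
# Minimal sketch of a hybrid ELL + COO sparse matrix-vector product.
# All names here are hypothetical; the quantile rule is an assumption,
# not the analysis used in the paper.

def pick_ell_width(rows, quantile=0.5):
    """Choose the ELL width as a quantile of the row lengths (assumed rule)."""
    lengths = sorted(len(row) for row in rows)
    index = min(len(lengths) - 1, int(quantile * len(lengths)))
    return lengths[index]

def split_hybrid(rows, ell_width):
    """Split rows of (column, value) pairs into a padded ELL part and a COO part."""
    ell_cols, ell_vals, coo = [], [], []
    for i, row in enumerate(rows):
        head, tail = row[:ell_width], row[ell_width:]
        pad = ell_width - len(head)
        # Short rows are padded with explicit zeros so every row has equal width.
        ell_cols.append([c for c, _ in head] + [0] * pad)
        ell_vals.append([v for _, v in head] + [0.0] * pad)
        # Entries beyond the ELL width overflow into coordinate (COO) triples.
        coo.extend((i, c, v) for c, v in tail)
    return ell_cols, ell_vals, coo

def hybrid_spmv(ell_cols, ell_vals, coo, x):
    """Compute y = A @ x from the two parts."""
    # Regular part: every row does the same amount of work.
    y = [sum(v * x[c] for c, v in zip(cols, vals))
         for cols, vals in zip(ell_cols, ell_vals)]
    # Irregular part: one update per overflow entry.
    for i, c, v in coo:
        y[i] += v * x[c]
    return y

# A small unbalanced matrix: row 1 is much longer than the others.
rows = [
    [(0, 1.0)],
    [(0, 2.0), (1, 3.0), (2, 4.0), (3, 5.0)],
    [(2, 6.0)],
    [(1, 7.0), (3, 8.0)],
]
x = [1.0, 2.0, 3.0, 4.0]

width = pick_ell_width(rows, quantile=0.5)
ell_cols, ell_vals, coo = split_hybrid(rows, width)
y = hybrid_spmv(ell_cols, ell_vals, coo, x)
```

A wider ELL part means more padding for short rows, while a narrower one pushes more entries into the COO part; that is the trade-off between padding and divergence the abstract describes.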

A Survey of Numerical Methods Utilizing Mixed Precision Arithmetic

Abdelfattah, Anzt, Boman et al. 2020

Preprint

Ginkgo: A high performance numerical linear algebra library

Anzt, Cojean, Chen et al. 2020

JOSS

Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing

Anzt, Cojean, Flegar et al. 2022

ACM Trans. Math. Softw.

In this article, we present Ginkgo, a modern C++ math library for scientific high performance computing. While classical linear algebra libraries act on matrix and vector objects, Ginkgo’s design principle abstracts all functionality as “linear operators,” motivating the notation of a “linear operator algebra library.” Ginkgo’s current focus is oriented toward providing sparse linear algebra functionality for high performance graphics processing unit (GPU) architectures, but given the library design, this focus can be easily extended to accommodate other algorithms and hardware architectures. We introduce this sophisticated software architecture that separates core algorithms from architecture-specific backends and provide details on extensibility and sustainability measures. We also demonstrate Ginkgo’s usability by providing examples on how to use its functionality inside the MFEM and deal.ii finite element ecosystems. Finally, we offer a practical demonstration of Ginkgo’s high performance on state-of-the-art GPU architectures.
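The "linear operator" abstraction described in this abstract can be sketched in a few lines of Python. This is an illustration of the design idea only, not Ginkgo's C++ API: the class names (`LinOp`, `Dense`, `JacobiPreconditioner`, `Richardson`) and the single-argument `apply` method are assumptions made for this example. The point is that a matrix, a preconditioner, and a solver all expose the same interface, so they can be combined freely.

```python
# Illustrative sketch of a "everything is a linear operator" design.
# Hypothetical classes; not Ginkgo's actual interface.

class LinOp:
    """Common interface: apply the operator to a vector."""
    def apply(self, b):
        raise NotImplementedError

class Dense(LinOp):
    """A matrix is an operator: apply(b) returns A @ b."""
    def __init__(self, rows):
        self.rows = rows

    def apply(self, b):
        return [sum(a * v for a, v in zip(row, b)) for row in self.rows]

class JacobiPreconditioner(LinOp):
    """A preconditioner is an operator: apply(b) returns D^{-1} b."""
    def __init__(self, matrix):
        self.inv_diag = [1.0 / matrix.rows[i][i] for i in range(len(matrix.rows))]

    def apply(self, b):
        return [d * v for d, v in zip(self.inv_diag, b)]

class Richardson(LinOp):
    """A solver is an operator: apply(b) returns an approximation of A^{-1} b."""
    def __init__(self, system, preconditioner, iterations=50):
        self.system = system
        self.preconditioner = preconditioner
        self.iterations = iterations

    def apply(self, b):
        x = [0.0] * len(b)
        for _ in range(self.iterations):
            # Residual r = b - A x, then preconditioned update x += M r.
            r = [bi - ai for bi, ai in zip(b, self.system.apply(x))]
            x = [xi + zi for xi, zi in zip(x, self.preconditioner.apply(r))]
        return x

# Solve the small diagonally dominant system 4x + y = 1, x + 3y = 2.
A = Dense([[4.0, 1.0], [1.0, 3.0]])
solver = Richardson(A, JacobiPreconditioner(A))
x = solver.apply([1.0, 2.0])
```

Because the solver only calls `apply` on the objects it is given, a different matrix format or preconditioner could be swapped in without changing the solver code, which is the kind of separation between algorithms and backends the abstract refers to.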
