Julien Herrmann scite author profile

We show in this paper how linear algebra calculations can be enhanced by statistical techniques in the case of a square linear system Ax = b. We study a random transformation of A that enables us to avoid pivoting and then to reduce the amount of communication. Numerical experiments show that this randomization can be performed at a very aordable computational price while providing us with a satisfying accuracy when compared to partial pivoting. This random transformation called Partial Random Buttery Transformation (PRBT) is optimized in terms of data storage and ops count. We propose a solver where PRBT and the LU factorization with no pivoting take advantage of the latest generation of hybrid multicore/GPU machines and we compare its Gop/s performance with a solver implemented in a current parallel library.

show abstract

Optimal Multistage Algorithm for Adjoint Computation

Aupy¹,

Herrmann²,

Hovland³

et al. 2016

SIAM J. Sci. Comput.

View full text Add to dashboard Cite

International audienceWe reexamine the work of Stumm and Walther on multistage algorithms for adjoint computation. We provide an optimal algorithm for this problem when there are two levels of checkpoints , in memory and on disk. Previously, optimal algorithms for adjoint computations were known only for a single level of checkpoints with no writing and reading costs; a well-known example is the binomial checkpointing algorithm of Griewank and Walther. Stumm and Walther extended that binomial checkpointing algorithm to the case of two levels of checkpoints, but they did not provide any optimality results. We bridge the gap by designing the first optimal algorithm in this context. We experimentally compare our optimal algorithm with that of Stumm and Walther to assess the difference in performance

show abstract

Multilevel Algorithms for Acyclic Partitioning of Directed Acyclic Graphs

Herrmann¹,

Ozkaya²,

Uçar³

et al. 2019

SIAM J. Sci. Comput.

View full text Add to dashboard Cite

Acyclic Partitioning of Large Directed Acyclic Graphs

Herrmann

Kho

Uçar

et al. 2017

View full text Add to dashboard Cite

Abstract-Finding a good partition of a computational directed acyclic graph associated with an algorithm can help find an execution pattern improving data locality, conduct an analysis of data movement, and expose parallel steps. The partition is required to be acyclic, i.e., the inter-part edges between the vertices from different parts should preserve an acyclic dependency structure among the parts. In this work, we adopt the multilevel approach with coarsening, initial partitioning, and refinement phases for acyclic partitioning of directed acyclic graphs and develop a direct k-way partitioning scheme. To the best of our knowledge, no such scheme exists in the literature. To ensure the acyclicity of the partition at all times, we propose novel and efficient coarsening and refinement heuristics. The quality of the computed acyclic partitions is assessed by computing the edge cut, the total volume of communication between the parts, and the critical path latencies. We use the solution returned by well-known undirected graph partitioners as a baseline to evaluate our acyclic partitioner, knowing that the space of solution is more restricted in our problem. The experiments are run on large graphs arising from linear algebra applications.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Julien Herrmann

Accelerating Linear System Solutions Using Randomization Techniques

Optimal Multistage Algorithm for Adjoint Computation

Multilevel Algorithms for Acyclic Partitioning of Directed Acyclic Graphs

Acyclic Partitioning of Large Directed Acyclic Graphs

Contact Info

Product

Resources

About