Florent Pruvost scite author profile

Florent Pruvost

5Publications

85Citation Statements Received

63Citation Statements Given

How they've been cited

How they cite others

Affiliations

Laboratoire Bordelais de Recherche en Informatique, École Centrale Paris, French Institute for Research in Computer Science and Automation

Publications

Order By: Most citations

Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model

Agullo¹,

Aumage²,

Faverge³

et al. 2024

IEEE Trans. Parallel Distrib. Syst.

View full text Add to dashboard Cite

The emergence of accelerators as standard computing resources on supercomputers and the subsequent architectural complexity increase revived the need for high-level parallel programming paradigms. Sequential task-based programming model has been shown to efficiently meet this challenge on a single multicore node possibly enhanced with accelerators, which motivated its support in the OpenMP 4.0 standard. In this paper, we show that this paradigm can also be employed to achieve high performance on modern supercomputers composed of multiple such nodes, with extremely limited changes in the user code. To prove this claim, we have extended the StarPU runtime system with an advanced inter-node data management layer that supports this model by posting communications automatically. We illustrate our discussion with the task-based tile Cholesky algorithm that we implemented on top of this new runtime system layer. We show that it enables very high productivity while achieving a performance competitive with both the pure Message Passing Interface (MPI)-based ScaLAPACK Cholesky reference implementation and the DPLASMA Cholesky code, which implements another (non-sequential) task-based programming paradigm.

show abstract

Accelerated Waveform Relaxation methods for power systems

Pruvost

Laurent-Gengoux

Magoulès

et al. 2011

View full text Add to dashboard Cite

Preconditioners for Schwarz relaxation methods applied to differential algebraic equations

Magoulès

Laurent-Gengoux

Pruvost

2014

International Journal of Computer Mathematics

View full text Add to dashboard Cite

show abstract

Speed-up the computing efficiency of waveform relaxation method for power system transient stability

Pruvost

Laurent-Gengoux

Magoulès

et al. 2011

View full text Add to dashboard Cite

This paper deals with some improvements of the Waveform Relaxation method for large power systems transient stability analysis. In this context, the classical Waveform Relaxation iterative method is usually not efficient due to its slow convergence. The convergence speed depends on a lot of parameters such as the model, the disturbance nature, the number of subdomains created by the decomposition, the initialization, the time domain length, etc. In order to make the method competitive with usual and mastered sequential methods, some important points are here investigated such as an initialization of nonlinear integrations with linear solutions and a preconditioning technique to deal with a large number of subsystems. These innovative methods can bring interesting speedups for large power systems as illustrated in the presented numerical experiments on a large and realistic European power system.

show abstract

On the Arithmetic Intensity of Distributed-Memory Dense Matrix Multiplication Involving a Symmetric Input Matrix (SYMM)

Agullo

Buttari

Coulaud

et al. 2023

View full text Add to dashboard Cite

Dense matrix multiplication involving a symmetric input matrix (SYMM) is implemented in reference distributed-memory codes with the same data distribution as its general analogue (GEMM). We show that, when the symmetric matrix is dominant, such a 2D block-cyclic (2D BC) scheme leads to a lower arithmetic intensity (AI) of SYMM than that of GEMM by a factor of 2. We propose alternative data distributions preserving the memory benefit of SYMM of storing only half of the matrix while achieving up to the same AI as GEMM. We also show that, in the case we can afford the same memory footprint as GEMM, SYMM can achieve a higher AI. We propose a task-based design of SYMM independent of the data distribution. This design allows for scalable A-stationary SYMM with which all discussed data distributions, may they be very irregular, can be easily assessed. We have integrated the resulting code in a reduction dimension algorithm involving a randomized singular value decomposition dominated by SYMM. An experimental study shows a compelling impact on performance.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Florent Pruvost

Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model

Accelerated Waveform Relaxation methods for power systems

Preconditioners for Schwarz relaxation methods applied to differential algebraic equations

Speed-up the computing efficiency of waveform relaxation method for power system transient stability

On the Arithmetic Intensity of Distributed-Memory Dense Matrix Multiplication Involving a Symmetric Input Matrix (SYMM)

Contact Info

Product

Resources

About