Proceedings of the 34th ACM International Conference on Supercomputing 2020
DOI: 10.1145/3392717.3392773
|View full text |Cite
|
Sign up to set email alerts
|

How I learned to stop worrying about user-visible endpoints and love MPI

Abstract: MPI+threads is gaining prominence as an alternative to the traditional "MPI everywhere" model in order to better handle the disproportionate increase in the number of cores compared with other on-node resources. However, the communication performance of MPI+threads can be 100x slower than that of MPI everywhere. Both MPI users and developers are to blame for this slowdown. MPI users traditionally have not exposed logical communication parallelism. Consequently, MPI libraries have used conservative approaches, … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 16 publications
(6 citation statements)
references
References 30 publications
0
6
0
Order By: Relevance
“…The implicit method [11,16] builds on the thesis that MPI already has a sufficient mechanism to allow users to express the inherent parallelism in their communications. Based on the outcomedictated serial execution model, any operations that do not affect MPI-specified outcomes are candidates for deserialization.…”
Section: Implicit Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…The implicit method [11,16] builds on the thesis that MPI already has a sufficient mechanism to allow users to express the inherent parallelism in their communications. Based on the outcomedictated serial execution model, any operations that do not affect MPI-specified outcomes are candidates for deserialization.…”
Section: Implicit Methodsmentioning
confidence: 99%
“…Communications from separate VCIs can be fully concurrent. For a detailed discussion of VCIs, see [16].…”
Section: Mapping Vci To Mpix Streammentioning
confidence: 99%
See 1 more Smart Citation
“…We have included support for interaction of MPICH with user-level thread (ULT) libraries [39], primarily targeting Argobots and the BOLT runtime [40]. Other issues being investigated include the design and evaluation of multiple virtual communication interfaces (VCIs) for multithreaded MPI communication [41].…”
Section: Solution Strategymentioning
confidence: 99%
“…Unfortunately, the proposed model has foundered in the MPI Forum's standardization process. While endpoints supported thread ranks (in groups and communicators) and thus addressability in point‐to‐point and collective communication, the proposal was not adopted as improvements could be adopted in the MPI implementations transparently 5 and because it was judged that the likely impacts on transport state and receive‐queue lengths would be unacceptable with large‐count multicore nodes in current and forthcoming scalable systems 6,7 …”
Section: Introductionmentioning
confidence: 99%