2022
DOI: 10.48550/arxiv.2203.17063
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Efficient and Eventually Consistent Collective Operations

Roman Iakymchuk,
Amandio Faustino,
Andrew Emerson
et al.

Abstract: Collective operations are common features of parallel programming models that are frequently used in High-Performance (HPC) and machine/ deep learning (ML/ DL) applications. In strong scaling scenarios, collective operations can negatively impact the overall application performance: with the increase in core count, the load per rank decreases, while the time spent in collective operations increases logarithmically.In this article, we propose a design for eventually consistent collectives suitable for ML/ DL co… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 11 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?