2010
DOI: 10.1177/1094342009359011
|View full text |Cite
|
Sign up to set email alerts
|

Architecture of the Component Collective Messaging Interface

Abstract: Different programming paradigms utilize a variety of collective communication operations, often with different semantics. We present the component collective messaging interface (CCMI) that can support asynchronous nonblocking collectives and is extensible to different programming paradigms and architectures. CCMI is designed with components written in the C++ programming language, allowing it to be reusable and extendible. Collective algorithms are embodied in topological schedules and executors that execute … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
4
3
1

Relationship

4
4

Authors

Journals

citations
Cited by 8 publications
(7 citation statements)
references
References 14 publications
0
7
0
Order By: Relevance
“…The performance for large messages at PPN=4 and 16 saturates as the broadcast data spills out of the L2 cache and the performance is driven by DDR throughput. To improve broadcast performance, by up to a factor of nearly 10, we also implemented a 10-color rectangle broadcast, where the root sends data to all the remaining nodes in the 5D torus via 10 edge disjoint spanning trees [15]. The peak throughput of this algorithm is 18 GB/s.…”
Section: Performance Anaylysismentioning
confidence: 99%
“…The performance for large messages at PPN=4 and 16 saturates as the broadcast data spills out of the L2 cache and the performance is driven by DDR throughput. To improve broadcast performance, by up to a factor of nearly 10, we also implemented a 10-color rectangle broadcast, where the root sends data to all the remaining nodes in the 5D torus via 10 edge disjoint spanning trees [15]. The peak throughput of this algorithm is 18 GB/s.…”
Section: Performance Anaylysismentioning
confidence: 99%
“…Native Algorithm. Kumar et al [2010] describes the different native all-gather algorithms used on the Blue Gene/P.…”
Section: All-gather Algorithmsmentioning
confidence: 99%
“…PAMI collective library builds upon the portable Component Collective Messaging Interface (CCMI [4]) from Blue Gene/P, with optimized implementations on Blue Gene/Q, Power7 IH [6] and Intel x86 clusters.…”
Section: Pami Collectivesmentioning
confidence: 99%