2009
DOI: 10.1007/978-3-642-03869-3_11

Process Mapping for MPI Collective Communications

Abstract: It is an important problem to map virtual parallel processes to physical processors (or cores) in an optimized way to get scalable performance due to non-uniform communication cost in modern parallel computers. Existing work uses profile-guided approaches to optimize mapping schemes to minimize the cost of point-to-point communications automatically. However, these approaches cannot deal with collective communications and may get sub-optimal mappings for applications with collective communications. In …

Citations: cited by 32 publications (13 citation statements)
References: 16 publications
“…Thanks to an adequate placement policy enforced by both these mapping and binding parameters, it is possible to take into account the physical topology and reduce the communication costs for instance [14,15]. This is also used to improve collective communication performance [16]. Unfortunately, these options are totally non-standard and can even change from one version of a process manager to the other.…”
Section: Process Managers and Process Mapping
Type: mentioning
confidence: 99%
“…Various MPI process mapping methods have been proposed in the related studies. Most of the methods rely on offline profiling to trace communication between processes and to analyze the communication behaviors of the applications [2,9,21,31]. The main drawback of these methods is the requirement of offline profiling, which has a high overhead and is potentially time-consuming.…”
Section: Introduction
Type: mentioning
confidence: 99%
“…One consists in implementing codes that take into account the system characteristics [5]-[8], for instance minimizing the number of messages across the network or using blocks of data that fit in the caches to avoid cache misses. The other approach maps the processes to specific cores to improve the performance without changing the codes [9]-[11]. Knowledge of the topology of the machine and of some hardware parameters is necessary in both approaches.…”
Section: Introduction
Type: mentioning
confidence: 99%