2015
DOI: 10.1002/cpe.3457
|View full text |Cite
|
Sign up to set email alerts
|

Genetic algorithm based task reordering to improve the performance of batch scheduled massively parallel scientific applications

Abstract: SUMMARYThe growth in size of networked high performance computers along with novel accelerator-based node architectures has further emphasized the importance of communication efficiency in high performance computing. The world's largest high performance computers are usually operated as shared user facilities due to the costs of acquisition and operation. Applications are scheduled for execution in a shared environment and are placed on nodes that are not necessarily contiguous on the interconnect. Furthermore… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 29 publications
0
3
0
Order By: Relevance
“…However, we believe code modifications are possible which would make it unnecessary to use a random permutation of nodes: by an adjustment of the code it should be possible to recast the communication pattern as a nearest neighbor communication and then use known methods to map the communication pattern optimally to the network, see, e.g. [33]; this will be a topic of future study. In any case, as with other parallel applications, optimizing communications in a multiuser environment is challenging insofar as the network bandwidth is shared by other users and furthermore it is not always possible for a user to reserve a communication-optimal subset of nodes for job execution.…”
Section: -Way Weak Scaling Resultsmentioning
confidence: 99%
“…However, we believe code modifications are possible which would make it unnecessary to use a random permutation of nodes: by an adjustment of the code it should be possible to recast the communication pattern as a nearest neighbor communication and then use known methods to map the communication pattern optimally to the network, see, e.g. [33]; this will be a topic of future study. In any case, as with other parallel applications, optimizing communications in a multiuser environment is challenging insofar as the network bandwidth is shared by other users and furthermore it is not always possible for a user to reserve a communication-optimal subset of nodes for job execution.…”
Section: -Way Weak Scaling Resultsmentioning
confidence: 99%
“…Sankaran et al [33] used a genetic algorithms for optimizing the mapping for two large-scale parallel S3D and LAMMPS on the Cray XK7 machine. Bhanot et al [13] used simulated annealing to optimize task layout of parallel applications SAGE and UMT2000 on the BlueGene/L machine.…”
Section: Related Workmentioning
confidence: 99%
“…The search space is reduced by generating a full-active schedule that satisfies precedence constraints, a neighborhood search is applied to exploit the search space for better solutions and to enhance the GA. Simulation suggests sustainability of this hybrid GA in solving JSSP. Sankaran et al [21] proposed a GA based parallel optimization technique aiming to improve the performance of batch schedule of two massively parallel application codes; a turbulent combustion flow solver (S3D) and a molecular dynamics code (LAMMPS). Experiments have shown a significant deviation from ideal weak scaling and variability in performance.…”
Section: Previous Workmentioning
confidence: 99%