High Performance Computing on Vector Systems 2010 (2010)
DOI: 10.1007/978-3-642-11851-7_3

Empirical Optimization of Collective Communications with ADCL

Cited by 1 publication
(1 citation statement)
References 5 publications
“…To match DRAM page size and CPU cache size, the CMSSL library contains routines for automatic selection of optimal parameters, such as loop order and operator alignments, for matrix multiplication in both local and global scopes [18]. Benkert et al. use an empirical approach for MPI communication auto-tuning with the ADCL library [3]. In comparison, Orthrus introduces a self-adapting technique to improve collective I/O performance and demonstrates its feasibility and effectiveness, which may inspire more innovative applications of the technique to address I/O issues in large-scale HPC systems.…”
Section: Related Work (mentioning)
confidence: 99%