2014 43rd International Conference on Parallel Processing Workshops 2014
DOI: 10.1109/icppw.2014.47
|View full text |Cite
|
Sign up to set email alerts
|

Improving Data Movement Performance for Sparse Data Patterns on the Blue Gene/Q Supercomputer

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
5
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 19 publications
0
5
0
Order By: Relevance
“…Some approaches focused on improving data locality and used a polynomial time file domain assignment algorithm to minimize communication during the shuffle phase [11]. Other researchers took the routing mechanism of BG/Q into consideration when issued sparse data access [12]. Tessier et al [13] used a different approach which combines an optimized buffering system and a topology-aware aggregators mapping algorithm targeting any kind of architecture, so the new algorithm can be easily extended.…”
Section: Related Workmentioning
confidence: 99%
“…Some approaches focused on improving data locality and used a polynomial time file domain assignment algorithm to minimize communication during the shuffle phase [11]. Other researchers took the routing mechanism of BG/Q into consideration when issued sparse data access [12]. Tessier et al [13] used a different approach which combines an optimized buffering system and a topology-aware aggregators mapping algorithm targeting any kind of architecture, so the new algorithm can be easily extended.…”
Section: Related Workmentioning
confidence: 99%
“…Certain approaches focus on discovering data locality and using a polynomial-time file domain to aggregator assignment algorithm to minimize communication between computing processes and aggregators [15]. Other researchers try to take the routing mechanism into consideration when issuing sparse data access on BG/Q [16]. Previous IBM supercomputers BG/P adopt a general method designed to increase the I/O bandwidth of collective I/O [17].…”
Section: Related Workmentioning
confidence: 99%
“…Certain approaches focus on data locality and a polynomial time assignment algorithm (the Hungarian algorithm) to reduce the communication between compute nodes and aggregators [20]. Others have concentrated their efforts on the specific problem of sparse data patterns on the BG/Q, using an algorithm to take into account paths on the network topology [21]. A more general method designed to increase the I/O bandwidth of collective I/O for the previous version of IBM supercomputers BG/P has been proposed in [22].…”
Section: Related Workmentioning
confidence: 99%