2014
DOI: 10.1007/978-3-319-09967-5_18
|View full text |Cite
|
Sign up to set email alerts
|

Compiler Optimizations for Non-contiguous Remote Data Movement

Abstract: Abstract. Remote Memory Access (RMA) programming is one of the core concepts behind modern parallel programming languages such as UPC and Fortran 2008 or high-performance libraries such as MPI-3 One Sided or SHMEM. Many applications have to communicate noncontiguous data due to their data layout in main memory. Previous studies showed that such non-contiguous transfers can reduce communication performance by up to an order of magnitude. In this work, we demonstrate a simple scheme for statically optimizing non… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2019
2019
2019
2019

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 21 publications
0
1
0
Order By: Relevance
“…Non-contiguous memory transfers can be accelerated by compiling DDTs pack/unpack functions at runtime [11,49]. However, the main optimization of this approach it to choose the best data copying strategy for x86 architectures [50]. The same cannot be applied to NICs, where the only way to move data to the host is by issuing DMA writes to it.…”
Section: Related Workmentioning
confidence: 99%
“…Non-contiguous memory transfers can be accelerated by compiling DDTs pack/unpack functions at runtime [11,49]. However, the main optimization of this approach it to choose the best data copying strategy for x86 architectures [50]. The same cannot be applied to NICs, where the only way to move data to the host is by issuing DMA writes to it.…”
Section: Related Workmentioning
confidence: 99%