2011 IEEE Symposium on Large Data Analysis and Visualization
DOI: 10.1109/ldav.2011.6092324
Scalable parallel building blocks for custom data analysis

Abstract: We present a set of building blocks that provide scalable data movement capability to computational scientists and visualization researchers for writing their own parallel analysis. The set includes scalable tools for domain decomposition, process assignment, parallel I/O, global reduction, and local neighborhood communication: tasks that are common across many analysis applications. The global reduction is performed with a new algorithm, described in this paper, that efficiently merges blocks of analysis resul…

Cited by 51 publications (28 citation statements)
References 36 publications
“…It is the job of the runtime to map those instructions to code running on a process and messages exchanged between processes. The starting point of our work is DIY1 [32,34], a C library that structures data into blocks. It expects the computation to be organized in a bulk-synchronous pattern, but does not enforce this structure through programming convention.…”
Section: Data Parallelism and Block-structured Abstractions (mentioning)
confidence: 99%
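The block-structured, bulk-synchronous organization described in this statement can be pictured with a short C/MPI sketch. This is an illustration only, not the DIY1 API: the block count, the round count, and compute_on_block are hypothetical placeholders; only the general pattern (local work on each owned block, then a global synchronization step) follows the description above.

```c
/* Minimal sketch (not the DIY1 API) of a block-structured,
 * bulk-synchronous computation: each MPI process owns several
 * blocks, computes on each block locally, then all processes
 * take part in a global exchange before the next round. */
#include <mpi.h>
#include <stdio.h>

#define BLOCKS_PER_RANK 4   /* hypothetical blocks per process */
#define ROUNDS 3            /* hypothetical number of BSP rounds */

static void compute_on_block(int gid, double *val)
{
    *val += gid;            /* placeholder for per-block analysis work */
}

int main(int argc, char **argv)
{
    int rank;
    double block_val[BLOCKS_PER_RANK] = {0};

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    for (int round = 0; round < ROUNDS; round++) {
        /* local phase: iterate over the blocks this process owns */
        for (int b = 0; b < BLOCKS_PER_RANK; b++) {
            int gid = rank * BLOCKS_PER_RANK + b;   /* global block id */
            compute_on_block(gid, &block_val[b]);
        }

        /* global phase: bulk-synchronous exchange between rounds */
        double local_sum = 0.0, global_sum = 0.0;
        for (int b = 0; b < BLOCKS_PER_RANK; b++)
            local_sum += block_val[b];
        MPI_Allreduce(&local_sum, &global_sum, 1, MPI_DOUBLE,
                      MPI_SUM, MPI_COMM_WORLD);
        if (rank == 0)
            printf("round %d: global sum %.1f\n", round, global_sum);
    }

    MPI_Finalize();
    return 0;
}
```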
“…We compare the performance with DIY1 [32,34] (which only supports in-core blocks) and, when possible, with equivalent collective functions in MPI itself (by assigning one block per MPI rank).…”
(Figure 2 of the citing paper: Cian mini-app merge- and swap-reduce using DIY2 compared with MPI using reduce and reduce-scatter, respectively.)
Section: Benchmark Applications (mentioning)
confidence: 99%
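The MPI baselines named in this comparison are the standard MPI_Reduce and MPI_Reduce_scatter collectives. The plain C/MPI sketch below only illustrates those two baselines (one rooted result versus a result scattered across ranks); the array size and the sum operation are illustrative choices, not taken from the benchmark.

```c
/* Sketch of the two MPI baselines named in the comparison:
 * a rooted MPI_Reduce (counterpart of a merge-style reduction)
 * and MPI_Reduce_scatter (counterpart of a swap-style reduction,
 * which leaves the result distributed across ranks). */
#include <mpi.h>
#include <stdlib.h>

int main(int argc, char **argv)
{
    int rank, size;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    int n = 8 * size;                        /* illustrative element count */
    float *in  = malloc(n * sizeof(float));
    float *out = malloc(n * sizeof(float));
    for (int i = 0; i < n; i++) in[i] = (float)rank;

    /* merge-style baseline: rank 0 ends up with the full reduced array */
    MPI_Reduce(in, out, n, MPI_FLOAT, MPI_SUM, 0, MPI_COMM_WORLD);

    /* swap-style baseline: every rank receives an equal slice of the result */
    int *counts = malloc(size * sizeof(int));
    for (int r = 0; r < size; r++) counts[r] = n / size;
    float *slice = malloc((n / size) * sizeof(float));
    MPI_Reduce_scatter(in, slice, counts, MPI_FLOAT, MPI_SUM, MPI_COMM_WORLD);

    free(in); free(out); free(counts); free(slice);
    MPI_Finalize();
    return 0;
}
```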
“…It also uses the DIY [23] data-parallel programming library for the multi-GPU extension, the HDF5 library for data I/O, and the Simple DirectMedia Layer (SDL) [24] library for OpenGL visualization. The programming language used within CUDA (CUDA-C) is an extension of the C programming language which allows one to implement GPU-based parallel functions, called kernels, which, when called, are executed n times in parallel by n different CUDA threads.…”
Section: Parallel Implementation (mentioning)
confidence: 99%
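The execution model this statement refers to (a kernel launched once but executed n times in parallel, once per CUDA thread) can be shown with a minimal CUDA-C sketch. The kernel name, data, and launch configuration here are illustrative and unrelated to the citing paper's code.

```c
/* Minimal CUDA-C sketch of the kernel model described above:
 * one __global__ function, launched once, runs n times in
 * parallel, with each CUDA thread handling one element. */
#include <cuda_runtime.h>

__global__ void scale(float *data, float factor, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;  /* unique thread index */
    if (i < n)
        data[i] *= factor;                          /* one element per thread */
}

int main(void)
{
    const int n = 1 << 20;
    float *d_data;
    cudaMalloc(&d_data, n * sizeof(float));
    cudaMemset(d_data, 0, n * sizeof(float));

    int threads = 256;
    int blocks = (n + threads - 1) / threads;       /* enough threads to cover n */
    scale<<<blocks, threads>>>(d_data, 2.0f, n);
    cudaDeviceSynchronize();

    cudaFree(d_data);
    return 0;
}
```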
“…Given any of these patterns, spatially contiguous scan subregions can be defined such that the degree of overlap between adjacent scan points is preserved. Data partitioning is achieved using the DIY parallel programming library [23] that is written on top of MPI to facilitate communication between parallel processes. In DIY terminology, we assign a DIY block (not to be confused with CUDA blocks) to each GPU.…”
Section: Multi-GPU Algorithm (mentioning)
confidence: 99%
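A rough sketch of the block-to-GPU mapping this statement describes: each MPI process is handed one spatial block of the scan region and binds it to one GPU. This is not the DIY API; the 1-D decomposition, the scan size, and device selection by rank are assumptions made purely for illustration.

```c
/* Sketch (not the DIY API) of assigning one block per process
 * and one GPU per block: each MPI rank picks a device and owns
 * a contiguous slice of the scan region. */
#include <mpi.h>
#include <cuda_runtime.h>
#include <stdio.h>

int main(int argc, char **argv)
{
    int rank, size, ngpus;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    cudaGetDeviceCount(&ngpus);
    if (ngpus < 1) ngpus = 1;           /* guard for GPU-less nodes */
    cudaSetDevice(rank % ngpus);        /* one block/process per GPU */

    /* hypothetical 1-D decomposition of the scan region into blocks */
    int total_points = 1000;            /* illustrative scan size */
    int points_per_block = total_points / size;
    int start = rank * points_per_block;
    printf("rank %d: scan points [%d, %d) on GPU %d\n",
           rank, start, start + points_per_block, rank % ngpus);

    MPI_Finalize();
    return 0;
}
```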