2016
DOI: 10.1007/s11227-016-1871-z

A parallel pattern for iterative stencil + reduce

Abstract: We advocate the Loop-of-stencil-reduce pattern as a means of simplifying the implementation of data-parallel programs on heterogeneous multi-core platforms. Loop-of-stencil-reduce is general enough to subsume map, reduce, map-reduce, stencil, stencil-reduce, and, crucially, their usage in a loop in both data-parallel and streaming applications, or a combination of both. The pattern makes it possible to deploy a single stencil computation kernel on different GPUs. We discuss the implementation of Loop-of-stencil-reduce…


Cited by 11 publications (18 citation statements)
References 13 publications (20 reference statements)
“…As proposed in [16], using information from the application collected at runtime (without relying on user hints), it is possible to automatically derive the cutoff technique that is best suited for the application. In [2] we discussed the FastFlow implementation of a Loop-of-stencil-reduce pattern, targeting iterative data-parallel computations on heterogeneous multicores. We showed that various iterative kernels can be easily and effectively parallelised by using Loop-of-stencil-reduce on the available GPUs, exploiting the OpenCL capabilities of the FastFlow parallel framework.…”
Section: Results (mentioning; confidence: 99%)
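As a concrete illustration of the pattern this citation refers to, the following minimal C++ sketch shows a sequential loop-of-stencil-reduce over a 1-D grid: a Jacobi-style 3-point stencil fused with a max-reduce that drives the loop's convergence test. It is an invented example of the pattern's semantics, not the FastFlow/OpenCL implementation discussed in [2]; the kernel, function names, and convergence test are all assumptions.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// One iteration: apply a 3-point stencil to 'in', writing 'out', and
// reduce the per-element changes with max to measure convergence.
double stencil_reduce_step(const std::vector<double>& in,
                           std::vector<double>& out) {
    double delta = 0.0;                                     // reduce accumulator
    for (std::size_t i = 1; i + 1 < in.size(); ++i) {
        out[i] = (in[i - 1] + in[i] + in[i + 1]) / 3.0;     // stencil
        delta = std::max(delta, std::fabs(out[i] - in[i])); // reduce (max)
    }
    return delta;
}

// The "loop" part: iterate stencil+reduce until the reduced value
// says the grid has converged, or the iteration budget runs out.
void loop_of_stencil_reduce(std::vector<double>& grid,
                            double eps, int max_iters) {
    std::vector<double> next = grid;  // double buffer; boundaries stay fixed
    for (int it = 0; it < max_iters; ++it) {
        double delta = stencil_reduce_step(grid, next);
        grid.swap(next);
        if (delta < eps) break;       // loop condition comes from the reduce
    }
}
```

In the pattern as described in the abstract, the stencil and reduce phases would be offloaded as a single kernel to the available GPUs rather than run sequentially as here.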
“…The elements in each level (grid) are organized as boxes, each of which has k elements in every dimension. A level with a grid dimension of m therefore has (m/k)³ boxes. The boxes in each level are distributed evenly among the processes before the computation starts.…”
Section: HPGMG Benchmark (mentioning; confidence: 99%)
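The box count follows directly from the decomposition described above; a small C++ sketch of the arithmetic, with illustrative values of m, k, and the process count that are not taken from the benchmark:

```cpp
#include <cstdio>

// A level with grid dimension m, partitioned into boxes of k elements
// per dimension, contains (m/k)^3 boxes, spread evenly over processes.
int main() {
    const long m = 128, k = 8, nprocs = 32;           // illustrative values
    const long per_dim = m / k;                       // boxes along one axis
    const long boxes   = per_dim * per_dim * per_dim; // (128/8)^3 = 4096
    std::printf("boxes = %ld, ~%ld per process\n", boxes, boxes / nprocs);
    return 0;
}
```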
“…It implies that the computation of one box requires two layers of data from other boxes on each face. The overlapped areas between two boxes, called "ghost areas", therefore have a depth of 2 in this algorithm, and thus the boxes are "enlarged" to size (k + 2·2)³. The stencil computation also requires the three β parameters, with the pattern of βᵢ shown in Figure 3(c).…”
Section: HPGMG Benchmark (mentioning; confidence: 99%)
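A hedged sketch of how such a padded box might be laid out in C++: the interior k³ elements are surrounded by a ghost layer of depth 2 on every face, giving an allocation of (k + 2·2)³. The Box type and its flat indexing convention are assumptions for illustration, not code from HPGMG.

```cpp
#include <cstddef>
#include <vector>

// A box of k^3 interior elements padded with a 2-deep ghost layer on
// every face, so its allocated size is (k + 2*2)^3.
struct Box {
    static constexpr std::size_t GHOST = 2;  // ghost depth per face
    std::size_t k;                           // interior elements per dimension
    std::size_t n;                           // padded elements per dimension
    std::vector<double> data;

    explicit Box(std::size_t k_)
        : k(k_), n(k_ + 2 * GHOST), data(n * n * n, 0.0) {}

    // Access interior cell (i, j, l), each in 0..k-1, offset past the ghosts.
    double& at(std::size_t i, std::size_t j, std::size_t l) {
        return data[((i + GHOST) * n + (j + GHOST)) * n + (l + GHOST)];
    }
};
```

Before each stencil sweep, the ghost cells would be refilled from the faces of neighbouring boxes, which is where the inter-process communication in the benchmark arises.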
“…The map pattern is suitable for multicore architectures because each strand of computation on each node in the map-pattern sequence can be mapped to a core for parallel execution. The map pattern is well suited to embarrassingly parallel problems and can be nested with other patterns (Aldinucci et al, 2016), (Sheshikala et al, 2016) to create a more powerful pattern for computation. A map pattern can be combined with a reduce pattern to form a Map-Reduce pattern, which can further enhance parallel computation.…”
Section: Map Pattern (mentioning; confidence: 99%)
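As a small illustration of composing map with reduce, the sketch below fuses both into a single std::transform_reduce call using the C++17 parallel execution policy (this requires a standard library with parallel-algorithm support); the sum-of-squares example itself is invented, not drawn from the cited papers.

```cpp
#include <execution>  // C++17 parallel execution policies
#include <numeric>    // std::transform_reduce
#include <vector>

// Map followed by reduce, fused into one call: each element is squared
// independently (map), and the partial results are summed (reduce).
// The parallel policy lets the runtime spread the map strands across cores.
double sum_of_squares(const std::vector<double>& xs) {
    return std::transform_reduce(
        std::execution::par, xs.begin(), xs.end(),
        0.0,                              // reduce identity
        std::plus<>{},                    // reduce: +
        [](double x) { return x * x; });  // map: square
}
```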