Published: 2012
DOI: 10.1142/s0129626412400063

Targeting Heterogeneous Architectures via Macro Data Flow

Abstract: We propose a data flow based run time system as an efficient tool for supporting execution of parallel code on heterogeneous architectures hosting both multicore CPUs and GPUs. We discuss how the proposed run time system may be the target of both structured parallel applications developed using algorithmic skeletons/parallel design patterns and also more "domain specific" programming models. Experimental results demonstrating the feasibility of the approach are presented.
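
The macro data flow model the abstract refers to can be pictured with a small, self-contained sketch: instructions carrying macro-sized pieces of code become fireable once all of their input tokens have arrived, and fireable instructions are dispatched to CPU or GPU workers. The C++ below is an illustrative reconstruction under these assumptions (names such as MdfInstruction and MdfInterpreter are invented for the example); it is not the paper's actual run time system or the FastFlow API.

```cpp
// Minimal sketch of a macro data flow (MDF) interpreter. An MDF instruction
// becomes fireable when all of its input tokens are available; fireable
// instructions are then scheduled on a CPU or GPU worker. Illustrative only.
#include <cstdio>
#include <functional>
#include <queue>
#include <vector>

enum class Device { CPU, GPU };

struct MdfInstruction {
    int missing_inputs;                                 // tokens still to arrive
    std::function<int(const std::vector<int>&)> body;   // macro-sized work unit
    std::vector<int> inputs;                            // tokens received so far
    std::vector<int> consumers;                         // dependent instructions
    Device preferred = Device::CPU;                     // scheduling hint
};

class MdfInterpreter {
public:
    int add(MdfInstruction instr) {
        graph_.push_back(std::move(instr));
        return static_cast<int>(graph_.size()) - 1;
    }

    // Deliver a token; when the target has all its inputs, it becomes fireable.
    void deliver(int target, int token) {
        MdfInstruction& ins = graph_[target];
        ins.inputs.push_back(token);
        if (--ins.missing_inputs == 0) fireable_.push(target);
    }

    void run() {
        while (!fireable_.empty()) {
            int id = fireable_.front();
            fireable_.pop();
            MdfInstruction& ins = graph_[id];
            // A real runtime would offload GPU-preferred instructions via
            // CUDA/OpenCL; here we only print the scheduling decision.
            std::printf("firing %d on %s\n", id,
                        ins.preferred == Device::GPU ? "GPU" : "CPU");
            int result = ins.body(ins.inputs);
            for (int c : ins.consumers) deliver(c, result);
        }
    }

private:
    std::vector<MdfInstruction> graph_;
    std::queue<int> fireable_;
};
```

A program would build the graph with add(), inject the initial tokens with deliver(), and then call run(); in a real run time the fireable queue would be drained concurrently by a pool of CPU and GPU workers rather than by this single loop.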

Cited by 6 publications (5 citation statements)
References 17 publications

“…The feasibility of refactoring code in such a way that a map originally targeting CPU cores only is transformed into a map targeting CPU cores and GPUs has already been demonstrated in [1]. There we have shown not only that using both CPU cores and GPUs improves the performance of programs with respect to the performance achieved when using only CPU cores, but also that an automatic scheduling procedure may be set up which dynamically uses GPUs and CPU cores to achieve optimal load balancing and, therefore, performance.…”
Section: Results
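
The refactoring described in the statement above, a map whose chunks are dynamically shared between CPU cores and GPUs so that load balancing emerges from self-scheduling, can be sketched as follows. The chunk sizes, the single simulated GPU worker and the sqrt payload are assumptions made for illustration only; they are not the scheduling policy or the experiments of [1].

```cpp
// Hedged sketch of a heterogeneous map: CPU workers and one (simulated) GPU
// worker pull chunks from a shared atomic counter, so the faster device
// automatically takes more work and the load balances itself.
#include <algorithm>
#include <atomic>
#include <cmath>
#include <cstddef>
#include <thread>
#include <vector>

constexpr std::size_t kChunk = 1024;

void map_chunk(std::vector<float>& data, std::size_t begin, std::size_t end) {
    for (std::size_t i = begin; i < end; ++i) data[i] = std::sqrt(data[i]);
}

void heterogeneous_map(std::vector<float>& data, unsigned cpu_workers) {
    std::atomic<std::size_t> next{0};
    auto worker = [&](bool is_gpu) {
        // The GPU worker grabs larger chunks to amortise offload overhead.
        const std::size_t grab = is_gpu ? 8 * kChunk : kChunk;
        for (;;) {
            std::size_t begin = next.fetch_add(grab);
            if (begin >= data.size()) break;
            std::size_t end = std::min(begin + grab, data.size());
            // A real runtime would launch a CUDA/OpenCL kernel when is_gpu is
            // true; in this sketch both kinds of worker run the same CPU code.
            map_chunk(data, begin, end);
        }
    };
    std::vector<std::thread> pool;
    pool.emplace_back(worker, /*is_gpu=*/true);            // one GPU worker
    for (unsigned i = 0; i < cpu_workers; ++i)
        pool.emplace_back(worker, /*is_gpu=*/false);       // CPU workers
    for (auto& t : pool) t.join();
}
```
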
“…As already suggested in [1] the skeleton tree could also be annotated with information related to the target architecture at hand in order to optimize mappings and/or distribution of data. As an example, let us consider a system S provided with n CPUs and r GPUs defined as S = {cpu_1, …”
Section: Access Driven Optimization
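
A minimal sketch of the annotation idea quoted above: each node of a skeleton tree is tagged with a target chosen from a system description S = {cpu_1, ..., cpu_n, gpu_1, ..., gpu_r}. The node kinds and the toy policy (map nodes go to a GPU when one is available) are assumptions made for the example, not the optimization actually proposed in [1].

```cpp
// Illustrative skeleton tree annotated with target-architecture information.
#include <memory>
#include <vector>

enum class Skeleton { Seq, Pipe, Farm, Map };
enum class Target { CpuCores, Gpu };

struct SystemDescription {
    unsigned n_cpus;  // |{cpu_1, ..., cpu_n}|
    unsigned r_gpus;  // |{gpu_1, ..., gpu_r}|
};

struct SkeletonNode {
    Skeleton kind;
    std::vector<std::unique_ptr<SkeletonNode>> children;
    Target target = Target::CpuCores;  // annotation filled in by annotate()
};

// Walk the tree and attach a target annotation to every node.
void annotate(SkeletonNode& node, const SystemDescription& sys) {
    node.target = (node.kind == Skeleton::Map && sys.r_gpus > 0)
                      ? Target::Gpu
                      : Target::CpuCores;
    for (auto& child : node.children) annotate(*child, sys);
}
```

Annotating the tree this way before instantiating the macro data flow graph would let the compiling tools choose per-subtree mappings and data distributions, which is the optimization opportunity the quoted statement points at.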
“…We are currently working to implement the higher tier "algorithmic skeletons" in such a way that application programmers may seamlessly implement extended FastFlow applications much in the same way as they are used to implementing "single multi-core" applications with the original framework. The whole activity, along with the activities aimed at supporting GPUs within FastFlow [15], is aimed at providing suitable means to implement the computing model designed within ParaPhrase, an FP7 STREP project whose intent is to use parallel design patterns and algorithmic skeletons to program heterogeneous (multi-core plus GPU) collections of processing elements.…”
Section: Discussion
“…Data flow with non-negligible instruction code has been demonstrated to be very effective (in terms of performance) in the case of fine grain computations [18], [19]. Due to the smaller synchronisation overhead (available data determine execution of code, rather than abstract coordination of a template graph) we expect data flow implementations of structured parallel computations to be more efficient also as far as the performance/power trade-off is concerned.…”
Section: A) Unbalanced Embarrassingly Parallel Computations