2014
DOI: 10.1016/j.parco.2014.10.002
|View full text |Cite
|
Sign up to set email alerts
|

Couillard: Parallel programming via coarse-grained Data-flow Compilation

Abstract: ResumoData-flow is a natural approach to parallelism. However, describing dependencies and control between finegrained data-flow tasks can be complex and present unwanted overheads. TALM (TALM is an Architecture and Language for Multi-threading) introduces a user-defined coarse-grained parallel data-flow model, where programmers identify code blocks, called superinstructions, to be run in parallel and connect them in a data-flow graph. TALM has been implemented as a hybrid Von Neumann/data-flow execution syste… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0
1

Year Published

2017
2017
2021
2021

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 19 publications
(6 citation statements)
references
References 30 publications
0
5
0
1
Order By: Relevance
“…TALM is an execution model which provides an instruction set including all special instructions required for dataflow model such as the Steer and IncTag instructions, introduced in Section 2.2, in order to enable dataflow execution on top of von Neumann machines. It has a toolchain composed of Trebuchet (a virtual machine), Couillard (a dataflow compiler) 21 and an assembler that generates the binary code from the TALM assembly language to run on Trebuchet .…”
Section: Experimental Validation Considerationsmentioning
confidence: 99%
See 1 more Smart Citation
“…TALM is an execution model which provides an instruction set including all special instructions required for dataflow model such as the Steer and IncTag instructions, introduced in Section 2.2, in order to enable dataflow execution on top of von Neumann machines. It has a toolchain composed of Trebuchet (a virtual machine), Couillard (a dataflow compiler) 21 and an assembler that generates the binary code from the TALM assembly language to run on Trebuchet .…”
Section: Experimental Validation Considerationsmentioning
confidence: 99%
“…TALM 20 introduces a user‐defined coarse‐grained parallel data‐flow model, where programmers identify code blocks, called super‐instructions, to be run in parallel and connect them in a data‐flow graph. Using the Trebuchet as a TALM's implementation, the Couillard 21 is a full compiler that creates a dataflow graph based on a super‐instructions expressed in an annotated C‐program. In 2014, was proposed a Python library that provides a dataflow programming in a minimalistic and reasonably simple syntax: the Sucuri 22 …”
Section: Introductionmentioning
confidence: 99%
“…Figure 5A shows the distribution only for reused subgraphs for the Eager-Match detection strategy with and without inspection (EM+I and EM, respectively). Each column shows the percentage of reused subgraphs with 2 to 4 nodes ( [2,4]), 5 to 9 nodes ( [5,9]), 10 to 49 nodes ( [10,49]), 50…”
Section: Of 12mentioning
confidence: 99%
“…In Dataflow, tasks are executed according to their dependencies, instead of following program order, thus allowing concurrent execution without the need of a program counter. Several recent investigations use the dataflow model as an appealing alternative for parallel programming that is more transparent to developers, while still providing the desirable performance . Moreover, dataflow architectures have been proposed by different research groups and are already a reality in the industry of high performance computing accelerators …”
Section: Introductionmentioning
confidence: 99%
“…The optimal partitioning is an NP-complete problem [51], in order to avoid time-consuming in automatic analysis of the code, the construction of the graph A for each application may be made in the design stage. The proposed characterization for CPS environments is based on a coarse-grained data flow approach which is a natural paradigm for describing digital signal processing applications for concurrent implementation on parallel hardware [57].…”
Section: Target Applicationsmentioning
confidence: 99%