2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture 2010
DOI: 10.1109/micro.2010.13
|View full text |Cite
|
Sign up to set email alerts
|

Task Superscalar: An Out-of-Order Task Pipeline

Abstract: Abstract-We present Task Superscalar, an abstraction of instruction-level out-of-order pipeline that operates at the tasklevel. Like ILP pipelines, which uncover parallelism in a sequential instruction stream, task superscalar uncovers tasklevel parallelism among tasks generated by a sequential thread. Utilizing intuitive programmer annotations of task inputs and outputs, the task superscalar pipeline dynamically detects intertask data dependencies, identifies task-level parallelism, and executes tasks out-of-… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
104
0
1

Year Published

2014
2014
2021
2021

Publication Types

Select...
4
4
1

Relationship

4
5

Authors

Journals

citations
Cited by 108 publications
(105 citation statements)
references
References 19 publications
0
104
0
1
Order By: Relevance
“…We evaluate our proposals using an in-house trace-driven simulator, based on the methodology of [9], that models a multicore CPU connected to a discrete GPU through a PCIe bus. The simulator performs a coarse-grained modeling of the CPU, tracing the execution of our benchmarks on an Intel Core i7 930 chip.…”
Section: Methodsmentioning
confidence: 99%
“…We evaluate our proposals using an in-house trace-driven simulator, based on the methodology of [9], that models a multicore CPU connected to a discrete GPU through a PCIe bus. The simulator performs a coarse-grained modeling of the CPU, tracing the execution of our benchmarks on an Intel Core i7 930 chip.…”
Section: Methodsmentioning
confidence: 99%
“…The Task Superscalar [4] architecture was the first one to address this problem, proposing a decoupled model in which different finite state machines (modules) manage the most cumbersome functionalities of the runtime. The first implementation of the Task Superscalar architecture, the Hardware Task Superscalar, has already demonstrated high potential [5].…”
Section: Introductionmentioning
confidence: 99%
“…Dataflow computing offers a simple way to achieve high-performance, and high degree of concurrency and speculation, by means of implicit synchronization [2], [3]. Architectural exploitation of dataflow principles have been investigated in several research works [4]- [11]. Generally, dataflow-inspired execution models split the applications into a large set of threads [14], [25].…”
Section: Introductionmentioning
confidence: 99%