2007
DOI: 10.1109/pact.2007.4336195

Architectural Support for the Stream Execution Model on General-Purpose Processors

Abstract: There has recently been much interest in stream processing, both in industry (e.g., Cell, NVIDIA G80, ATI R580) and academia (e.g., Stanford Merrimac, MIT RAW), with stream programs becoming increasingly popular for both media and more general-purpose computing. Although a special style of programming called stream programming is needed to target these stream architectures, huge performance benefits can be achieved. In this paper, we minimally add architectural features to commodity general-purpose processors …

Cited by 26 publications (25 citation statements)
References 30 publications
“…By contrast, for on-chip core-to-core communication, RDMA provides significant traffic reduction, which together with event responses and NI cache integration are the focus of our work. Streaming hardware support for general purpose systems exploiting caches for streaming data was considered in [10,9,23,4]. In [10] cache control bits are used for best-effort avoidance of replacements and scatter-gather enhancements of the L2 controller are proposed for a single-core system.…”
Section: Related Work and Contributions (mentioning)
confidence: 99%
“…Most pertinent to this work is Streamware [14] and the related architectural support [15] which target stream processing. They propose a software programmable Stream-LoadStore (SLS) hardware unit, that resembles EBP, and is used by the runtime software.…”
Section: Related Work (mentioning)
confidence: 99%
“…Streaming hardware support, for general purpose systems using caches, was considered in [4,9,10,27]. In [10] cache control bits are used for best-effort avoidance of replacements and scatter-gather enhancements of the L2 controller are proposed for a uniprocessor system.…”
Section: Related Work and Contributions (mentioning)
confidence: 99%
“…In [10] cache control bits are used for best-effort avoidance of replacements and scatter-gather enhancements of the L2 controller are proposed for a uniprocessor system. Streamware [9] exploits the compiler to avoid replacements of streaming data mapped to processor caches, for codes amenable to stream processing.…”
Section: Related Work and Contributions (mentioning)
confidence: 99%