14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05) 2005
DOI: 10.1109/pact.2005.33
|View full text |Cite
|
Sign up to set email alerts
|

Optimizing Compiler for the CELL Processor

Abstract: Developed for multimedia and game applications, as well as other numerically intensive workloads, the CELL processor provides support both for highly parallel codes, which have high computation and memory requirements, and for scalar codes, which require fast response time and a full-featured programming environment. This first generation CELL processor implements on a single chip a Power Architecture processor with two levels of cache, and eight attached streaming processors with their own local memories and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
80
0
1

Year Published

2007
2007
2011
2011

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 122 publications
(81 citation statements)
references
References 29 publications
0
80
0
1
Order By: Relevance
“…The purpose is to evolve OpenMP into an appropriate programming model for many-core processors with explicitly managed memory hierarchy [21], e.g. the IBM CELL [22] and the IBM Cyclops-64 [23] processor.…”
Section: Future Workmentioning
confidence: 99%
“…The purpose is to evolve OpenMP into an appropriate programming model for many-core processors with explicitly managed memory hierarchy [21], e.g. the IBM CELL [22] and the IBM Cyclops-64 [23] processor.…”
Section: Future Workmentioning
confidence: 99%
“…In the remainder of this section we examine the approaches for mapping BDV algorithm across the CELL clusters and within the CELL. Compiler assisted parallelization using IBM XL compiler [31] will be part of our future studies.…”
Section: Mapping Bdv Onto the Ibm Cellmentioning
confidence: 99%
“…This small memory size influences the way code and data for the applications are partitioned across the CELL. For many programs with a small code size, function overlaying and resident partition management [31] might not be necessary. The total code size for the B-U computation is about 25 KB and hence no function overlaying mechanism was used in our implementation.…”
Section: Application Data Analysismentioning
confidence: 99%
See 1 more Smart Citation
“…The first is a software-caching technique, which emulates the behavior of a hardware cache by the software. The most representative example of such methods is the local memory in the CELL BE processor [2]. However, there is no highly successful scheme to eliminate the high address translation overhead at runtime, because a single memory reference instruction is replaced by a couple of instructions for softwareemulated cache lookups.…”
Section: Introductionmentioning
confidence: 99%