2007
DOI: 10.1016/j.parco.2007.09.005
|View full text |Cite
|
Sign up to set email alerts
|

High performance combinatorial algorithm design on the Cell Broadband Engine processor

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0

Year Published

2008
2008
2012
2012

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 25 publications
(16 citation statements)
references
References 22 publications
0
16
0
Order By: Relevance
“…Also, the simulation of the benchmarks should finish within reasonable time and the performance statistics should have a significant number of branch miss stall cycles in order to have some room for improvement. In our opinion, the selected benchmarks are a good representation of applications suitable for the Cell processor and comply with our requirements MiniGZip is a parallel SPU implementation of the GZIP (de)compression program based on the ZLIB library, implemented by Seunghwa Kang [6]. The list ranking problem is a fundamental problem for many combinatorial and graphtheoretic applications.…”
Section: Methodsmentioning
confidence: 99%
“…Also, the simulation of the benchmarks should finish within reasonable time and the performance statistics should have a significant number of branch miss stall cycles in order to have some room for improvement. In our opinion, the selected benchmarks are a good representation of applications suitable for the Cell processor and comply with our requirements MiniGZip is a parallel SPU implementation of the GZIP (de)compression program based on the ZLIB library, implemented by Seunghwa Kang [6]. The list ranking problem is a fundamental problem for many combinatorial and graphtheoretic applications.…”
Section: Methodsmentioning
confidence: 99%
“…Another example is Cell Broadband Engine (Cell BE) in PlayStation3 and QS22 blade [14]. It is a heterogeneous multicore chip consisting of a traditional microprocessor called power processing element (PPE) that controls eight singleinstruction multiple-data (SIMD) co-processing units called synergistic processing elements (SPEs) [15].…”
Section: V(r) ← F({v(r′)|r′ ∈ Neighbor(r)})mentioning
confidence: 99%
“…T B is employed in the triplet because of the significant overhead due to branch misprediction in SPEs. However, since T B can be decreased by unrolling loops and inserting branch hints, it is difficult to report accurate number of actual branches [13]. When the misprediction probability is low, T B has little influence on execution time.…”
Section: (P P E) Cmentioning
confidence: 99%
“…Similar to the complexity model proposed in [13], the adapted model ignores several features of the Cell for the sake of simplifying the analysis. For example, we ignore the effect of floating point precision on the performance of numerical algorithms.…”
Section: (P P E) Cmentioning
confidence: 99%