2008
DOI: 10.1109/mcse.2008.152

Moving Scientific Codes to Multicore Microprocessor CPUs

Cited by 18 publications (10 citation statements)
References 3 publications
“…It also shares with our full application a very important feature: it has a large difference stencil, and the computation proceeds in phases that alternate between computations of cell and cell interface quantities. Each of these alternating phases of computation becomes transformed in our high performance code expression into a separate stage in a computation pipeline (see [2][3][4][5]). In this way, this PPM advection kernel reflects the overall character of our full application.…”
Section: Advection Example
Citation type: mentioning
confidence: 99%
“…The indexing of these vector temporaries is baroque (cf. [4][5][6]). The programming effort required to produce such a code, modify it, debug it, and maintain it is excessive.…”
Section: The Solution: Extreme Pipelining of the Computation
Citation type: mentioning
confidence: 99%
“…The body of this outer loop will consist of a series of vector loops with tests and jumps to the end of the outer loop in between some of these inner loops (cf. [5,6]). This is basically the same program transformation that we described in [3] at a lower level of dimensionality and that, with that lower dimensionality, we used in alternative expressions of our sPPM benchmark code in the late 1990s.…”
Section: Reducing the Programming Burden
Citation type: mentioning
confidence: 99%
“…Like other heterogeneous systems supporting multiple instruction set architectures, the Cell is not easy to program [27], requiring two separate source codes: one for the PPE, and the second for the SPEs. However, it is relatively easier to program than GPUs employing new programming languages, such as OpenGL and CUDA.…”
Section: STI Cell BE
Citation type: mentioning
confidence: 99%