2010
DOI: 10.1016/j.procs.2010.04.230
|View full text |Cite
|
Sign up to set email alerts
|

Boosting the performance of computational fluid dynamics codes for interactive supercomputing

Abstract: An extreme form of pipelining of the Piecewise-Parabolic Method (PPM) gas dynamics code has been used to dramatically increase its performance on the new generation of multicore CPUs. Exploiting this technique, together with a full integration of the several data post-processing and visualization utilities associated with this code has enabled numerical experiments in computational fluid dynamics to be performed interactively on a new, dedicated system in our lab, with immediate, user controlled visualization … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2011
2011
2015
2015

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(6 citation statements)
references
References 8 publications
0
5
0
Order By: Relevance
“…To fully realize these benefits, massive pipelining of numerical algorithms to achieve the highest possible amount of reuse of cached data is extremely important. The briquette data structure, massive code pipelining, and short, aligned vector operands all are features of the code restructuring and transformation that our team has been advocating as a result of our experience with the IBM Cell processor [2][3][4][5]. We have built and are refining automatic code translators that perform these code restructuring transformations as a precompilation step.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…To fully realize these benefits, massive pipelining of numerical algorithms to achieve the highest possible amount of reuse of cached data is extremely important. The briquette data structure, massive code pipelining, and short, aligned vector operands all are features of the code restructuring and transformation that our team has been advocating as a result of our experience with the IBM Cell processor [2][3][4][5]. We have built and are refining automatic code translators that perform these code restructuring transformations as a precompilation step.…”
Section: Discussionmentioning
confidence: 99%
“…It also shares with our full application a very important feature: it has a large difference stencil, and the computation proceeds in phases that alternate between computations of cell and cell interface quantities. Each of these alternating phases of computation becomes transformed in our high performance code expression into a separate stage in a computation pipeline (see [2][3][4][5]). In this way, this PPM advection kernel reflects the overall character of our full application.…”
Section: Advection Examplementioning
confidence: 99%
See 2 more Smart Citations
“…In the traditional blocking scheme, each loop runs over the entire domain before proceeding to the next loop. In an alternative scheme (Woodward et al, 2010) all of the loops are run on a block before moving to the next block, as illustrated in Figure 5. Each large rectangle represents the iteration space at different points of progress (indicated by shading), and each subrectangle represents a block of the iteration space that fits into local memory.…”
Section: </Machine>mentioning
confidence: 99%