2008
DOI: 10.1007/978-3-540-68555-5_18
|View full text |Cite
|
Sign up to set email alerts
|

Nested Parallelization of the Flow Solver TFS Using the ParaWise Parallelization Environment

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2010
2010
2011
2011

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 4 publications
0
3
0
Order By: Relevance
“…Furthermore, there are situations where only a small number of blocks have been defined for the simulation (in the extreme case a single block), so the block level parallelisation will not satisfy every eventuality. As an overhead is incurred each time a loop is parallelised, the higher the level of the loop the parallelisation is applied at, the lower the overheads will be, as discussed in [10]. Therefore, the ideal loop for an OpenMP parallelisation in the typical COSA is the block loop.…”
Section: Shared Memory Parallelisationmentioning
confidence: 99%
“…Furthermore, there are situations where only a small number of blocks have been defined for the simulation (in the extreme case a single block), so the block level parallelisation will not satisfy every eventuality. As an overhead is incurred each time a loop is parallelised, the higher the level of the loop the parallelisation is applied at, the lower the overheads will be, as discussed in [10]. Therefore, the ideal loop for an OpenMP parallelisation in the typical COSA is the block loop.…”
Section: Shared Memory Parallelisationmentioning
confidence: 99%
“…The ParaWise/CAPO automatic parallelization environment [15,16] has been used to assist in the OpenMP parallelization of the TFS multi-block code accomplished by the coauthors of [3] from Parallel Software Products and the University of Greenwich. A parallel version of TFS that can scale to large numbers of processors targeted at Sun Microsystems Sun Fire E25K shared memory parallel systems (SFE25K) was the ultimate goal of this work [17].…”
Section: Fig 3 Computing the Critical Points For A Combustion Enginementioning
confidence: 99%
“…OpenMP is employed on the block and on the loop level. This application puts a high load on the memory system and thus is quite sensitive to ccNUMA effects [3].…”
Section: Introductionmentioning
confidence: 99%