2005
DOI: 10.1002/cpe.894

Practical performance portability in the Parallel Ocean Program (POP)

Cited by 73 publications (44 citation statements)
References 12 publications
“…For such evaluation, we experimented with (1) a complex parallel I/O benchmark, Flash I/O [1], which is closely modeled after the FLASH astrophysics code, and (2) a production-scale climate simulation application, the Parallel Ocean Program (POP) [7]. More details on these workloads are given below.…”
Section: Results (mentioning)
confidence: 99%
“…The model grid (192×128×20) generated internally is an equally-spaced latitude-longitude global grid with idealized land-masses. The x1 benchmark is set up to be identical to the actual production configuration of the Community Climate System Model [41]. The model grid (320×384×40), topography, initial state, equation of state coefficients and other benchmark specifications for x1 are available at the POP website [70].…”
Section: Parallel Ocean Program (mentioning)
confidence: 99%
“…Jones [11] describes the addition of a more flexible data structure that allows efficient execution of POP on both cache and vector processors. Wang [24] describes code modifications to POP that improve performance on a specific machine architecture.…”
Section: Background and Related Work (mentioning)
confidence: 99%
“…The horizontal dimensions are decomposed into logically rectangular two-dimensional (2D) blocks [11]. The computational mesh is distributed across multiple processors by placing one or more 2D blocks on each processor.…”
Section: Data Structures Within POP (mentioning)
confidence: 99%
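
The block decomposition described in the last statement can be illustrated with a minimal Python sketch. This is not POP's own code (POP is written in Fortran); the function names `decompose` and `assign_round_robin`, the 40×48 block size, the 16-processor count, and the round-robin placement are all assumptions chosen for illustration. Only the 320×384 horizontal grid size is taken from the x1 benchmark quoted above.

```python
def decompose(nx, ny, block_x, block_y):
    """Split an nx-by-ny horizontal grid into logically rectangular 2D blocks.

    Returns a list of (i_start, i_end, j_start, j_end) half-open index ranges.
    Edge blocks are truncated when the block size does not divide the grid.
    """
    blocks = []
    for j0 in range(0, ny, block_y):
        for i0 in range(0, nx, block_x):
            blocks.append((i0, min(i0 + block_x, nx), j0, min(j0 + block_y, ny)))
    return blocks


def assign_round_robin(blocks, num_procs):
    """Place one or more blocks on each processor.

    Round-robin placement is an assumption made for this sketch; a real
    code could use any other distribution of blocks over processors.
    """
    owners = {p: [] for p in range(num_procs)}
    for idx, block in enumerate(blocks):
        owners[idx % num_procs].append(block)
    return owners


# Hypothetical configuration: x1 horizontal grid, 40x48 blocks, 16 processors.
blocks = decompose(320, 384, 40, 48)
owners = assign_round_robin(blocks, 16)
for proc, owned in owners.items():
    print(proc, len(owned))
```

With these assumed sizes the grid divides evenly, so every block has the same shape and each processor receives the same number of blocks. Choosing a block size that does not divide the grid yields smaller blocks along the upper edges.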