2016
DOI: 10.1016/j.jocs.2016.04.014
|View full text |Cite
|
Sign up to set email alerts
|

Route to exascale: Novel mathematical methods, scalable algorithms and Computational Science skills

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
4
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 9 publications
0
4
0
Order By: Relevance
“…Ideally, a heterogeneous application will minimize communication between the GPU and CPU, which effectively minimizes latency costs. Minimizing latency in high-performance computing is one of the barriers to exascale computing that requires the implementation of novel techniques to improve [5].…”
Section: Introductionmentioning
confidence: 99%
“…Ideally, a heterogeneous application will minimize communication between the GPU and CPU, which effectively minimizes latency costs. Minimizing latency in high-performance computing is one of the barriers to exascale computing that requires the implementation of novel techniques to improve [5].…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, we must anticipate that multiscale simulations will become an increasingly important form of scientific application on high end computing resources, necessitating the development of sustainable and reusable solutions for such emerging applications, that is, generic algorithms for multiscale computing. As we move into the exascale performance era we need to drastically change the way we use HPC for simulation based sciences [25].…”
Section: Introductionmentioning
confidence: 99%
“…In many ways recent improvements in computational capacity have been sustained by the development of accelerators or co-processors, such as general purpose graphics processing units (GPGPUs) or the Intel Xeon Phi manycore processor, that augment the computational capabilities of the CPU. These devices have grown in power and complexity over the last two decades, leading to an increasing reliance on them for enabling efficient floating-point computation on HPC systems [1]. As these systems grow in complexity, computational power, and physical size, latency and bandwidth costs limit the performance of applications that require regular inter-node communicationsuch as CFD simulations.…”
Section: Introductionmentioning
confidence: 99%
“…to hide network and memory latency, have very high computation/communication overlap, have minimal communication, have fewer synchronization points", and "mathematical methods developed and corresponding scientific algorithms need to match these architectures [standard processors and GPGPUs] to extract the most performance. This includes different system-specific levels of parallelism as well as co-scheduling of computation" [1].…”
Section: Introductionmentioning
confidence: 99%