2014
DOI: 10.1007/978-3-642-55224-3_54
|View full text |Cite
|
Sign up to set email alerts
|

Using Intel Xeon Phi Coprocessor to Accelerate Computations in MPDATA Algorithm

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
20
0

Year Published

2014
2014
2018
2018

Publication Types

Select...
4
3

Relationship

4
3

Authors

Journals

citations
Cited by 23 publications
(20 citation statements)
references
References 6 publications
0
20
0
Order By: Relevance
“…However, after empirical performance and programmability studies performed by many researchers [15,42,47,48] it is clear that to achieve high performance, Intel Xeon Phi still needs help from programmers, and that merely relying on compilers with traditional programming models is still far from reality. In fact, high degree of parallelism of Xeon Phi accelerators is best suited to applications that are structured to use the parallelism.…”
Section: Intel Mic Architecturementioning
confidence: 99%
“…However, after empirical performance and programmability studies performed by many researchers [15,42,47,48] it is clear that to achieve high performance, Intel Xeon Phi still needs help from programmers, and that merely relying on compilers with traditional programming models is still far from reality. In fact, high degree of parallelism of Xeon Phi accelerators is best suited to applications that are structured to use the parallelism.…”
Section: Intel Mic Architecturementioning
confidence: 99%
“…This method combines the island-of-core strategy with the (3+1)D hier-archical decomposition proposed previously in [19,21]. The efficiency of the method is evaluated for the implementation of MPDATA on the SGI UV 2000 and UV 3000 servers, as well as 2-and 4-socket ccNUMA platforms based on various Intel CPU microarchitectures, including Skylake, Broadwell, and Haswell.…”
Section: Introductionmentioning
confidence: 99%
“…To alleviate the memory-bound nature of MPDATA, we proposed [19][20][21] a new strategy of workload distribution for heterogeneous stencils computations. The main aim was to better exploit the cache hierarchy by moving the bulk of data traffic from the main memory to the cache hierarchy.…”
Section: Introduction To Parallelization Of Mpdata Application On Shamentioning
confidence: 99%
“…This method is based on the partitioning of available cores/threads into independent work teams. This paper is an extended version of work presented in [1,12]. This study not only proposes modifications in the (3 + 1)D decomposition of MPDATA, but also introduces the notion of work team partitioning.…”
Section: Introductionmentioning
confidence: 99%
“…As a result, modern processor architectures are very unbalanced concerning the relation of theoretical peak performance versus memory bandwidth [1]. One of the main problems of porting codes to the latest computing platforms is to take the full advantage of memory hierarchies.…”
Section: Introductionmentioning
confidence: 99%