Accelerators for Technical Computing: Is It Worth the Pain? A TCO Perspective

Wienke, Sandra; Mey, Dieter an; Müller, Matthias S.

doi:10.1007/978-3-642-38750-0_25

Cited by 12 publications

(2 citation statements)

References 9 publications

“…In these simulations, a reduction in memory consumption especially affects acquisition costs. However, an analysis considering total costs of ownership (as described by Wienke et al, 2013) will reveal a detailed insight.…”

Section: Discussion (mentioning)

confidence: 99%

Towards an accurate simulation of the crystallisation process in injection moulded plastic components by hybrid parallelisation

Wienke, Spekowius, Dammer, et al. (2013)

The International Journal of High Performance Computing Applica

Self Cite

The simulation of the crystallisation process during the injection moulding process of plastic components is time consuming, resulting in the ability to simulate only small parts of a component. To remove this constraint and enable the simulation of complex parts, the computing power of high-performance computers is demanded. A further design objective is high scalability in performance and memory consumption on today’s and future high-performance computing architectures to allow precise predictions of global part properties. In this work, we present a simulation tool for the crystallisation process and the parallelisation of the tool by a hybrid MPI-Pthreads approach that meets this design objective. We verify the performance and memory consumption of our parallelisation using a large simulation area of a realistic plastic component as a case study and can further predict that entire parts will also be calculable.

“…• our example GPU, Tesla K20m, turned out to be the fastest processor in the comparison; the observations made by several authors [39,15,36] show that Xeon Phi processors applied to different scientific computing algorithms, although usually faster than standard Xeon CPUs, are usually slower than the recent HPC targeted GPUs…”

Section: Computational Experiments (mentioning)

confidence: 91%

Finite element numerical integration for first order approximations on multi-core architectures

Banaś, Krużel, Bielański (2015)

Preprint

The paper presents investigations on the implementation and performance of the finite element numerical integration algorithm for first order approximations and three processor architectures, popular in scientific computing, classical CPU, Intel Xeon Phi and NVIDIA Kepler GPU. A unifying programming model and portable OpenCL implementation is considered for all architectures. Variations of the algorithm due to different problems solved and different element types are investigated and several optimizations aimed at proper optimization and mapping of the algorithm to computer architectures are demonstrated. Performance models of execution are developed for different processors and tested in practical experiments. The results show the varying levels of performance for different architectures, but indicate that the algorithm can be effectively ported to all of them. The general conclusion is that the finite element numerical integration can achieve sufficient performance on different multi-and many-core architectures and should not become a performance bottleneck for finite element simulation codes.

OpenCL Performance Portability for Xeon Phi Coprocessor and NVIDIA GPUs: A Case Study of Finite Element Numerical Integration

Banaś, Krużel (2014)

Lecture Notes in Computer Science
