Abstract. Questions about whether numerical simulations are reproducible have been raised in several sensitive applications. Failures of numerical reproducibility mainly come from the finite precision of computer arithmetic: the results of floating-point computations depend on the arithmetic precision and on the order of the arithmetic operations. Massively parallel HPC platforms that combine, for instance, many-core CPUs and GPUs clearly modify these two parameters, even from run to run on a given computing platform. How can such computed results be trusted? This paper presents how three classic approaches in computer arithmetic may provide some first steps towards more numerical reproducibility.
Numerical reproducibility: context and motivations

As computing power increases towards exascale, more complex and larger-scale numerical simulations are performed in various domains. Questions about whether such simulated results are reproducible have been reported more or less recently, e.g. in energy science [1], dynamic weather forecasting [2], atomic or molecular dynamics [3,4], and fluid dynamics [5]. This paper focuses on numerical non-reproducibility due to the finite precision of computer arithmetic; see [6] for other issues regarding "reproducible research" in computational mathematics.

The following example illustrates a typical failure of numerical reproducibility. In the energy field, power system state simulation aims to compute in "real time" a reliable estimate of the bus voltages for a given power grid topology and a set of on-line measurements. Numerically speaking, a large and sparse linear system is solved at every iteration of a Newton-Raphson process. The core computation is a sparse matrix-vector product that is automatically parallelised by the computing environment. The authors of [1] exhibit a significant variability (up to 25% relative difference) between two runs on a massively multithreaded system. The culprit? Here, as in the previously cited references: non-deterministic sums.

Floating-point summation is not associative. Parallelism introduces non-deterministic events from run to run, even when a single binary is run on a given computing platform. The order of communications, the number of computing units (threads, processors) and the associated data placement may vary, and hence so do the parallel partial sums; the first sketch at the end of this section makes this effect concrete. Even sequential executions that comply with the IEEE-754 floating-point arithmetic standard [7] remain numerically very sensitive to many features: low-level arithmetic unit properties (variable precision registers, fused operators), compiler optimizations, language flaws or library versions reduce numerical repeatability and numerical portability [8]; the second sketch below illustrates the effect of fused operators.

* The authors thank Cl.-P. Jeannerod (INRIA) for his significant contribution, I. Said (LIP6) for his help with the numerical experiments related to the acoustic wave equation, and the GT Arith, GDR Informatique Mathématique, for its support.
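To make the non-associativity of floating-point summation concrete, here is a minimal sketch, not taken from the cited experiments: the data are hypothetical values chosen only to expose the rounding, and each operation is assumed to be rounded to IEEE-754 binary64 (the default double arithmetic on current x86-64 platforms). The same four numbers are summed left to right, as a serial loop would do, and then as two partial sums combined at the end, as a two-thread reduction would do.

```c
#include <stdio.h>

int main(void) {
    /* Hypothetical data: the small terms are absorbed or cancelled
     * depending on how the additions are grouped. */
    double x[4] = {1.0, 1e16, -1e16, 1.0};

    /* Sequential left-to-right order: 1.0 + 1e16 rounds back to 1e16
     * (the spacing of doubles near 1e16 is 2), the large terms then
     * cancel, and only the last 1.0 survives. */
    double serial = ((x[0] + x[1]) + x[2]) + x[3];

    /* Two partial sums combined at the end, as a two-thread reduction
     * would do: both small terms are absorbed before the cancellation,
     * so they are lost. */
    double chunked = (x[0] + x[1]) + (x[2] + x[3]);

    printf("exact sum              : 2.0\n");
    printf("left-to-right grouping : %.1f\n", serial);   /* prints 1.0 */
    printf("two partial sums       : %.1f\n", chunked);  /* prints 0.0 */
    return 0;
}
```

Both computed results differ from the exact sum 2.0, and a reduction over a different number of chunks could produce yet another value: this is the run-to-run variability described above, in miniature.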
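Fused operators and compiler options affect even purely sequential results. The following sketch, again with hypothetical values, compares a multiplication followed by a subtraction with a fused multiply-add: the fused operation performs a single rounding and exposes the rounding error of the product, so a compiler that contracts the plain expression into an FMA (or hardware that provides one) silently changes the result of the same source code.

```c
#include <math.h>
#include <stdio.h>

int main(void) {
    double a = 1.0 + ldexp(1.0, -30);   /* a = 1 + 2^-30 */

    /* The exact square 1 + 2^-29 + 2^-60 needs 61 significand bits,
     * so the binary64 product is rounded. */
    double p = a * a;

    /* fma() computes a*a - p exactly and rounds once: it recovers the
     * rounding error of the product, 2^-60 here. */
    double fused = fma(a, a, -p);

    /* With two separately rounded operations the same expression is
     * exactly zero -- unless the compiler contracts it into an FMA
     * (e.g. -ffp-contract=fast), in which case it matches the line
     * above.  Same source, different results. */
    double separate = a * a - p;

    printf("fused    : %.17g\n", fused);     /* 2^-60, about 8.7e-19 */
    printf("separate : %.17g\n", separate);  /* typically 0 */
    return 0;
}
```

Compile with, for instance, `cc -std=c99 fma_demo.c -lm` (the file name is arbitrary); toggling the contraction option of the compiler is enough to change the second printed value.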