2015
DOI: 10.1016/j.parco.2015.09.001
|View full text |Cite
|
Sign up to set email alerts
|

Numerical reproducibility for the parallel reduction on multi- and many-core architectures

Abstract: Abstract. On modern multi-core, many-core, and heterogeneous architectures, floating-point computations, especially reductions, may become non-deterministic and, therefore, non-reproducible mainly due to the nonassociativity of floating-point operations. We introduce an approach to compute the correctly rounded sums of large floating-point vectors accurately and efficiently, achieving deterministic results by construction. Our multi-level algorithm consists of two main stages: first, a filtering stage that rel… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
67
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
4
3
2

Relationship

4
5

Authors

Journals

citations
Cited by 52 publications
(67 citation statements)
references
References 18 publications
0
67
0
Order By: Relevance
“…On the other side, we want to preserve every bit of information before the final rounding to the desired format to assure correct-rounding and, therefore, reproducibility. To accomplish our goal, we merge together FPE and long accumulators, tune them, and efficiently implement them on various architectures, including conventional CPUs, Nvidia and AMD GPUs, and Intel Xeon Phi co-processors (for details we refer to Reference [13]).…”
Section: Exblas: Accurate and Bitwise Reproducible Basic Linear Algebmentioning
confidence: 99%
“…On the other side, we want to preserve every bit of information before the final rounding to the desired format to assure correct-rounding and, therefore, reproducibility. To accomplish our goal, we merge together FPE and long accumulators, tune them, and efficiently implement them on various architectures, including conventional CPUs, Nvidia and AMD GPUs, and Intel Xeon Phi co-processors (for details we refer to Reference [13]).…”
Section: Exblas: Accurate and Bitwise Reproducible Basic Linear Algebmentioning
confidence: 99%
“…There are two other academic efforts to guarantee reproducibility. The first [7], which is built upon [33], Alternatively, Collange et al [6] proposed a multi-level approach to compute the reproducible summation.…”
Section: Related Workmentioning
confidence: 99%
“…Significance arithmetics is regaining interest thanks to the Unum proposal [3] or indirectly through numerous problems encountered with exascale computers and the lack of confidence in numerical results [4]. If the Unum system is based on real problems, the proposed solution is subject to criticism for numerous reasons as pointed out by W. Kahan [5].…”
Section: Introductionmentioning
confidence: 99%
“…If the Unum system is based on real problems, the proposed solution is subject to criticism for numerous reasons as pointed out by W. Kahan [5]. On the other hand, indirect solutions based on software solutions to detect cancellation [6], [7], or avoiding rounding errors [4] are not meant to be efficient nor effective for real time execution.…”
Section: Introductionmentioning
confidence: 99%