Ultimately Fast Accurate Summation

Rump, Siegfried M.

doi:10.1137/080738490

Cited by 68 publications

(72 citation statements)

References 26 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…The first claim is a direct consequence of [15,Lemma 3.6]. The second claim then follows from the fact that Algorithm 7 is Algorithm 6 applied to 2 p x.…”

Section: B a Special Case: Extracting One Bit Onlymentioning

confidence: 89%

“…In particular, for β = 2 and p = 53, this yields the "magic number" C = 2 52 + 2 51 mentioned before, together with the range |x| 2 51 . Also, for β = 10 and p = 16 (decimal64 IEEE-754 format), the largest range for nonnegative inputs is x 9·10 15 , obtained with C = 10 15 ; and the largest range for signed inputs is |x| 4.5 · 10 15 , obtained with C = 5.5 · 10 15 .…”

Section: Absolute Splittingsmentioning

confidence: 98%

“…For computing sign(x) · ufp(x), we can use the following algorithm, introduced by Rump in [15]. These solutions, however, raise the following issues.…”

Section: B a Special Case: Extracting One Bit Onlymentioning

confidence: 99%

See 2 more Smart Citations

On Various Ways to Split a Floating-Point Number

Jeannerod

Muller

Zimmermann

2018

2018 IEEE 25th Symposium on Computer Arithmetic (ARITH)

View full text Add to dashboard Cite

Abstract-We review several ways to split a floating-point number, that is, to decompose it into the exact sum of two floatingpoint numbers of smaller precision. All the methods considered here involve only a few IEEE floating-point operations, with rounding to nearest and including possibly the fused multiply-add (FMA). Applications range from the implementation of integer functions such as round and floor to the computation of suitable scaling factors aimed, for example, at avoiding spurious underflows and overflows when implementing functions such as the hypotenuse.

show abstract

“…The first claim is a direct consequence of [15,Lemma 3.6]. The second claim then follows from the fact that Algorithm 7 is Algorithm 6 applied to 2 p x.…”

Section: B a Special Case: Extracting One Bit Onlymentioning

confidence: 89%

Section: Absolute Splittingsmentioning

confidence: 98%

See 1 more Smart Citation

On Various Ways to Split a Floating-Point Number

Jeannerod

Muller

Zimmermann

2018

2018 IEEE 25th Symposium on Computer Arithmetic (ARITH)

View full text Add to dashboard Cite

show abstract

“…Algorithms AccSum [12] and FastAccSum [11] also rely on error-free transformations of the entry vector. They split the summands, relatively to max |p i | and n, such that their higher order parts are then exactly accumulated.…”

Section: Some Accurate or Reproducible Summation Algorithmsmentioning

confidence: 99%

Efficiency of Reproducible Level 1 BLAS

Chohra¹,

Langlois²,

Parello³

2016

Scientific Computing, Computer Arithmetic, and Validated Numerics

View full text Add to dashboard Cite

Abstract. Numerical reproducibility failures appear in massively parallel floating-point computations. One way to guarantee the numerical reproducibility is to extend the IEEE-754 correct rounding to larger computing sequences, as for instance for the BLAS libraries. Is the overcost for numerical reproducibility acceptable in practice? We present solutions and experiments for the level 1 BLAS and we conclude about the efficiency of these reproducible routines.

show abstract

“…However, as we mentioned already, this solution solely increases the accuracy of basic operations, but it neither reaches bit-accurate results nor is efficient for large precisions. Therefore, some recent works focus on hybrid solutions that store the sum as floating-point numbers of fixed exponent [26,27] without completely avoiding the previous drawbacks. These algorithms are mainly sequential and are not suitable for parallelization.…”

Section: Related Workmentioning

confidence: 99%

Numerical reproducibility for the parallel reduction on multi- and many-core architectures

et al. 2015

View full text Add to dashboard Cite

Abstract. On modern multi-core, many-core, and heterogeneous architectures, floating-point computations, especially reductions, may become non-deterministic and, therefore, non-reproducible mainly due to the nonassociativity of floating-point operations. We introduce an approach to compute the correctly rounded sums of large floating-point vectors accurately and efficiently, achieving deterministic results by construction. Our multi-level algorithm consists of two main stages: first, a filtering stage that relies on fast vectorized floating-point expansion; second, an accumulation stage based on superaccumulators in a high-radix carry-save representation. We present implementations on recent Intel desktop and server processors, Intel Xeon Phi co-processors, and both AMD and NVIDIA GPUs. We show that numerical reproducibility and bit-perfect accuracy can be achieved at no additional cost for large sums that have dynamic ranges of up to 90 orders of magnitude by leveraging arithmetic units that are left underused by standard reduction algorithms.

show abstract

Ultimately Fast Accurate Summation

Cited by 68 publications

References 26 publications

On Various Ways to Split a Floating-Point Number

On Various Ways to Split a Floating-Point Number

Efficiency of Reproducible Level 1 BLAS

Numerical reproducibility for the parallel reduction on multi- and many-core architectures

Contact Info

Product

Resources

About