2009
DOI: 10.1137/080738490

Ultimately Fast Accurate Summation

Abstract: We present two new algorithms, FastAccSum and FastPrecSum, one to compute a faithful rounding of the sum of floating-point numbers and the other for a result "as if" computed in K-fold precision. Faithful rounding means the computed result either is one of the immediate floating-point neighbors of the exact result or is equal to the exact sum if this is a floating-point number. The algorithms are based on our previous algorithms AccSum and PrecSum and improve them by up to 25%. The first algorithm ada…

Citations: cited by 68 publications (72 citation statements)
References: 26 publications
“…The first claim is a direct consequence of [15, Lemma 3.6]. The second claim then follows from the fact that Algorithm 7 is Algorithm 6 applied to $2^p x$.…”
Section: B a Special Case: Extracting One Bit Only | Citation type: mentioning
confidence: 89%
“…In particular, for $\beta = 2$ and $p = 53$, this yields the "magic number" $C = 2^{52} + 2^{51}$ mentioned before, together with the range $|x| \le 2^{51}$. Also, for $\beta = 10$ and $p = 16$ (decimal64 IEEE-754 format), the largest range for nonnegative inputs is $x \le 9 \cdot 10^{15}$, obtained with $C = 10^{15}$; and the largest range for signed inputs is $|x| \le 4.5 \cdot 10^{15}$, obtained with $C = 5.5 \cdot 10^{15}$.…”
Section: Absolute Splittings | Citation type: mentioning
confidence: 98%
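
The binary case in the statement above can be illustrated with a minimal Python sketch. It assumes IEEE-754 binary64 arithmetic with round-to-nearest, ties-to-even (what Python floats provide on common platforms), and the function name `round_to_integer` is hypothetical, not taken from the cited work. Adding and then subtracting $C = 2^{52} + 2^{51}$ pushes $x$ into a range where the spacing of floating-point numbers is 1, so the addition itself performs the rounding.

```python
# Magic constant for binary64 (beta = 2, p = 53): C = 2^52 + 2^51.
C = 2.0**52 + 2.0**51
LIMIT = 2.0**51  # valid input range: |x| <= 2^51


def round_to_integer(x: float) -> float:
    """Round x to the nearest integer (ties to even) using (x + C) - C.

    Hypothetical helper for illustration; only valid for |x| <= 2^51.
    """
    if abs(x) > LIMIT:
        raise ValueError("input outside the range |x| <= 2**51")
    # x + C lies in [2^52, 2^53], where consecutive doubles are 1 apart,
    # so the addition rounds to an integer; the subtraction is then exact.
    return (x + C) - C


if __name__ == "__main__":
    for value in (0.3, 2.5, 3.5, -1.5, 123456.789):
        print(value, "->", round_to_integer(value))
```

Because the rounding is done by the hardware addition, ties follow the ties-to-even rule, which matches the behavior of Python's built-in `round` on the same inputs.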
“…Algorithms AccSum [12] and FastAccSum [11] also rely on error-free transformations of the entry vector. They split the summands, relatively to $\max |p_i|$ and $n$, such that their higher order parts are then exactly accumulated.…”
Section: Some Accurate or Reproducible Summation Algorithms | Citation type: mentioning
confidence: 99%
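
The splitting idea described in this statement can be sketched as follows. This is a simplified illustration in the spirit of that description, not the published AccSum or FastAccSum algorithm: the function name `extract_high_parts` and the exact choice of the power of two `sigma` are assumptions made for the example, and binary64 round-to-nearest arithmetic is assumed. Each summand is split against `sigma`, which is chosen from $\max |p_i|$ and $n$, so that the high-order parts all lie on a common grid and their floating-point sum incurs no rounding error.

```python
import math
from fractions import Fraction


def extract_high_parts(p):
    """Split each p[i] into a high part q[i] and a low part p[i] - q[i].

    Returns (tau, low), where tau is the floating-point sum of the high
    parts (accumulated without rounding error) and low holds the low parts.
    Illustrative sketch only; sigma is an assumed, conservative choice.
    """
    n = len(p)
    mu = max((abs(x) for x in p), default=0.0)
    if mu == 0.0:
        return 0.0, list(p)
    M = (n + 1).bit_length()           # ceil(log2(n + 2)), so n + 2 <= 2^M
    _, e = math.frexp(mu)              # mu < 2^e
    sigma = math.ldexp(1.0, M + e)     # power of two with 2^M * mu < sigma
    tau = 0.0
    low = []
    for x in p:
        q = (sigma + x) - sigma        # high part: x rounded to sigma's grid
        low.append(x - q)              # low part, computed exactly
        tau += q                       # exact: |sum| < sigma on a common grid
    return tau, low


if __name__ == "__main__":
    data = [0.1, 0.2, 0.3, 1e10, -7.5]
    tau, low = extract_high_parts(data)
    exact = sum(Fraction(x) for x in data)
    # The transformation is error-free: tau plus the low parts equals the sum.
    print(Fraction(tau) + sum(Fraction(x) for x in low) == exact)
```

The check with `Fraction` confirms that no information is lost by the split; a full summation algorithm would then repeat the extraction on the low parts or sum them directly, depending on the accuracy required.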
“…However, as we mentioned already, this solution solely increases the accuracy of basic operations, but it neither reaches bit-accurate results nor is efficient for large precisions. Therefore, some recent works focus on hybrid solutions that store the sum as floating-point numbers of fixed exponent [26,27] without completely avoiding the previous drawbacks. These algorithms are mainly sequential and are not suitable for parallelization.…”
Section: Related Work | Citation type: mentioning
confidence: 99%