2007
DOI: 10.1109/arith.2007.25
|View full text |Cite
|
Sign up to set email alerts
|

Optimistic Parallelization of Floating-Point Accumulation

Abstract: Abstract-Floating-point arithmetic is notoriously nonassociative due to the limited precision representation which demands intermediate values be rounded to fit in the available precision. The resulting cyclic dependency in floating-point accumulation inhibits parallelization of the computation, including efficient use of pipelining. In practice, however, we observe that floating-point operations are "mostly" associative. This observation can be exploited to parallelize floating-point accumulation using a form… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
26
0

Year Published

2009
2009
2016
2016

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 30 publications
(26 citation statements)
references
References 13 publications
0
26
0
Order By: Relevance
“…Kapre and DeHon [8] developed a speculative approach to parallel floating-point accumulation that maintains compliance. Operands are added two at a time, in parallel, using IEEE-754-compliant operators under the optimistic assumption that all operators are associative for the given inputs.…”
Section: Resultsmentioning
confidence: 99%
“…Kapre and DeHon [8] developed a speculative approach to parallel floating-point accumulation that maintains compliance. Operands are added two at a time, in parallel, using IEEE-754-compliant operators under the optimistic assumption that all operators are associative for the given inputs.…”
Section: Resultsmentioning
confidence: 99%
“…There are many multiplier architectures using different multiplication techniques for example, Array multiplier, redundant binary architecture and many more architectures using tree structures but they have problem of larger delay [1][2][3][4][5]. Different algorithms are also there that perform floating point multiplication like BOOTH algorithm, algorithms based on ancient mathematics [11], and many more. Ali Akoglu introduced an algorithm using ancient Vedic mathematics rules for fast multiplication which generate partial products concurrently.…”
Section: Related Workmentioning
confidence: 99%
“…As shown in Figure-1. Then accumulation is done of partial products and then finally partial products are added using binary tree [5], [7], [8], [11]. Accumulation is done by using Wallace tree as shown in Figure2.…”
Section: Vedic Multiplicationmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, the fattree style designs, which required presorting and preordering before input the data (Kapre and DeHon, 2007), will become extremely difficult when the matrix is very large and sparse. This challenge brings current applications and technology trends to motivate a paradigm shift in on-chip interconnect architectures from bus-based point-to-point network to packet-based switch network.…”
Section: Introductionmentioning
confidence: 99%