2020 57th ACM/IEEE Design Automation Conference (DAC) 2020
DOI: 10.1109/dac18072.2020.9218661
|View full text |Cite
|
Sign up to set email alerts
|

ATUNs: Modular and Scalable Support for Atomic Operations in a Shared Memory Multiprocessor

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
1

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 19 publications
0
3
0
Order By: Relevance
“…Several works have widely discussed the implementation of near and far AMOs [10,24,41]. Other works have tried to improve the performance of far AMOs by (i) reducing the latency between consumer and producer through data forwarding [24,68], or (ii) speeding-up lock exchange [3,20].…”
Section: A Near and Far Amo Architecturesmentioning
confidence: 99%
See 1 more Smart Citation
“…Several works have widely discussed the implementation of near and far AMOs [10,24,41]. Other works have tried to improve the performance of far AMOs by (i) reducing the latency between consumer and producer through data forwarding [24,68], or (ii) speeding-up lock exchange [3,20].…”
Section: A Near and Far Amo Architecturesmentioning
confidence: 99%
“…Most proposals focus on using dedicated hardware (accelerators, networks, new instructions) to speed-up synchronization primitives [1,6,25,41,47,55,61,62,69]. However, these proposals incur large area overheads.…”
Section: Related Workmentioning
confidence: 99%
“…We assume the most general case where the concurrent aggregation of multiple packets in a shared aggregation buffer is executed in a critical section. Indeed, even if in principle we could use atomic operations [75], the computation would still be affected by severe contention and/or performance overhead [76,77]. Moreover, by assuming a critical section we cover the cases where the user function cannot be executed by using atomic operations, and the aggregation of sparse data that, as we will show in Section 7, requires more complex processing and in most cases needs to be executed anyhow in a mutually exclusive way.…”
Section: Aggregation Using a Single Buffermentioning
confidence: 99%

Flare: Flexible In-Network Allreduce

De Sensi,
Di Girolamo,
Ashkboos
et al. 2021
Preprint
Self Cite