Proceedings of the 21st International Conference on Parallel Architectures and Compilation Techniques 2012
DOI: 10.1145/2370816.2370853
|View full text |Cite
|
Sign up to set email alerts
|

Complexity-effective multicore coherence

Abstract: Much of the complexity and overhead (directory, state bits, invalidations) of a typical directory coherence implementation stems from the effort to make it "invisible" even to the strongest memory consistency model. In this paper, we show that a much simpler, directory-less/broadcast-less, multicore coherence can outperform a directory protocol but without its complexity and overhead. Motivated by recent efforts to simplify coherence, we propose a hardware approach that does not require any application guidanc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
108
0
1

Year Published

2013
2013
2020
2020

Publication Types

Select...
2
2
2

Relationship

2
4

Authors

Journals

citations
Cited by 101 publications
(111 citation statements)
references
References 39 publications
2
108
0
1
Order By: Relevance
“…SC-for-DRF protocols rely on the guarantee that, during DRF regions, threads perform either private or read-only memory accesses [1], [2], [20]. A memory access is private if it targets a memory location that is only accessed by one thread during the execution of one DRF region; and is read-only if the location is not written within the DRF region.…”
Section: A Sequential Consistency For Drf Protocolsmentioning
confidence: 99%
See 4 more Smart Citations
“…SC-for-DRF protocols rely on the guarantee that, during DRF regions, threads perform either private or read-only memory accesses [1], [2], [20]. A memory access is private if it targets a memory location that is only accessed by one thread during the execution of one DRF region; and is read-only if the location is not written within the DRF region.…”
Section: A Sequential Consistency For Drf Protocolsmentioning
confidence: 99%
“…This excessive invalidation limits their performance [1], [2]. In contrast, SPEL reduces self-invalidation, by relying on the compiler to indicate the points of synchronization that indeed require self-invalidating cached data.…”
Section: A Sequential Consistency For Drf Protocolsmentioning
confidence: 99%
See 3 more Smart Citations