2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture (ISCA) 2018
DOI: 10.1109/isca.2018.00028

Non-Speculative Store Coalescing in Total Store Order

Abstract: We present a non-speculative solution for a coalescing store buffer in total store order (TSO) consistency. Coalescing violates TSO with respect to both conflicting loads and conflicting stores, if partial state is exposed to the memory system. Proposed solutions for coalescing in TSO resort to speculation-and-rollback or centralized arbitration to guarantee atomicity for the set of stores whose order is affected by coalescing. These solutions can suffer from scalability, complexity, resource-conflict deadlock…
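The abstract's claim that coalescing can violate TSO when partial state is exposed can be illustrated with a small model. The following is a minimal sketch, not the paper's mechanism: it assumes a toy memory with two addresses (`x`, `y`), and the function names `drain_fifo`, `drain_coalescing`, and `visible_states` are hypothetical helpers introduced only for this illustration.

```python
from collections import OrderedDict

def drain_fifo(stores):
    """Non-coalescing store buffer: every store drains in program order."""
    return list(stores)

def drain_coalescing(stores):
    """Toy coalescing buffer (assumption): a later store to an already
    buffered address merges into that entry, keeping the entry's slot."""
    buf = OrderedDict()
    for addr, val in stores:
        buf[addr] = val  # overwrites the value; position of the key is unchanged
    return list(buf.items())

def visible_states(writes):
    """Memory states another core could observe if the writes are
    exposed to the memory system one at a time (partial state)."""
    mem = {"x": 0, "y": 0}
    states = [dict(mem)]
    for addr, val in writes:
        mem[addr] = val
        states.append(dict(mem))
    return states

# Program order on one core: x = 1; y = 1; x = 2
program = [("x", 1), ("y", 1), ("x", 2)]

fifo_states = visible_states(drain_fifo(program))
coal_states = visible_states(drain_coalescing(program))

# Under TSO, x == 2 implies y == 1 is already visible,
# so the state {x: 2, y: 0} must never be observable.
forbidden = {"x": 2, "y": 0}
print(forbidden in fifo_states)  # False
print(forbidden in coal_states)  # True
```

In this model the coalesced buffer drains `x = 2` before `y = 1`, so an observer can see the newest value of `x` while `y` is still stale. Making the affected stores visible together as one atomic group, as the abstract describes, would remove that intermediate state.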

Citations: cited by 11 publications (9 citation statements)
References: 29 publications
“…Deadlock avoidance has been also studied in proposals allow loads [43,45] and stores [44] to be non-speculatively reordered while offering an ordered behaviour to the programmer. Writers-Block [43] resolves the deadlocks by guaranteeing the loads always to progress through tear-off copies.…”
Section: Related Work (mentioning)
confidence: 99%
“…Writers-Block [43] resolves the deadlocks by guaranteeing the loads always to progress through tear-off copies. Non-speculative store coalescing [44] relies on a predetermined order of writes to solve deadlocks. Applying a predetermined order to Free atomics could lead to a reduction in timeouts, when a predetermined order in performing the stores older than the Free atomic can be guaranteed.…”
Section: Related Work (mentioning)
confidence: 99%
“…Ros and Kaxiras [28] propose coalescing stores in the store buffer and avoid breaking the store order by performing stores in atomic groups. In contrast to our approach where DRF stores can perform completely out-of-order, stores in an atomic group perform following a globally defined order.…”
Section: Related Work (mentioning)
confidence: 99%
“…Previous proposals alleviate the store buffer bottleneck by performing the stores out-of-order while still complying with TSO [8,15,28,33,34]. However, they either require complex hardware structures and speculative writes [34], or they only enable limited reordering [28,33].…”
Section: Introduction (mentioning)
confidence: 99%