2011 23rd International Symposium on Computer Architecture and High Performance Computing 2011
DOI: 10.1109/sbac-pad.2011.18
|View full text |Cite
|
Sign up to set email alerts
|

Classification and Elimination of Conflicts in Hardware Transactional Memory Systems

Abstract: This paper analyzes the sources of performance losses in hardware transactional memory and investigates techniques to reduce the losses. It dissects the root causes of data conflicts in hardware transactional memory systems (HTM) into four classes of conflicts: true sharing, false sharing, silent store, and write-write conflicts. These conflicts can cause performance and energy losses due to aborts and extra communication. To quantify losses, the paper first proposes the 5C cache-miss classification model that… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
4
3

Relationship

2
5

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 28 publications
0
4
0
Order By: Relevance
“…The FASTM cache allows speculatively modified lines to be gang-invalidated on abort, effectively speeding up the rollback phase and avoiding software handler intervention. However, this approach may slow down the re-execution of aborted transactions due to the encounter of additional L1 misses for such contaminated lines [Waliullah and Stenstrom 2011], as the L1 is not able to hold both clean and speculative copies of the data. For the experiments with FASTM, we did not run any SMT configurations, but simply set up the simulation to use 16 single-threaded cores.…”
Section: Htm Systemsmentioning
confidence: 99%
See 1 more Smart Citation
“…The FASTM cache allows speculatively modified lines to be gang-invalidated on abort, effectively speeding up the rollback phase and avoiding software handler intervention. However, this approach may slow down the re-execution of aborted transactions due to the encounter of additional L1 misses for such contaminated lines [Waliullah and Stenstrom 2011], as the L1 is not able to hold both clean and speculative copies of the data. For the experiments with FASTM, we did not run any SMT configurations, but simply set up the simulation to use 16 single-threaded cores.…”
Section: Htm Systemsmentioning
confidence: 99%
“…The Base configuration, which essentially applies simple optimizations to the LogTM design to mitigate these overheads (like avoiding trapping to software when the undo log is empty) performs fairly competitively to the three better performing configurations. The SCIN-cache designs perform noticeably better than FASTM in high contention, due to their improved cache performance: By retaining a clean copy of speculatively updated lines in L1 cache, they avoid contamination misses [Waliullah and Stenstrom 2011] when transactions reexecute. This is visible in Table XII, whose first four data columns show the average number of cache misses per transaction for the systems plotted in Figure 9.…”
Section: Experiments 1: Impact Of Simultaneous Multiversioning On Tm Omentioning
confidence: 99%
“…We refer to such an event as an abort miss. Such misses have also been referred to as contamination misses [18]. Workloads with large write sets and high contention over small amounts of shared data would experience the greatest drop in private cache hit rates.…”
Section: A Lazy Htmsmentioning
confidence: 99%
“…Keeping such writes away from the cache improves performance by reducing the number of contamination misses [18] -misses due to invalidation of speculatively updated lines on aborts -and redundant permission downgrades from exclusive or dirty state to shared state (which we term downgrade misses) that allow detection of conflicts. Moreover, this also mitigates the effect of false writerwriter conflicts.…”
Section: Conceptual Overviewmentioning
confidence: 99%