2015 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) 2015
DOI: 10.1109/cgo.2015.7054187
|View full text |Cite
|
Sign up to set email alerts
|

On performance debugging of unnecessary lock contentions on multicore processors: A replay-based approach

Abstract: Locks have been widely used as an effective synchronization mechanism among processes and threads. However, we observe that, a large number of false inter-thread dependencies (i.e., unnecessary lock contention) exist during the execution on multicore processors, which incurs significant performance overhead. This paper, therefore, presents a performance debugging framework, PERF-PLAY, to facilitate a comprehensive and in-depth understanding of the performance impact of unnecessary lock contentions. The core te… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 9 publications
(5 citation statements)
references
References 23 publications
0
5
0
Order By: Relevance
“…Last but not least, SyncPerf contributes with an additional tool to help diagnose some specific type of performance issues. Zheng et al [44] replay traces to identify lock contentions and calculate potential improvement that can be achieved by reducing the contentions. HaLock [21] utilizes a hardware memory tracing tool to store profiling data, in order to avoid memory interference with the applications.…”
Section: Related Workmentioning
confidence: 99%
“…Last but not least, SyncPerf contributes with an additional tool to help diagnose some specific type of performance issues. Zheng et al [44] replay traces to identify lock contentions and calculate potential improvement that can be achieved by reducing the contentions. HaLock [21] utilizes a hardware memory tracing tool to store profiling data, in order to avoid memory interference with the applications.…”
Section: Related Workmentioning
confidence: 99%
“…We use its Pin-based version. Pin 12) is an underlying framework, which enables programmers to perform the program analysis at runtime without any source codes.…”
Section: Methodsmentioning
confidence: 99%
“…Therefore it is necessary to know the explicit characteristics of lock-related thread interleavings (e.g., lock interleavings [12]). We first take the several typical lock interleavings into account for the demand of practical detection, and then classify them into four categories.…”
Section: Fig 1 An Execution Snapshot In Different Ordersmentioning
confidence: 99%
“…Performance debugging techniques are applied to reduce the scope of code locations and the dependency on expert knowledge. Such techniques include replay, 38 profiling, 39,40 and mining. 3 Performance Estimation includes research in building and applying performance prediction models.…”
Section: Rq11: Research Categories and Distributionmentioning
confidence: 99%