Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming 2018
DOI: 10.1145/3178487.3178499
|View full text |Cite
|
Sign up to set email alerts
|

Featherlight on-the-fly false-sharing detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
20
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
1

Relationship

3
5

Authors

Journals

citations
Cited by 19 publications
(22 citation statements)
references
References 30 publications
1
20
0
1
Order By: Relevance
“…Second, NumaPerf detects more performance issues than the combination of all existing NUMA profilers [9,13,17,23,24,27,29,32]. The performance issues that cannot be detected by existing NUMA profilers are highlighted with a checkmark in the last column of the table, although some can be detected by other tools (but not NUMA tools), such as cache false/true sharing issues [7,12,[20][21][22]. This comparison with existing NUMA profilers is based on the methodology, instead of based on the results of specific tools.…”
Section: Effectivenessmentioning
confidence: 99%
See 1 more Smart Citation
“…Second, NumaPerf detects more performance issues than the combination of all existing NUMA profilers [9,13,17,23,24,27,29,32]. The performance issues that cannot be detected by existing NUMA profilers are highlighted with a checkmark in the last column of the table, although some can be detected by other tools (but not NUMA tools), such as cache false/true sharing issues [7,12,[20][21][22]. This comparison with existing NUMA profilers is based on the methodology, instead of based on the results of specific tools.…”
Section: Effectivenessmentioning
confidence: 99%
“…It aims for identifying the performance issues for the hybrid DRAM-HBM architecture, but not the NUMA architecture, and has a higher overhead than NumaPerf. Some tools focus on the detection of false/true sharing issues [7,12,[20][21][22], but skipping other NUMA issues.…”
Section: Other Related Toolsmentioning
confidence: 99%
“…Even though it has no runtime overhead, it cannot capture all the program objects or their references as it relies on static analysis. Chabbi et al [7] employ PMUs and debug registers to detect false sharing but do not generalize it for inter-thread communication matrices; furthermore, their technique does not quantify communication volume even for false sharing. Even though these tools can count memory access events, they do not associate these events to threads and are not used in generating communication pattern among threads.…”
Section: Related Workmentioning
confidence: 99%
“…Inter-thread communication is an important performance indicator in shared-memory multi-core systems [38]. Thread communication information offers valuable insights: it divulges, to an extent, the inner workings of the program without having to examine the code meticulously; it can be used for identifying possible sources of communication-related performance overhead in parallel applications [7,33]; it can also be used for verifying the multicore hardware design. Therefore, identifying which groups of threads communicate in what volume and their quantitative comparison against expectations offer avenues to tune software for high performance.…”
Section: Introductionmentioning
confidence: 99%
“…Many applications use calling context to attain better understanding of program behavior. Indeed, the ability to inspect call stack context is an essential part of a wide variety of tools for debugging [4,9,12,13,17,18,24,31], testing [6,20,33], and analyzing [2,11,28,29,38,41,42] modern software.…”
Section: Introductionmentioning
confidence: 99%