Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis 2019
DOI: 10.1145/3295500.3356214

ComDetective

Abstract: Inter-thread communication is a vital performance indicator in shared-memory systems. Prior works on identifying inter-thread communication employed hardware simulators or binary instrumentation and suffered from inaccuracy or high overheads, both space and time, making them impractical for production use. We propose ComDetective, which produces accurate communication matrices and introduces low runtime and memory overheads, thus making it practical for production use. ComDetective employs hardware …

Citations: cited by 10 publications (6 citation statements)
References: 25 publications
“…To compare the precise event sampling capabilities of Intel and AMD for a full-fledged profiling tool, we use an open-source tool, ComDetective [3], which captures the interthread communication within an application. The main idea of ComDetective is to use PMU samples and debug register traps to detect cache line transfers between threads.…”
Section: Full-fledged Profiling Tool
mentioning
confidence: 99%
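The idea quoted above (PMU samples plus debug register traps) can be illustrated with a toy model. The sketch below is not ComDetective's implementation: it replays a recorded access trace instead of using hardware, treats every `interval`-th access of a thread as a "sample", and uses a shared dictionary as a stand-in for armed watchpoints. The function name `communication_matrix`, the trace format, and the 64-byte line size are all assumptions made for illustration, and the model does not distinguish reads from writes.

```python
CACHE_LINE = 64  # bytes; assumed cache line size


def communication_matrix(trace, n_threads, interval):
    """Toy model: count cache line hand-offs between threads.

    trace     -- iterable of (thread_id, byte_address) pairs, in program order
    interval  -- every `interval`-th access of a thread counts as a sample
    """
    matrix = [[0] * n_threads for _ in range(n_threads)]
    seen = [0] * n_threads  # per-thread access counters (stand-in for a PMU counter)
    watched = {}            # cache line -> thread whose sample "armed a watchpoint"

    for tid, addr in trace:
        line = addr // CACHE_LINE

        # A different thread touching a watched line models a debug register trap.
        owner = watched.get(line)
        if owner is not None and owner != tid:
            matrix[owner][tid] += 1
            matrix[tid][owner] += 1
            del watched[line]

        # Every `interval`-th access is sampled and its line becomes watched.
        seen[tid] += 1
        if seen[tid] % interval == 0:
            watched[line] = tid

    return matrix


if __name__ == "__main__":
    # Hypothetical trace: two threads touching two shared lines and one private line.
    demo = [(0, 0), (1, 8), (1, 128), (0, 130), (0, 4096)]
    for row in communication_matrix(demo, n_threads=2, interval=1):
        print(row)
```

With a larger `interval`, fewer accesses are sampled and some hand-offs go unobserved, which is the accuracy versus overhead trade-off that a sampling interval controls in this model.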
“…We performed experiments to compare the accuracy, overheads, and stability of ComDetective under PEBS and IBS running on the Intel Cascade Lake and the AMD Zen 2 machines. The sampling interval that we use in each experiment is 500K, which is the default sampling interval in the experiments reported in [3].…”
Section: Full-fledged Profiling Tool
mentioning
confidence: 99%
“…To detect inter-thread communications and measure data locality in multi-threaded code with low overheads, we substantially extend COMDETECTIVE [14], a tool that captures inter-thread communications in the forms of communication matrices, and REUSETRACKER [15], another tool that measures reuse distance in multi-threaded applications, to work in AMD machines. We introduced these tools in our previous works [14, 15] and developed them to interface with Intel PEBS in sampling memory accesses. In this work, we extend them to leverage Instruction Based Sampling (IBS) facility when running on AMD machines to sample memory loads and stores in detecting communications and measuring reuse distance.…”
mentioning
confidence: 99%
“…In our experimental study, we firstly verify the accuracy of COMDETECTIVE+ and REUSETRACKER+ by using the microbenchmarks introduced in [14] and in [15] on an AMD multicore. We also perform sensitivity analysis to evaluate the impacts of different thread counts, sampling intervals, and debug register counts on the accuracy of our tools.…”
mentioning
confidence: 99%