2017 IEEE International Conference on Cluster Computing (CLUSTER) 2017
DOI: 10.1109/cluster.2017.115
|View full text |Cite
|
Sign up to set email alerts
|

LIKWID Monitoring Stack: A Flexible Framework Enabling Job Specific Performance monitoring for the masses

Abstract: Abstract-System monitoring is an established tool to measure the utilization and health of HPC systems. Usually system monitoring infrastructures make no connection to job information and do not utilize hardware performance monitoring (HPM) data. To increase the efficient use of HPC systems automatic and continuous performance monitoring of jobs is an essential component. It can help to identify pathological cases, provides instant performance feedback to the users, offers initial data to judge on the optimiza… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
9
0
1

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 23 publications
(10 citation statements)
references
References 11 publications
0
9
0
1
Order By: Relevance
“…We use the performance measure codes likwid (Hager et al, 2010;Röhl et al, 2017;Gruber et al, 2020) and perf (de Melo, 2010) to measure the overall floating-point operations (FLOP) and energy usage of the digital processor. For the Intel mobile processor, this provides a power consumption of P D = 10 W during computing.…”
Section: Energy and Power Consumptionmentioning
confidence: 99%
“…We use the performance measure codes likwid (Hager et al, 2010;Röhl et al, 2017;Gruber et al, 2020) and perf (de Melo, 2010) to measure the overall floating-point operations (FLOP) and energy usage of the digital processor. For the Intel mobile processor, this provides a power consumption of P D = 10 W during computing.…”
Section: Energy and Power Consumptionmentioning
confidence: 99%
“…Ball et al [27] first proposed a set of optimal algorithms for program profiling and instruction tracing to reduce the overhead of profilers. Hardware specific tools such as LIKWID [58,64] for x86 environments and MemProf [46] for NUMA systems make use of hardware counters allowing developers to explore optimization opportunities specific to the underlying system architecture. Linux perf [34] provides low-level system metrics by attaching to tracepoints, performance counters or probes similarly to eBPF [36].…”
Section: Profilersmentioning
confidence: 99%
“…Ball et al [27] first proposed a set of optimal algorithms for program profiling and instruction tracing to reduce the overhead of profilers. Hardware specific tools such as LIKWID [63,69] for x86 environments and MemProf [48] for NUMA systems make use of hardware counters allowing developers to explore optimization opportunities specific to the underlying system architecture. Linux perf [34] provides low-level system metrics by attaching to tracepoints, performance counters or probes similarly to eBPF [36].…”
Section: Profilersmentioning
confidence: 99%