Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244)
DOI: 10.1109/hpdc.1998.709980
|View full text |Cite
|
Sign up to set email alerts
|

The NetLogger methodology for high performance distributed systems performance analysis

Abstract: We describe a methodology that enables the real-time diagnosis of performance problems in complex high-performance distributed systems. The IntroductionDevelopers of high-speed network-based distributed systems often observe performance problems such as unexpectedly low network throughput or high latency. The reasons for the poor performance can be manifold and are frequently not obvious. It is often difficult to track down performance problems because of the complex interaction between the many distributed… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
22
0

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 71 publications
(22 citation statements)
references
References 5 publications
0
22
0
Order By: Relevance
“…generating log files that incorporate dependencies between entities, the start and stop times of activities, and the inputs/outputs of activities, has been a topic of interest in many programming languages research [16]. There are many infrastructures, specifically in web service adaptors, that can be repurposed for collecting provenance [17].…”
Section: Related Workmentioning
confidence: 99%
“…generating log files that incorporate dependencies between entities, the start and stop times of activities, and the inputs/outputs of activities, has been a topic of interest in many programming languages research [16]. There are many infrastructures, specifically in web service adaptors, that can be repurposed for collecting provenance [17].…”
Section: Related Workmentioning
confidence: 99%
“…Of existing trace-based monitoring systems, only NetLogger [28] allows online monitoring by transmitting collected events to a central collection node for runtime analysis. This allows it to obtain a global view of system behavior.…”
Section: Related Workmentioning
confidence: 99%
“…Trace-based profiling approaches [1,3,9,11,15,20,21,22,24,25,28,31] can provide detailed information of the whole system, including cross-node performance information such as profiling of the distributed critical path of the entire application.…”
Section: Introductionmentioning
confidence: 99%
“…Monitoring systems are widely deployed to detect system/application events and collect performance data [15]. Some monitoring systems today are integrated with service request management systems to automatically open incident tickets upon detection of certain events [11].…”
Section: A Data Integrationmentioning
confidence: 99%