Proceedings. IEEE International Conference on Cluster Computing
DOI: 10.1109/clustr.2002.1137727
|View full text |Cite
|
Sign up to set email alerts
|

Supermon: a high-speed cluster monitoring system

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
45
0
1

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 79 publications
(46 citation statements)
references
References 3 publications
0
45
0
1
Order By: Relevance
“…TAUg has shown low overheads up to 512 processors. TAUoverSupermon takes a different strategy from TAUg, using the Supermon cluster monitoring framework [15] as the data aggregation mechanism (referred to in their paper as the "transport"). It too, shows low overhead up to 512 processors.…”
Section: B Types Of Existing Performance Analysis Toolsmentioning
confidence: 99%
“…TAUg has shown low overheads up to 512 processors. TAUoverSupermon takes a different strategy from TAUg, using the Supermon cluster monitoring framework [15] as the data aggregation mechanism (referred to in their paper as the "transport"). It too, shows low overhead up to 512 processors.…”
Section: B Types Of Existing Performance Analysis Toolsmentioning
confidence: 99%
“…It relies on symbolic expressions at all levels for communication between components in order to reduce node perturbation when parsing messages [27]. A single mon process runs on all the compute nodes, they parse the symbolic expressions from the kernel module and make that information available over a TCP port to clients and the szlpermon data aggregator.…”
Section: Supermonmentioning
confidence: 99%
“…Querying each node in the cluster is important to achieve the third goal of scalability since this time will only increase as cluster sizes become larger. To accomplish this goal, the Fountain daemons are arranged in a tree topology as described in act as a wrapper around a more specialized monitoring component like Supermon [27], and present its data using the SSS interface.…”
Section: Query Performancementioning
confidence: 99%
“…Recently, several projects have begun to address this problem by integrating hierarchical communication structures with online aggregation mechanisms, like MRNet [2] or Supermon [3], into their tools. On the debugger side, HP's Ladebug [4] relies on a tree of debug daemons to control large…”
Section: Introductionmentioning
confidence: 99%