2012
DOI: 10.1088/1742-6596/396/3/032008
|View full text |Cite
|
Sign up to set email alerts
|

Service Availability Monitoring Framework Based On Commodity Software

Abstract: The Worldwide LHC Computing Grid (WLCG) infrastructure continuously operates thousands of grid services scattered around hundreds of sites. Participating sites are organized in regions and support several virtual organizations, thus creating a very complex and heterogeneous environment. The Service Availability Monitoring (SAM) framework is responsible for the monitoring of this infrastructure. SAM is a complete monitoring framework for grid services and grid operational tools. Its current implementation tailo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 6 publications
0
7
0
Order By: Relevance
“…Availability computation gathers the service status information to compute the fraction of time the service was up (in OK state) during the period the service was known. Unlike availability, reliability does not consider scheduled downtime in the service known period [3].…”
Section: Results Computationmentioning
confidence: 99%
See 1 more Smart Citation
“…Availability computation gathers the service status information to compute the fraction of time the service was up (in OK state) during the period the service was known. Unlike availability, reliability does not consider scheduled downtime in the service known period [3].…”
Section: Results Computationmentioning
confidence: 99%
“…SAM is a complete monitoring framework for grid services deployed and managed for distributed environments. It provides two main services: SAM-Nagios, a regional scheduler of monitoring probes supporting local configuration and storage/visualization of probe results; and SAM-Gridmon, a central aggregator of monitoring data and computation engine providing multiple interfaces that exposes curated monitoring results [3].…”
Section: Monitoring Servicesmentioning
confidence: 99%
“…Service Availability Monitoring, SAM [8] is a system developed for monitoring, at different level, services (based on grid and cloud paradigms) in EGI distributed infrastructure. The advantage of SAM is that it is integrated with other systems used for sites operations in EGI context and numerous sensors have been developed in ad-hoc manner.…”
Section: Related Workmentioning
confidence: 99%
“…It ensures feedback on the quality of services delivered by the sites and identifies and helps to mitigate outages caused by middleware failures. For WLCG and EGI, this is performed by the Service Availability Monitoring framework, a well established monitoring platform that computes the overall availability and reliability of services [4].…”
Section: Introductionmentioning
confidence: 99%
“…1 shows how SAM-MR extends the existing architecture and how it connects to the existing SAM components. Distributed model of SAM currently provides two deployment models: a centralized instance that computes overall availability and reliability and a set of regional instances that gather data from WLCG experiments and National Grid Initiatives (NGIs) [4]. Both models are connected to a message bus that offers efficient transport of data between all the instances.…”
Section: Introductionmentioning
confidence: 99%