Proceedings of the 2017 Internet Measurement Conference 2017
DOI: 10.1145/3131365.3131384
|View full text |Cite
|
Sign up to set email alerts
|

Pinpointing delay and forwarding anomalies using large-scale traceroute measurements

Abstract: Understanding data plane health is essential to improving Internet reliability and usability. For instance, detecting disruptions in peer and provider networks can identify repairable connectivity problems. Currently this task is time consuming as it involves a fair amount of manual observation, as an operator has poor visibility beyond their network's border. In this paper we leverage existing public RIPE Atlas measurement data to monitor and analyze network conditions; creating no new measurements. We demons… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
49
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3
1

Relationship

3
6

Authors

Journals

citations
Cited by 38 publications
(49 citation statements)
references
References 44 publications
0
49
0
Order By: Relevance
“…Disco [27] detects surge of Atlas probe disconnections using a burst modeling algorithm. Using also Atlas data, authors of [12] rely on the central limit theorem to model usual Internet delays and identify network disruptions.…”
Section: Introductionmentioning
confidence: 99%
“…Disco [27] detects surge of Atlas probe disconnections using a burst modeling algorithm. Using also Atlas data, authors of [12] rely on the central limit theorem to model usual Internet delays and identify network disruptions.…”
Section: Introductionmentioning
confidence: 99%
“…To validate the ability of our method to detect events, we analyze two events which have been discussed in the literature (as this provides some groundtruth against which we can compare our results): AMS-IX outage in May 2015 [4,17,21], and DE-CIX Frankfurt outage in April 2018 [5] [4], on the 13th of May 2015, AMS-IX experienced a partial outage due to a switch interface generating looped traffic on the peering LAN. The event lasted for seven minutes and two seconds, from 10:22:12 to 10:29:14 (UTC time) before the switch interface was disconnected.…”
Section: Monitoring Large Internet Infrastructuresmentioning
confidence: 99%
“…In [14] a technique is proposed to address these challenges when a link is observed by a sufficient number of probes with different return paths. That technique monitors the shifts in the distribution of the median differential RTT(RT T Dif f ) and distinguishes strong alarms.…”
Section: F Delay Change Detectionmentioning
confidence: 99%
“…Note that multiple routers of colo A and colo B may be involved in this grouping. Like in [14], we calculate confidence intervals for both the observed and the reference RT T Dif f to detect significant statistical changes. If those confidence intervals stop to ovelap we report an alarm like those of Fig.…”
Section: F Delay Change Detectionmentioning
confidence: 99%