2015
DOI: 10.1145/2829988.2787501

Rollback-Recovery for Middleboxes

Abstract: Network middleboxes must offer high availability, with automatic failover when a device fails. Achieving high availability is challenging because failover must correctly restore lost state (e.g., activity logs, port mappings) but must do so quickly (e.g., in less than typical transport timeout values to minimize disruption to applications) and with little overhead to failure-free operation (e.g., additional per-packet latencies of 10-100s of µs). No existing middlebox design provides failover that is correct, …

Citations: cited by 60 publications (94 citation statements)
References: 35 publications
“…On SPLASH-2, Chimera saw overhead ranging from 1.6× for two cores to over 3× for 8 cores. Similar systems exhibited substantially larger overheads [24] or only considered small and easily parallelizable programs [43].…”
Section: Related Work (mentioning)
confidence: 99%
“…One or more backup nodes are assigned to a flow (constraints (11)). A flow is assigned a backup node provided that it is using at least one backup instance hosted on the node (constraint (12)). This model is an Integer Non-linear Program (INLP) because of the nonlinearity of equation (10).…”
Section: B Formulation: All-any (mentioning)
confidence: 99%
“…Typically, NFs need to maintain 10-100s of state variables that are per-flow or shared across flows [10]. Backup instances of stateful NFs need to have updated state information to ensure successful failover and service continuity [11], [12].…”
Section: Introduction (mentioning)
confidence: 99%
“…Active-active replication, where master and slave are executed on all inputs but only the master's output is released to users, will not work because of the non-deterministic nature of packet processing in middleboxes. Sherry et al [130], [131] proposed a fault-tolerant middlebox, a new design for fault-tolerant middleboxes that achieves correctness, fast recovery with only a slight increase in latency. They took a replay-based approach that maintains a log of inputs to the system and recreates the lost state by replaying the inputs from the log in the event of a failure.…”
Section: State Management (mentioning)
confidence: 99%
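
The replay-based approach described in the statement above can be illustrated with a minimal sketch. This is not the design from the cited paper; it is a toy example under stated assumptions. The `Nat` class, `run_with_log`, and `recover` are hypothetical names introduced here, and the log is kept as an in-memory list rather than on a separate machine. The sketch records each input together with the one nondeterministic decision (the port chosen), so that replaying the log on a fresh replica rebuilds the same state.

```python
import random
from dataclasses import dataclass, field


@dataclass
class Nat:
    """Toy NAT (hypothetical): maps each flow to an external port."""
    port_map: dict = field(default_factory=dict)

    def process(self, flow, chosen_port=None):
        # Picking a port is the nondeterministic step. During replay the
        # logged choice is passed in, so the same mapping is rebuilt.
        if flow not in self.port_map:
            if chosen_port is None:
                used = set(self.port_map.values())
                free = [p for p in range(1024, 65536) if p not in used]
                chosen_port = random.choice(free)
            self.port_map[flow] = chosen_port
        return self.port_map[flow]


def run_with_log(nat, flows, log):
    """Process each input and append (input, decision) to the log."""
    for flow in flows:
        port = nat.process(flow)
        log.append((flow, port))


def recover(log):
    """Recreate lost state on a fresh replica by replaying the log."""
    replica = Nat()
    for flow, port in log:
        replica.process(flow, chosen_port=port)
    return replica
```

A possible usage: run a primary `Nat` over a few flows with `run_with_log`, discard the primary to simulate a failure, then call `recover(log)` and compare `port_map` on the replica with the primary's last state. Logging the decision alongside the input is what makes the replay deterministic in this sketch; replaying inputs alone would let `random.choice` pick different ports.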