Proceedings of the Twenty-Fifth International Conference on Architectural Support for Programming Languages and Operating Syste 2020
DOI: 10.1145/3373376.3378496
|View full text |Cite
|
Sign up to set email alerts
|

Hermes: A Fast, Fault-Tolerant and Linearizable Replication Protocol

Abstract: Today's datacenter applications are underpinned by datastores that are responsible for providing availability, consistency, and performance. For high availability in the presence of failures, these datastores replicate data across several nodes. This is accomplished with the help of a reliable replication protocol that is responsible for maintaining the replicas strongly-consistent even when faults occur. Strong consistency is preferred to weaker consistency models that cannot guarantee an intuitive behavior f… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 36 publications
(21 citation statements)
references
References 74 publications
0
21
0
Order By: Relevance
“…The protocol saturates at around 4K clients per site, when the outgoing network bandwidth at the leader reaches 95% usage. The fact that the leader can be a bottleneck in leader-based protocol has been reported by several prior works [13,23,24,31,43].…”
Section: Full Replication Deploymentmentioning
confidence: 98%
See 1 more Smart Citation
“…The protocol saturates at around 4K clients per site, when the outgoing network bandwidth at the leader reaches 95% usage. The fact that the leader can be a bottleneck in leader-based protocol has been reported by several prior works [13,23,24,31,43].…”
Section: Full Replication Deploymentmentioning
confidence: 98%
“…In particular, many storage systems orchestrate data access using a fault-tolerant timestamping service [2,3,36,42,47], usually implemented by a leader-based SMR protocol [29,35]. As reported in prior works, the leader is a potential bottleneck and is unfair with respect to client locations [14,24,25,32,44]. To sidestep these problems, leaderless protocols order commands in a fully decentralized manner.…”
Section: Related Workmentioning
confidence: 99%
“…Several previous systems have demonstrated such load balancers [3,5,7,51]. We implement a simple one using a distributed, replicated key-value store based on Hermes [35]. We extract a key from each request and look it up in the key-value store.…”
Section: Zeus System Architecturementioning
confidence: 99%
“…Cache coherence protocols have been used to implement hardware transactions [29]. Zeus builds on ideas in Hermes [35], which adapted concepts from cache coherence and applied them to enforce strong consistency for replicated in-memory datastores. Hermes allows for local reads and fast reliable updates to individual objects from all replicas; however, it does not support multi-object reliable transactions or ownerships.…”
Section: Related Workmentioning
confidence: 99%
“…To this end, we first develop a taxonomy of existing protocols, classifying them into four classes based on their operational patterns (Section 3). To understand the performance of the different classes of protocols, we carefully select ten protocols for analysis: ZAB [25], Multi-Paxos [39], CHT and multi-leader CHT [10], CRAQ [62], Derecho [26], Classic Paxos (CP) [36], All-Aboard Paxos [23], ABD [48] and Hermes [33].…”
Section: Introductionmentioning
confidence: 99%