Multi-shot distributed transaction commit

Chockler, Gregory; Gotsman, Alexey

doi:10.1007/s00446-021-00389-4

Cited by 5 publications

(18 citation statements)

References 47 publications

(90 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…We present protocols in two different models-the standard asynchronous messagepassing model ( §3) and a model with Remote Direct Memory Access (RDMA), which allows a machine to access the memory of another machine over the network without involving the latter's CPU ( §5). Our protocols are parametric in the isolation level provided, and we prove that they correctly implement the TCS specification from the multi-shot commit problem [5] ( §4).…”

Section: Introductionmentioning

confidence: 84%

“…A Transaction Certification Service (TCS) is meant to be used in the context of transactional processing systems with optimistic concurrency control [26], where transactions are first executed speculatively, and the results are submitted for certification to the TCS. We start by reviewing its specification proposed in [5]. Clients invoke the TCS using requests of the form certify(t, l), where t ∈ T is a unique transaction identifier and l ∈ L is the transaction payload, which carries the results of the optimistic execution of the transaction (e.g., read and write sets).…”

Section: Transaction Certification Servicementioning

confidence: 99%

“…A typical system based on optimistic concurrency control will ensure that transactions submitted for certification only read versions written by previously committed transactions. A history produced by such a system that is correct with respect to certification function (2) is also serializable [5]. Hence, a TCS correct with respect to this certification function can indeed be used to implement serializability.…”

Section: Transaction Certification Servicementioning

confidence: 99%

“…When a shard s votes on a transaction, it does not have information about all transactions in the system, but only those that concern it. Hence, the votes are computed using not the global certification function f , but shard-local certification functions [5], which check for conflicts only on objects managed by the shard and correspondingly take as parameters only the parts of the transaction payloads relevant to the shard: for a payload l we denote this by l | s. For example, let Obj s be the set of objects managed by a shard s. For a payload l = ⟨R,W , V c ⟩ of the form given above, we let…”

Section: Transaction Certification Servicementioning

confidence: 99%

“…TCS is the most challenging part of transaction processing in systems with the above architecture, since it requires solving a distributed agreement problem among the replicated shards participating in the transaction. This agreement problem has been recently formalized as the multi-shot commit problem [5], generalizing the classical atomic commit problem [9] to more faithfully reflect the requirements of modern transaction processing systems (we review the new problem statement in §2).…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Reconfigurable Atomic Transaction Commit

Bravo

Gotsman

2019

Proceedings of the 2019 ACM Symposium on Principles of Distributed Computing

Self Cite

View full text Add to dashboard Cite

Modern data stores achieve scalability by partitioning data into shards and fault-tolerance by replicating each shard across several servers. A key component of such systems is a Transaction Certification Service (TCS), which atomically commits a transaction spanning multiple shards. Existing TCS protocols require 2f + 1 crash-stop replicas per shard to tolerate f failures. In this paper we present atomic commit protocols that require only f + 1 replicas and reconfigure the system upon failures using an external reconfiguration service. We furthermore rigorously prove that these protocols correctly implement a recently proposed TCS specification. We present protocols in two different models-the standard asynchronous message-passing model and a model with Remote Direct Memory Access (RDMA), which allows a machine to access the memory of another machine over the network without involving the latter's CPU. Our protocols are inspired by a recent FARM system for RDMA-based transaction processing. Our work codifies the core ideas of FARM as distributed TCS protocols, rigorously proves them correct and highlights the trade-offs required by the use of RDMA.

show abstract

Section: Introductionmentioning

confidence: 84%

Section: Transaction Certification Servicementioning

confidence: 99%

Section: Transaction Certification Servicementioning

confidence: 99%

Section: Transaction Certification Servicementioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Reconfigurable Atomic Transaction Commit

Bravo

Gotsman

2019

Proceedings of the 2019 ACM Symposium on Principles of Distributed Computing

Self Cite

View full text Add to dashboard Cite

show abstract

White-Box Atomic Multicast

Gotsman¹,

Lefort²,

Chockler³

2019

2019 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN)

Self Cite

View full text Add to dashboard Cite

Atomic multicast is a communication primitive that delivers messages to multiple groups of processes according to some total order, with each group receiving the projection of the total order onto messages addressed to it. To be scalable, atomic multicast needs to be genuine, meaning that only the destination processes of a message should participate in ordering it. In this paper we propose a novel genuine atomic multicast protocol that in the absence of failures takes as low as 3 message delays to deliver a message when no other messages are multicast concurrently to its destination groups, and 5 message delays in the presence of concurrency. This improves the latencies of both the fault-tolerant version of classical Skeen's multicast protocol (6 or 12 message delays, depending on concurrency) and its recent improvement by Coelho et al. (4 or 8 message delays). To achieve such low latencies, we depart from the typical way of guaranteeing fault-tolerance by replicating each group with Paxos. Instead, we weave Paxos and Skeen's protocol together into a single coherent protocol, exploiting opportunities for white-box optimisations. We experimentally demonstrate that the superior theoretical characteristics of our protocol are reflected in practical performance pay-offs.

show abstract