Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. uBFT: Microsecond-Scale BFT using Disaggregated Memory
 
conference paper

uBFT: Microsecond-Scale BFT using Disaggregated Memory

Aguilera, Marcos K.
•
Ben-David, Naama
•
Guerraoui, Rachid  
Show more
2023
ASPLOS 2023: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems
28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2023)

We propose uBFT, the first State Machine Replication (SMR) system to achieve microsecond-scale latency in data centers, while using only 2f+1 replicas to tolerate f Byzantine failures. The Byzantine Fault Tolerance (BFT) provided by uBFT is essential as pure crashes appear to be a mere illusion with real-life systems reportedly failing in many unexpected ways. uBFT relies on a small non-tailored trusted computing base—disaggregated memory—and consumes a practically bounded amount of memory. uBFT is based on a novel abstraction called Consistent Tail Broadcast, which we use to prevent equivocation while bounding memory. We implement uBFT using RDMA-based disaggregated memory and obtain an end-to-end latency of as little as 10 us. This is at least 50× faster than MinBFT, a state-of-the-art 2f+1 BFT SMR based on Intel’s SGX. We use uBFT to replicate two KV-stores (Memcached and Redis), as well as a financial order matching engine (Liquibook). These applications have low latency (up to 20 us) and become Byzantine tolerant with as little as 10 us more. The price for uBFT is a small amount of reliable disaggregated memory (less than 1 MiB), which in our prototype consists of a small number of memory servers connected through RDMA and replicated for fault tolerance.

  • Details
  • Metrics
Type
conference paper
DOI
10.1145/3575693.3575732
Author(s)
Aguilera, Marcos K.
Ben-David, Naama
Guerraoui, Rachid  
Murat, Antoine  
Xygkis, Athanasios  
Zablotchi, Igor
Date Issued

2023

Published in
ASPLOS 2023: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems
Volume

2

Start page

862

End page

877

Subjects

Byzantine fault tolerance

•

microsecond scale

•

replication

•

disaggregated memory

•

fast path

•

finite memory

•

RDMA

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
DCL  
Event nameEvent placeEvent date
28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2023)

Vancouver, BC, Canada

March 25–29, 2023

Available on Infoscience
February 24, 2023
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/195078
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés