Failure Detection vs. Group Membership in Fault-Tolerant Distributed Systems: Hidden Trade-Offs

Schiper, A.

doi:10.1007/3-540-45605-8_1

2002

Abstract

Failure detection and group membership are two important components of fault-tolerant distributed systems. Understanding their role is essential when developing efficient solutions, not only in failure-free runs, but also in runs in which processes do crash. While group membership provides consistent information about the status of processes in the system, failure detectors provide inconsistent information. This paper discusses the trade-offs related to the use of these two components, and clarifies their roles using three examples. The first example shows a case where group membership may favourably be replaced by a failure detection mechanism. The second example illustrates a case where group membership is mandatory. Finally, the third example shows a case where neither group membership nor failure detectors are needed (they may be replaced by weak ordering oracles).
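The contrast drawn in the abstract between inconsistent and consistent information can be illustrated with a minimal sketch. This is not code from the paper: the class names `HeartbeatDetector` and `MembershipLog`, the timeout value, and the timestamps are all hypothetical, and the shared log merely stands in for the outcome of a real membership protocol, which would need agreement among the processes.

```python
from dataclasses import dataclass, field


@dataclass
class HeartbeatDetector:
    """Hypothetical local failure detector: suspects peers by timeout."""
    timeout: float
    last_seen: dict = field(default_factory=dict)

    def heartbeat(self, peer: str, at: float) -> None:
        # Record the most recent arrival time of a heartbeat from `peer`.
        self.last_seen[peer] = max(at, self.last_seen.get(peer, at))

    def suspects(self, now: float) -> set:
        # A peer is suspected when its last heartbeat is older than the timeout.
        return {p for p, t in self.last_seen.items() if now - t > self.timeout}


@dataclass
class MembershipLog:
    """Stand-in for a membership service: one agreed sequence of views.

    Assumption: agreement is modelled by a single shared list; a real
    service would run a protocol among the processes to obtain it.
    """
    views: list = field(default_factory=list)

    def install(self, members: frozenset) -> int:
        self.views.append(members)
        return len(self.views) - 1  # identifier of the newly installed view

    def current(self) -> frozenset:
        return self.views[-1]


# Two observers, p and r, receive q's heartbeats with different delays.
fd_p = HeartbeatDetector(timeout=2.0)
fd_r = HeartbeatDetector(timeout=2.0)
fd_p.heartbeat("q", at=1.0)  # p last heard from q at t = 1.0
fd_r.heartbeat("q", at=3.5)  # r received a later heartbeat that p missed

now = 4.0
suspects_p = fd_p.suspects(now)  # 4.0 - 1.0 = 3.0 > 2.0, so p suspects q
suspects_r = fd_r.suspects(now)  # 4.0 - 3.5 = 0.5, so r still trusts q

# Membership: the exclusion of q is decided once and seen by everyone.
log = MembershipLog()
log.install(frozenset({"p", "q", "r"}))
log.install(frozenset({"p", "r"}))
view_p = log.current()
view_r = log.current()
```

At the same instant the two detectors disagree about q, whereas both processes read the same current view. The sketch only models that difference in outputs; it says nothing about the cost of reaching agreement, which is the trade-off the paper discusses.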

Details

Title Failure Detection vs. Group Membership in Fault-Tolerant Distributed Systems: Hidden Trade-Offs

Author(s) Schiper, A.

Published in Process Algebra and Probabilistic Methods: Performance Modeling and Verification. PAPM-PROBMIV 2002

Pages 1-15

Conference Second Joint International Workshop PAPM-PROBMIV 2002, Copenhagen, Denmark, July 25–26, 2002

Date 2002

Publisher Springer-Verlag

Note Invited talk

DOI https://doi.org/10.1007/3-540-45605-8_1

Laboratories LSR

Record creation date 2005-05-20
