Replicated Processes in Manetho

This paper presents the process­replication protocol of Manetho, a system whose goal is to provide efficient, application­transparent fault tolerance to long­running distributed computations. Manetho uses a new negative­ acknowledgment multicast protocol to enforce the same receipt order of application messages among all replicas of a process. The protocol depends on a combination of antecedence graph maintenance, a form of sender­based message logging, and the fact that the receivers of each multicast execute the same deterministic program. This combination allows our protocol to avoid the delay in application message delivery that is common in exist­ ing negative­acknowledgment multicast protocols, without giving up the advantage of requiring only a small number of control messages.


Presented at:
Proceedings of the Twentysecond Fault-Tolerant Computing Symposium, July 1992
Year:
1992
Laboratories:




 Record created 2005-10-20, last modified 2018-03-17

n/a:
Download fulltext
PS.PDF

Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)