首页> 外文期刊>Acta Informatica >A Paxos based algorithm to minimize the overhead of process recovery in consensus
【24h】

A Paxos based algorithm to minimize the overhead of process recovery in consensus

机译:基于Paxos的算法可最大程度地减少共识过程恢复的开销

获取原文
获取原文并翻译 | 示例
           

摘要

Consensus is a fundamental abstraction in distributed systems and its solvability is widely discussed in the literature. In message passing distributed systems where there is a need to solve sequential instances of consensus, it is possible that some processes become faulty during one instance and recover later in another instance. Though consensus algorithms should be equipped both to handle process failures and process recovery, only a little amount of work has been done in the literature to handle process recovery. Handling process recovery is not trivial because a recovered process may broadcast a new message which could hamper the progress made by other processes towards achieving consensus in their current round, and thereby forcing them to start a new round. Therefore algorithms that are not designed to handle process recovery require O(f) rounds or O(f) time to achieve consensus, where at most f processes can recover and is the message delay in the system. But Dutta et al. (in: International conference on dependable systems and networks (DSN'05), pp 22-27), 2005. 10.1109/DSN.2005.54) showed that the overhead of handling process recovery is constant and their algorithm takes 17 time to achieve consensus. In this work, we introduce a new Paxos based algorithm that lowers the upper bound to 11. We also show that if all process failures are initial, the upper bound can be further reduced to 5. Our algorithm selectively enables processes executing lower rounds to decide irrespective of the presence of higher rounds in the system, minimizing the effect of recovered processes starting a higher round.
机译:共识是分布式系统的基本抽象,其可解性已在文献中广泛讨论。在需要解决共识的连续实例的消息传递分布式系统中,某些进程有可能在一个实例期间出现故障,而在另一个实例中稍后恢复。尽管共识算法应同时处理过程故障和过程恢复,但文献中仅进行了少量工作来处理过程恢复。处理流程的恢复并非易事,因为恢复的流程可能会广播新消息,这可能会阻碍其他流程在当前轮次达成共识并因此迫使他们开始新一轮谈判方面取得的进展。因此,未设计用于处理进程恢复的算法需要O(f)轮或O(f)时间才能达成共识,其中最多f个进程可以恢复,这是系统中的消息延迟。但是Dutta等。 (见:可靠系统和网络国际会议(DSN'05),第22-27页),2005年。10.1109 / DSN.2005.54)表明处理过程恢复的开销是恒定的,并且它们的算法需要17个小时才能达成共识。在这项工作中,我们引入了一种新的基于Paxos的算法,该算法将上限降低到11。我们还表明,如果所有流程失败都是初始的,则上限可以进一步降低到5。我们的算法有选择地使执行较低回合的流程决定不管系统中是否存在更高级别的回合,都可以将开始更高级别的恢复过程的影响降至最低。

著录项

  • 来源
    《Acta Informatica》 |2019年第5期|433-446|共14页
  • 作者单位

    Natl Inst Technol Warangal, Dept Comp Sci & Engn, Warangal, Telangana, India;

    Natl Inst Technol Warangal, Dept Comp Sci & Engn, Warangal, Telangana, India;

  • 收录信息 美国《科学引文索引》(SCI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号