留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

基于自适应心跳算法的分布式系统故障检测器

王明 张春熹 伊小素

王明, 张春熹, 伊小素等 . 基于自适应心跳算法的分布式系统故障检测器[J]. 北京航空航天大学学报, 2013, 39(7): 952-956.
引用本文: 王明, 张春熹, 伊小素等 . 基于自适应心跳算法的分布式系统故障检测器[J]. 北京航空航天大学学报, 2013, 39(7): 952-956.
Wang Ming, Zhang Chunxi, Yi Xiaosuet al. Fault detector of fault-tolerant distributed systems based on self-adaptive heartbeat algorithm[J]. Journal of Beijing University of Aeronautics and Astronautics, 2013, 39(7): 952-956. (in Chinese)
Citation: Wang Ming, Zhang Chunxi, Yi Xiaosuet al. Fault detector of fault-tolerant distributed systems based on self-adaptive heartbeat algorithm[J]. Journal of Beijing University of Aeronautics and Astronautics, 2013, 39(7): 952-956. (in Chinese)

基于自适应心跳算法的分布式系统故障检测器

详细信息
  • 中图分类号: N 945.17

Fault detector of fault-tolerant distributed systems based on self-adaptive heartbeat algorithm

  • 摘要: 故障检测是容错分布式系统中的关键技术之一.为了提高故障检测的性能,提出一种新型的故障检测器——自适应心跳检测器(SA-HD, Self-Adaptive Heartbeat Detector).SA-HD采用了基于拉式(pull)的自适应心跳算法,在考虑故障检测性能的同时也考虑了心跳检测所占用的网络资源对网络性能的影响.SA-HD能够根据网络负载调节自身发送心跳消息的频率,提高了心跳检测的网络环境适应能力,尤其是在高负载的环境下,能够有效改善心跳检测的性能.建立了SA-HD的模型,对其性能进行了仿真分析,并通过试验验证了SA-HD性能要优于传统推式(push)的心跳检测器.

     

  • [1] Xiong Naixue,Yang Yan.A survey on fault-tolerance in distributed network systems //Proceedings of IEEE International Conference on Computational Science and Engineering.New York:IEEE,2009:1065-1070
    [2] Felber P,Defago X,Guerraoui R,et al.Failure detectors as first class objects //Proceedings of IEEE International Symposium on Distributed Objects and Applications.New York:IEEE,1999:132-141
    [3] Wiesmann M,Urban P,Defago X.An SNMP based failure detection service //Proceedings of the 25th IEEE International Symposium on Reliable Distributed Systems.New York:IEEE,2006:365-374
    [4] Zhu Hao,Chen Haopeng.Adaptive failure detection via heartbeat under hadoop //Proceedings of IEEE Asia-Pacific Services Computing Conference.Jeju:IEEE,2011:231-238
    [5] Roberto B,Jean M H,Sara T P.A methodology to design arbitrary failure detectors for distributed protocols[J].Journal of Systems Architecture,2008,54(7):619-637
    [6] Chen W,Sam T,Marcos K A.On the quality of service of failure detectors[J].IEEE Transactions on Computers,2002,51(1):13-32
    [7] Naohiro H,Xavier D,Rami Y,et al.The φ accrual failure detector //Proceedings of the 23th IEEE International Symposium on Reliable Distributed Systems.New York:IEEE,2004:66-78
    [8] Benjamin S,Andreas P,Wolfgang T,et al.A lazy monitoring approach for heartbeat-style failure detectors //Proceedings of the 3th International Conference on Availability,Reliability and Security.New York:IEEE,2008:404-409
    [9] Chandra T D,Toueg S.Unreliable of failure detectors for reliable distributed systems[J].Journal of the ACM,1996,43(2):225-267
    [10] Fetzer C,Raynal M,Tronel F.An adaptive failure detection protocol //Proceedings of the 8th Pacific Rim Symposium on Dependable Computing.New York:IEEE,2001:146-153
    [11] Kleinrock L.Queueing systems,volume 1:theory[M].New York:John Wiley,1962
  • 加载中
计量
  • 文章访问数:  2121
  • HTML全文浏览量:  249
  • PDF下载量:  845
  • 被引次数: 0
出版历程
  • 收稿日期:  2012-08-02
  • 网络出版日期:  2013-07-30

目录

    /

    返回文章
    返回
    常见问答