北京航空航天大学学报 ›› 2016, Vol. 42 ›› Issue (4): 844-850.doi: 10.13700/j.bh.1001-5965.2015.0277

• 论文 • 上一篇    下一篇

基于MDP的诊断策略构建方法

梁雅俊1, 肖明清1, 宋海方1, 杨召1, 梁鹏2   

  1. 1. 空军工程大学 航空航天工程学院, 西安 710038;
    2. 95503部队, 重庆 402360
  • 收稿日期:2015-05-05 修回日期:2015-09-02 出版日期:2016-04-20 发布日期:2016-04-29
  • 通讯作者: 肖明清, Tel.: 13909285251 E-mail: xmqing@sohu.com E-mail:xmqing@sohu.com
  • 作者简介:梁雅俊 女,博士研究生。主要研究方向:机载武器装备测试、诊断自动化与智能化。 Tel.: 15691805351 E-mail: 1214102891@qq.com;肖明清 男,博士,教授,博士生导师。主要研究方向:航空武器综合保障。 Tel.: 13909285251 E-mail: xmqing@sohu.com

Diagnostic strategy building method based on MDP

LIANG Yajun1, XIAO Mingqing1, SONG Haifang1, YANG Zhao1, LIANG Peng2   

  1. 1. Aeronautics and Astronautics Engineering College, Air Force Engineering University, Xi'an 710038, China;
    2. Unit 95503, Chongqing 402360, China
  • Received:2015-05-05 Revised:2015-09-02 Online:2016-04-20 Published:2016-04-29

摘要: 针对传统方法忽略测试通过的不确定性因素,缺乏长周期寻优机制,难以在复杂测试系统中生成全局最优诊断策略的问题,提出了一种基于马尔可夫决策过程(MDP)的诊断策略构建方法。该方法将故障检测、隔离的过程表述为系统故障状态的马尔可夫过程,通过引入折扣因子与目标权重,构造了综合效用准则函数的无限折扣模型,并利用策略迭代算法求解出全局平稳最优诊断策略。实例表明,该方法充分考虑了测试通过的不确定性,可实现全局平稳策略寻优,能够有效地指导测试系统实现快速故障检测和隔离。

关键词: 诊断策略, 马尔可夫决策过程(MDP), 故障检测, 策略迭代算法, 策略优化

Abstract: Aiming at the problem that by the traditional method, it is difficult to get the global optimal diagnostic strategy of the complicated test system in fault detection for ignoring the uncertainty factors in the test execution and lacking of the long cycle optimization mechanism, a new diagnostic strategy building method based on Markov decision processes (MDP) is proposed. The process of fault detection and isolation is expressed as a Markov process; the unlimited discount model of the utility integrated criterion function is structured through the discount factor and objective weights; the global optimal diagnostic strategy is obtained with the policy iteration algorithm. The example shows that the test uncertainty factors are well considered, stable optimal strategy of overall situation can be achieved by this method, and the fast fault detection and isolation in the engineering practice can be guided effectively as well.

Key words: diagnostic strategy, Markov decision processes (MDP), fault detection, policy iteration algorithm, strategy optimization

中图分类号: 


版权所有 © 《北京航空航天大学学报》编辑部
通讯地址:北京市海淀区学院路37号 北京航空航天大学学报编辑部 邮编:100191 E-mail:jbuaa@buaa.edu.cn
本系统由北京玛格泰克科技发展有限公司设计开发