-
摘要:
对战机对抗导弹的措施优化问题进行了研究,将该问题归结为不确定条件下的序贯决策问题,提出了一种基于Markov决策过程(MDP)的导弹对抗措施优化方法。首先, 该方法根据作战过程将作战阶段分为不同的状态,将电子对抗措施(ECM)和战术机动作为可供战机选择的行动;然后, 通过不同战术行动下状态之间的转移概率来反映行动的不确定性,以雷达搜索状态和导弹命中状态的期望值来表征不同策略下的飞机生存力;最后,通过建立Markov决策模型得到飞机生存力最大时各个状态对应的最佳行动。仿真结果表明:飞机生存力随着时间推移而逐渐降低;不同策略下飞机的生存力不同,基于MDP的策略可以有效提高飞机的生存力;单步优化措施不能提高战机的生存力,必须考虑长期状态转移的影响。
-
关键词:
- 飞机生存力 /
- 电子对抗措施(ECM) /
- 战术机动 /
- Markov决策过程(MDP) /
- 导弹对抗 /
- 序贯决策
Abstract:Missile countermeasure optimization problem for fighter aircraft was studied. The missile countermeasure optimization was modeled as a sequential decision-making problem under uncertainty, and a Markov decision process (MDP) based approach was proposed. First, the engagement process was divided into several phases, and these phrases were marked by states. Electronic counter measures (ECM) and strategic maneuvers were treated as actions. Then, the state transition probability was used to reflect the uncertainty of each action, and average occupancy of "hit" state and "search" state was used to evaluate the aircraft survivability under different policies. Finally, the policy iteration algorithm was used to get the optimal policy, which maps the optimal action to be taken in each state. Simulation indicates that the aircraft survivability decreases as time goes on; the proposed MDP-based approach can effectively improve the aircraft survivability; one-step optimization is useless for the aircraft survivability and the influence of the state transition should be considered from the view of long-time horizon.
-
表 1 状态集及其元素
Table 1. State set and its elements
编号 状态名称及缩写 s1 Search s2 Acquisition (ACQ) s3 Non-Adaptive Track (NAT) s4 Range Resolution 1 (RR1) s5 Range Resolution 2 (RR2) s6 Range Resolution 3 (RR3) s7 Track Maintenance (TM) s8 Passive Track (PT) s9 Missile Launching (ML) s10 Mid-Course Guidance (MCG) s11 Terminal Guidance (TG) s12 Miss the target (Miss) s13 Hit the target (Hit) 表 2 行动集及其元素
Table 2. Action set and its elements
行动集 状态相关的行动 行动名称 A1 a1→s1, s2, s10 Multi False Target (MFT) a2→s1~s7, s10 Noise Jamming (NJ) a3→s4 Range False Target 1 (RFT1) a4→s5 Range False Target 2 (RFT2) a5→s6 Range False Target 3 (RFT3) a6→s3 Range-Gate Pull-Off (RGPO) a7→s7, s10 Velocity-Range-Gate Pull-Off (VRGPO) a8→s11 Flares+Chaffs a9→s8 No Jamming (NoJ) A2 a10→s1 No Maneuvering (NoM) a11→s7 Maximum Acceleration (MA) a12→s7 Maximum Deceleration (MD) a13→s10, s11 Maximum Overload Pull-Up (MOPU) a14→s10, s11 Maximum Overload Dive (MOD) a15→s10, s11 Maximum Overload Left Turn (MOLT) a16→s10, s11 Maximum Overload Right Turn (MORT) A3 a17→s2, s7 MOPU/MOD/MOLT/MORT+MFT a18→s7, s10 MOPU/MOD/MOLT/MORT+VRGPO a19→s10, s11 MA/MD+Chaffs+Flares a20→s11 MOPU/MOD/MOLT/MORT+Chaffs+Flares -
[1] ERLANDSSON T, NIKLASSON L. An air-to-ground combat survivability model[J].Journal of Defense Modeling and Simulation, Application, Methodology, Technology, 2013, 12(3):1-15. [2] RANDLEFF L R.Decision support system for fighter pilots[D].Kongens Lyngby:Technical University of Denmark, 2007. [3] ZARCHAN P.Tactical and strategic missile guidance[M].6th ed.Reston:AIAA, 2012:33-58. [4] VIRTANEN K, RAIVIO T.Decision theoretical approach to pilot simulation[J].Journal of Aircraft, 1999, 36(4):18-27. [5] VIRTANEN K, KARELAHTI J, RAIVIO T.Modeling air combat by a moving horizon influence diagram game[J].Journal of Guidance, and Dynamics, 2006, 29(5):1080-1091. doi: 10.2514/1.17168 [6] YOMCHINDA T.A study of autonomous evasive planar-maneuver against proportional-navigation guidance missiles for unmanned aircraft[C]//2015 Asian Conference on Defense Technology.Piscataway, NJ:IEEE Press, 2015. [7] BACHMANN D J, EVANS R J, MORAN B.Game theoretic analysis of adaptive radar jamming[J].IEEE Transactions on Aerospace and Electronic Systems, 2011, 47(2):1081-1100. doi: 10.1109/TAES.2011.5751244 [8] BROOKS R R, PANG J E, GRIFFIN C.Game and information theory analysis of electronic countermeasures in pursuit-evasive games[J].IEEE Transactions on Systems, Man and Cybernetics-Part A:Systems and Humans, 2008, 38(6):1281-1294. doi: 10.1109/TSMCA.2008.2003970 [9] 沈阳, 陈永光, 李修和.基于0-1规划的雷达干扰资源优化分配研究[J].兵工学报, 2007, 28(5):528-532. http://www.cnki.com.cn/Article/CJFDTOTAL-BIGO200705004.htmSHEN Y, CHEN Y G, LI X H.Research on optimal distribution of radar jamming resource based on zero-one programming[J].Acta Armamentarii, 2007, 28(5):528-532(in Chinese). http://www.cnki.com.cn/Article/CJFDTOTAL-BIGO200705004.htm [10] MOORE F W.A methodology for missile countermeasures optimization under uncertainty[J].Evolutionary Computation, 2002, 10(2):129-149. doi: 10.1162/106365602320169820 [11] BALL R E.The fundamentals of aircraft combat survivability analysis and design[M].2nd ed.Reston:AIAA, 2003:445-602. [12] 杨哲, 李曙林, 周莉.机载自卫压制干扰和箔条干扰下飞机生存力研究[J].北京理工大学学报, 2013, 33(4):375-379. http://www.cnki.com.cn/Article/CJFDTOTAL-BJLG201304011.htmYANG Z, LI S L, ZHOU L.Study of aircraft survivability under the conditions of self-defense jamming and chaff jamming[J].Transactions on Beijing Institute of Technology, 2013, 33(4):375-379(in Chinese). http://www.cnki.com.cn/Article/CJFDTOTAL-BJLG201304011.htm [13] 宋海方, 肖明清, 吴华, 等.不同机载电子干扰条件下的飞机敏感性模型[J].航空学报, 2015, 36(11):3630-3639. http://www.cnki.com.cn/Article/CJFDTOTAL-HKXB201511014.htmSONG H F, XIAO M Q, WU H, et al.Genetic model of aircraft susceptibility to different airborne electronic countermeasures[J].Acta Aeronautica et Astroautica Sinica, 2015, 36(11):3630-3639(in Chinese). http://www.cnki.com.cn/Article/CJFDTOTAL-HKXB201511014.htm [14] ERLANDSSON T, NIKLASSON L.Automatic evaluation of air mission routes with respect to combat survival[J].Information Fusion, 2014, 20(1):88-98. [15] HASAN S M, GUITOUNI A.Variable dwell time task scheduling for multifunction radar[J].IEEE Transactions on Automation Science and Engineering, 2014, 11(2):463-472. doi: 10.1109/TASE.2013.2285014 [16] MARTINO A D. 现代电子战系统导论[M]. 姜道安, 等, 译. 北京: 电子工业出版社, 2014: 5-18.MARTINO A D.Introduction to modern EW systems[M].JIANG D A, et al, translated.Beijing:Publishing House of Electronic Industry, 2014:5-18(in Chinese). [17] SIGAUD O, BUFFET O.Markov decision processes in artificial intelligence:MDPs, beyond MDPs and applications[M].New York:John Wiley & Sons, 2010:1-63. [18] RUSSELL S J, NORVIG P.Artificial intelligence:A modern approach[M].3rd ed.Beijing:Tsinghua University Press, 2011:645-684. [19] VISNEVSKI N, KRISHNAMURTHY V, WANG A, et al.Syntactic modeling and signal processing of multifunction radars:A stochastic context-free grammar approach[J].Proceeding of IEEE, 2007, 95(5):1000-1025. doi: 10.1109/JPROC.2007.893252 [20] 傅莉, 谢福怀, 孟光磊, 等.基于滚动时域的无人机空战决策专家系统[J].北京航空航天大学学报, 2015, 41(11):1994-1999.FU L, XIE F H, MENG G L, et al.An UAV air-combat decision expert system based on receding horizon control[J].Journal of Beijing University of Aeronautics and Astronautics, 2015, 41(11):1994-1999(in Chinese). [21] 娄寿春.面空导弹武器系统分析[M].北京:国防工业出版社, 2013:50-132.LOU S C.Surface-to-air missile weapon system analysis[M].Beijing:National Defense Industry Press, 2013:50-132(in Chinese). [22] BANDIERA F, FARINA A, ORLANDO D, et al.Detection algorithms to discriminate between radar targets and ECM signals[J].IEEE Transactions on Signal Processing, 2010, 58(12):5984-5993. doi: 10.1109/TSP.2010.2077283 -