Design of Differential Game Controllers Using Adaptive Critic Neural Networks
-
摘要: 采用由3个神经网络组成的自适应评判神经网络结构求解微分对策的2点边值问题,其中2个 为控制神经网络,分别实现对微分对策系统中双边控制器的优化,一个为协 态神经网络,用于对2点边值问题中的协态变量进行求解,协态网络的输出对控制网络进行 校正,训练以后的2个控制网络作为双边的反馈控制器在线应用.并将神经网络结果与采用 Chebyshev技术的微分对策数字解进行了对比.追逃微分对策仿真结果表明了该方法的有效 性,并且对初始条件和测量噪声具有较强的鲁棒性.Abstract: An adaptive critic structure including three neural networks was developed to so lve the two point boundary value problem of differential games.Two control neur al networks were used to optimize the controllers on two sides of the differenti al games,and a co-state neural network was used to approximate the co-state v ariables in Hamiltonian function.The output of co-state network was used to correct the output of the control networks,and the two convergent control netw o rks can be used as feedback controllers on two sides of the differential games s ystem repectively.The solution of differential games based on neural networks w a s compared with the one based on Chebyshev technique.The simulation results of the pursing-escaping differential games show that the neural network controller s are consistent with the optimal solution and present good robustness with resp ect to the initial conditions and measuring noises.
-
Key words:
- neural networks /
- differential games /
- guided missile guidance
-
[1] 周卿吉,许 诚,周文松. 微分对策制导律研究现状及展望[J]. 系统工程与电子技术,1 997,19(11):40~45 Zhou Qingji, Xu Cheng, Zhou Wensong, et al.Status and prospects of the devel opme nt of differential games guidance law[J].System Engineering and Electronic Tec hnology,1997,19(11):40~45(in Chinese) [2] Rahbar N,Bahrami M. A new neuro-based solution for closed loop optimal guidance with terminal constraints. AIAA-99-4068,1999 [3] Han D C,Balakrishnan S N. Adaptive critic-based neural networks for agi le missile control[J]. Journal of Guidance,Control and Dynamics,2002,25(2) :404~407 [4] Bryson A E,Ho Y C. Applied optimal control[M]. Washington D C:Halste d Press,1975 [5] Fahroo F,Ross I M. Direct trajectory optimization by a Chebyshev pseudos pectral method[J]. Journal of Guidance,Control and Dynamics,2002,25(1):160 ~166 [6] Hagan M T,Menhaj M B. Training feedforward networks with the Marquardt a lgorithm[J]. IEEE Trans on Neural Networks,1994,5(6):989~993
点击查看大图
计量
- 文章访问数: 1411
- HTML全文浏览量: 132
- PDF下载量: 12
- 被引次数: 0