留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

基于CGAN的避扰通信决策网络离线式训练方法

江民民 李大朋 邱昕 慕福奇 柴旭荣 孙志浩

江民民, 李大朋, 邱昕, 等 . 基于CGAN的避扰通信决策网络离线式训练方法[J]. 北京航空航天大学学报, 2020, 46(7): 1412-1421. doi: 10.13700/j.bh.1001-5965.2019.0448
引用本文: 江民民, 李大朋, 邱昕, 等 . 基于CGAN的避扰通信决策网络离线式训练方法[J]. 北京航空航天大学学报, 2020, 46(7): 1412-1421. doi: 10.13700/j.bh.1001-5965.2019.0448
JIANG Minmin, LI Dapeng, QIU Xin, et al. An offline training method using CGAN for anti-jamming communication decision network[J]. Journal of Beijing University of Aeronautics and Astronautics, 2020, 46(7): 1412-1421. doi: 10.13700/j.bh.1001-5965.2019.0448(in Chinese)
Citation: JIANG Minmin, LI Dapeng, QIU Xin, et al. An offline training method using CGAN for anti-jamming communication decision network[J]. Journal of Beijing University of Aeronautics and Astronautics, 2020, 46(7): 1412-1421. doi: 10.13700/j.bh.1001-5965.2019.0448(in Chinese)

基于CGAN的避扰通信决策网络离线式训练方法

doi: 10.13700/j.bh.1001-5965.2019.0448
详细信息
    作者简介:

    江民民  男, 硕士研究生。主要研究方向:人工智能、认知无线电

    李大朋 男, 博士, 副研究员。主要研究方向:数字信号处理

    邱昕 男, 博士, 研究员。主要研究方向:无线通信系统设计、通信信号处理技术

    慕福奇  男, 研究员, 博士生导师。主要研究方向:无线通信系统与技术、物联网传输与应用

    柴旭荣  男, 硕士, 高级工程师。主要研究方向:无线通信系统与技术、通信信号处理技术

    孙志浩  男, 硕士研究生。主要研究方向:数字信号处理

    通讯作者:

    李大朋, E-mail: insanegtp@sina.cn

  • 中图分类号: TN974;TP181

An offline training method using CGAN for anti-jamming communication decision network

More Information
  • 摘要:

    基于强化学习的避扰通信,由于需要不断地与环境交互从中学习到最优决策,其决策网络的训练时间受环境反馈速率的约束,通常耗时严重。针对这一问题,提出了一种离线式训练方法。构建出一种频谱虚拟环境生成器,可以快速生成大量的逼真合成频谱瀑布图,用于避扰通信决策网络训练。由于所提方法脱离真实环境反馈,形成离线式训练,进而显著提高模型训练效率。实验结果表明:与实时在线训练方法比较,所提离线式训练方法的训练时间可以减少50%以上。

     

  • 图 1  ADRLA训练过程图[3]

    Figure 1.  Training process of ADRLA[3]

    图 2  离线式快速避扰通信模型训练框架

    Figure 2.  An offline fast model training framework for anti-jamming communication

    图 3  产生频谱虚拟环境生成器的细节

    Figure 3.  Details of making spectrum virtual environment generator

    图 4  真实的SW图和相应的标注图

    Figure 4.  Real SW image and corresponding labeled image

    图 5  pix2pix中生成器和判别器功能作用

    Figure 5.  Functions of generator and discriminator in pix2pix

    图 6  增强后的pix2pix生成的合成SW图和原始pix2pix生成的合成SW图对比

    Figure 6.  Comparison between synthesis SW image generated by enhanced pix2pix and synthesis SW image generated by original pix2pix

    图 7  合成SW图和真实SW图

    Figure 7.  Synthesis SW image and real SW image

    图 8  条件图和合成SW图

    Figure 8.  Condition image and synthesis SW image

    图 9  扫频干扰模式的条件图和合成SW图

    Figure 9.  Condition image of sweeping jamming and corresponding synthesis SW image

    图 10  t时刻总体得分变化

    Figure 10.  Total reward variation of time t

    图 11  实验环境

    Figure 11.  Experimental environment

    图 12  在真实环境下的验证

    Figure 12.  Validation in real environment

    图 13  真实环境下本文方法和ADRLA的耗时对比

    Figure 13.  Comparison of time consumption between proposed method and ADRLA for real environment

    表  1  本文方法和ADRLA耗时对比

    Table  1.   Comparison of time consumption between proposed method and ADRLA

    计算机性能 参数 TOFFLINE_ADRLA/min TADRLA/min
    低性能(CPU) Toffline_init =0.5s,
    TCGAN=0.4s,
    Tonline_init=40s,
    Tsample=0.8s,
    TDQN=0.1s,
    k=1000
    8.3 15.7
    中等性能(1块GPU) Toffline_init=0.3s,
    TCGAN=0.2s,
    Tonline_init=40s,
    Tsample=0.8s,
    TDQN=0.05s,
    k=1000
    4.2 14.8
    高性能(4块GPU) Toffline_init=0.1s,
    TCGAN=0.1s,
    Tonline_init=40s,
    Tsample=0.8s,
    TDQN=0.01s,
    k=1000
    1.8 14.2
    下载: 导出CSV
  • [1] DUAN R J, JIA L L, GUO P C.Research on spectrum allocation of HF access network based on intelligent frequency hopping[C]//8th International Symposium on Computational Intelligence and Design(ISCID).Piscataway: IEEE Press, 2015, 2: 295-300.
    [2] ZHANG L Y, GUAN Z Y, MELODIA T.United against the enemy:Anti-jamming based on cross-layer cooperation in wireless networks[J].IEEE Transactions on Wireless Communications, 2016, 15(8):5733-5747. doi: 10.1109/TWC.2016.2569083
    [3] LIU X, XU Y H, JIA L L, et al.Anti-jamming communications using spectrum waterfall:A deep reinforcement learning approach[J].IEEE Communications Letters, 2018, 22(5):998-1001. doi: 10.1109/LCOMM.2018.2815018
    [4] LIU Y, XU Y H, CHENG Y P, et al.A heterogeneous information fusion deep reinforcement learning for intelligent frequency selection of HF communication[J].China Communications, 2018, 15(9):73-84. doi: 10.1109/CC.2018.8456453
    [5] RIYAZ S, SANKLE K, LOANNIDIS S, et al.Deep learning convolutional neural networks for radio identification[J].IEEE Communications Magazine, 2018, 56(9):146-152. doi: 10.1109/MCOM.2018.1800153
    [6] XIAO L, CHEN T H, LIU J L, et al.Anti-jamming transmission Stackelberg game with observation errors[J].IEEE Communications Letters, 2015, 19(6):949-952. doi: 10.1109/LCOMM.2015.2418776
    [7] ERPEK T, SAGDUYU Y E, ERPEK T, et al.Adversarial deep learning for cognitive radio security: Jamming attack and defense strategies[C]//IEEE International Conference on Communications Workshops (ICC Workshops).Piscataway: IEEE Press, 2018, 5: 1-6.
    [8] WUNSCH F, PAOSANA F, RAJENDRAN S, et al.DySPAN spectrum challenge:Situational awareness and opportunistic spectrum access benchmarked[J].IEEE Transactions on Cognitive Communications and Networking, 2017, 3(3):550-562. doi: 10.1109/TCCN.2017.2745682
    [9] MIRZA M, OSINDERO S.Conditional generative adversarial nets[J].Computer Science, 2014, 27(8):2672-2680. http://d.old.wanfangdata.com.cn/Periodical/jsjfzsjytxxxb202006007
    [10] MINH V, KAVUKCUOGLU K, SILVER D, et al.Playing atari with deep reinforcement learning[EB/OL].(2014-11-06)[2019-08-15].https://arxiv.org/abs/1411.1784.
    [11] MNIH V, KAVUKCUOGLU K, SILVER D, et al.Human-level control through deep reinforcement learning[J].Nature, 2015, 518(7540):529-533. doi: 10.1038/nature14236
    [12] WANG J F, LI X, YANG J, et al.Stacked conditional generative adversarial networks for jointly learning shadow detection and shadow removal[EB/OL].(2017-12-07)[2019-08-15].https://arxiv.org/abs/1712.02478.
    [13] ODENA A, OLAH C, SHLENS J.Conditional image synthesis with auxiliary classifier GANs[C]//Proceedings of the 34th International Conference on Machine Learning.Piscataway: IEEE Press, 2017, 70: 2642-2651.
    [14] ISOLA P, ZHU J Y, ZHOU T H, et al.Image-to-image translation with conditional adversarial networks[C]//IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Piscataway: IEEE Press, 2017: 5967-5976.
    [15] MARTIN A, LÉON B.Towards principled methods for training generative adversarial networks[EB/OL].(2017-01-07)[2019-08-15].https://arxiv.org/abs/1701.04862.
    [16] MARTIN A, CHINTALA S, LÉON B.Wasserstein GAN[EB/OL].(2017-12-06)[2019-08-15].https://arxiv.org/abs/1701.07875.
  • 加载中
图(13) / 表(1)
计量
  • 文章访问数:  617
  • HTML全文浏览量:  102
  • PDF下载量:  91
  • 被引次数: 0
出版历程
  • 收稿日期:  2019-08-16
  • 录用日期:  2020-01-18
  • 网络出版日期:  2020-07-20

目录

    /

    返回文章
    返回
    常见问答