留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

基于多源图像融合的无人机航拍舰船目标识别方法

姜杰 闫文君 刘凯 张立民

姜杰,闫文君,刘凯,等. 基于多源图像融合的无人机航拍舰船目标识别方法[J]. 北京航空航天大学学报,2026,52(6):1955-1964
引用本文: 姜杰,闫文君,刘凯,等. 基于多源图像融合的无人机航拍舰船目标识别方法[J]. 北京航空航天大学学报,2026,52(6):1955-1964
JIANG J,YAN W J,LIU K,et al. Ship target recognition method based on multi-source image fusion for unmanned aerial vehicle aerial photography[J]. Journal of Beijing University of Aeronautics and Astronautics,2026,52(6):1955-1964 (in Chinese)
Citation: JIANG J,YAN W J,LIU K,et al. Ship target recognition method based on multi-source image fusion for unmanned aerial vehicle aerial photography[J]. Journal of Beijing University of Aeronautics and Astronautics,2026,52(6):1955-1964 (in Chinese)

基于多源图像融合的无人机航拍舰船目标识别方法

doi: 10.13700/j.bh.1001-5965.2024.0289
基金项目: 

国家自然科学基金(62371465);山东省青创团队(2022kj084);山东省自然科学基金(ZR2020QF010)

详细信息
    通讯作者:

    E-mail:iamzlm@hotmail.com

  • 中图分类号: V391.4

Ship target recognition method based on multi-source image fusion for unmanned aerial vehicle aerial photography

Funds: 

National Natural Science Foundation of China (62371465); Shandong Province Youth Innovation Team(2022kj084); Shandong Provincial Natural Science Foundation(ZR2020QF010)

More Information
  • 摘要:

    针对无人机航拍采集到的多源舰船图像,提出一种多源舰船图像融合识别方法,面对现实场景中存在诸多干扰,采用像素级融合方式,通过对红外、可见光舰船图像进行融合,再进行目标识别,相比于特征级识别方法,既可以降低网络对样本的依赖,又可以增强方法的可解释性。所提方法聚焦于解决因传感器参数不同存在的像素偏移情况,为避免传统图像配准容易产生的畸变、伪影等问题,将图像配准转化为端到端下的特征对齐,提出一种多源舰船图像融合识别网络,网络共由交叉调制特征提取模块、特征动态对齐模块、多粒度特征细化模块和金字塔特征融合模块组成,可以充分融合不同模态图像的特征和纹理细节,有效提升对舰船目标的识别性能。经实验验证:所提方法对多源图像的融合效果好、可解释性强,对舰船目标的识别准确度高、鲁棒性强。

     

  • 图 1  MSIFR方法流程

    Figure 1.  MSIFR method flow

    图 2  网络模型

    Figure 2.  Network model

    图 3  CMFEM架构

    Figure 3.  Architecture of CMFEM

    图 4  DKG架构

    Figure 4.  Architecture of DKG

    图 5  CMF架构示意图

    Figure 5.  Architecture of CMF

    图 6  LFR架构

    Figure 6.  Architecture of LFR

    图 7  PFFM架构

    Figure 7.  Architecture of PFFM

    图 8  多源舰船数据集架构

    Figure 8.  Architecture of multi-source ship dataset

    图 9  混淆矩阵结果

    Figure 9.  Confusion matrix results

    图 10  可视化分析

    Figure 10.  Visual analysis

    表  1  舰船类别统计

    Table  1.   Ship category statistics

    类型 数目
    军舰 517
    帆船 7642
    邮轮 436
    散货船 678
    渔船 4251
    游艇 21578
    集装箱船 863
    下载: 导出CSV

    表  2  消融实验结果

    Table  2.   Ablation experimental results

    模型 MI $ Q_{{\mathrm{AB}}/{\mathrm{F}}} $ SSIM EN
    Baseline 1.242 0.356 1.052 5.945
    +CM 1.565 0.376 1.095 6.031
    +FDAM 1.605 0.403 1.241 6.156
    +CMF 1.689 0.467 1.264 6.347
    +PFFM 1.897 0.478 1.279 6.779
    +LFR 1.920 0.513 1.341 7.056
    下载: 导出CSV

    表  3  融合性能对比

    Table  3.   Comparison of fusion performance

    方法CEMI$ Q_{{\mathrm{AB}}/{\mathrm{F}}} $$ Q_{{\mathrm{CB}}} $SSIM$ Q_{{\mathrm{CV}}} $SDEN
    DDcGAN0.9511.0230.4110.4911.1081125.248.087.500
    FusionGAN2.0411.3440.2480.4331.1701149.123.786.892
    NestFuse0.9101.9160.4970.5271.304573.241.347.024
    PMG10.9781.2300.4260.4841.251610.632.136.745
    SDNet1.0760.9970.4340.5171.227902.425.306.868
    U2Fusion0.5911.0660.4460.5531.307632.722.566.456
    GTF0.7511.3540.3520.4371.1701561.725.466.347
    SwinFusion0.4721.8950.4810.5541.225553.138.136.779
    本文方法0.4541.9200.5130.5611.341516.641.387.056
    下载: 导出CSV

    表  4  识别性能对比

    Table  4.   Comparison of recognition performance

    图像 识别准确度/%
    VGG16 ResNet50 DenseNet201 Inception-
    v4
    SqueezeNet-v1.1 Xception AlexNet
    融合图像 0.923 0.931 0.932 0.930 0.927 0.925 0.928
    可见光
    图像
    0.865 0.870 0.871 0.867 0.864 0.862 0.868
    红外图像 0.848 0.853 0.844 0.846 0.841 0.837 0.839
    下载: 导出CSV
  • [1] 江波, 屈若锟, 李彦冬, 等. 基于深度学习的无人机航拍目标检测研究综述[J]. 航空学报, 2021, 42(4): 524519.

    JIANG B, QU R K, LI Y D, et al. Object detection in UAV imagery based on deep learning: review[J]. Acta Aeronautica et Astronautica Sinica, 2021, 42(4): 524519(in Chinese).
    [2] 苑玉彬, 吴一全, 赵朗月, 等. 基于深度学习的无人机航拍视频多目标检测与跟踪研究进展[J]. 航空学报, 2023, 44(18): 28334.

    YUAN Y B, WU Y Q, ZHAO L Y, et al. Research progress of UAV aerial video multi-object detection and tracking based on deep learning[J]. Acta Aeronautica et Astronautica Sinica, 2023, 44(18): 28334(in Chinese).
    [3] 何友, 熊伟, 刘俊, 等. 海上信息感知与融合研究进展及展望[J]. 火力与指挥控制, 2018, 43(6): 1-10.

    HE Y, XIONG W, LIU J, et al. Review and prospect of research on maritime information perception and fusion[J]. Fire Control & Command Control, 2018, 43(6): 1-10(in Chinese).
    [4] ZHANG P F, LI T R, WANG G Q, et al. Multi-source information fusion based on rough set theory: a review[J]. Information Fusion, 2021, 68: 85-117.
    [5] 何友, 王国宏, 关欣, 等. 信息融合理论及应用[M].北京: 电子工业出版社, 2010.

    HE Y, WANG G H, GUAN X, et al. Information fusion theory and applications [M]. Beijing: Publishing House of Electronics Industry, 2010.
    [6] QIU X H, LI M, ZHANG L Q, et al. Deep convolutional feature fusion model for multispectral maritime imagery ship recognition[J]. Journal of Computer and Communications, 2020, 8(11): 23-43.
    [7] LIU J M, CHEN H, WANG Y. Multi-source remote sensing image fusion for ship target detection and recognition[J]. Remote Sensing, 2021, 13(23): 4852.
    [8] TANG L F, YUAN J T, ZHANG H, et al. PIAFusion: a progressive infrared and visible image fusion network based on illumination aware[J]. Information Fusion, 2022, 83-84: 79-92.
    [9] LI Y S, WEI F Y, ZHANG Y J, et al. HS2P: hierarchical spectral and structure-preserving fusion network for multimodal remote sensing image cloud and shadow removal[J]. Information Fusion, 2023, 94: 215-228.
    [10] ZHANG Y, XIAO Q L, DENG X Y, et al. A multi-source information fusion method for ship target recognition based on Bayesian inference and evidence theory[J]. Journal of Intelligent & Fuzzy, 2022, 42(3): 2331-2346.
    [11] LI M J, DONG Y B, WANG X L. Pixel level image fusion based the wavelet transform[C]//Proceedings of the 6th International Congress on Image and Signal Processing. Piscataway: IEEE Press, 2014: 995-999.
    [12] HAN X Y, LV T, SONG X Y, et al. An adaptive two-scale image fusion of visible and infrared images[J]. IEEE Access, 2019, 7: 56341-56352.
    [13] YOU T T, TANG Y. Visual saliency detection based on adaptive fusion of color and texture features[C]//Proceedings of the 3rd IEEE International Conference on Computer and Communications. Piscataway: IEEE Press, 2018: 2034-2039.
    [14] 杨曦, 张鑫, 郭浩远, 等. 基于不变特征的多源遥感图像舰船目标检测算法[J]. 电子学报, 2022, 50(4): 887-899.

    YANG X, ZHANG X, GUO H Y, et al. Invariant features based ship detection model for multi-source remote sensing images[J]. Acta Electronica Sinica, 2022, 50(4): 887-899(in Chinese).
    [15] WANG A L, JIANG J N, ZHANG H Y. Multi-sensor image decision level fusion detection algorithm based on D-S evidence theory[C]//Proceedings of the Fourth International Conference on Instrumentation and Measurement, Computer, Communication and Control. Piscataway: IEEE Press, 2014: 620-623.
    [16] PAUL P P, GAVRILOVA M L, ALHAJJ R. Decision fusion for multimodal biometrics using social network analysis[J]. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2014, 44(11): 1522-1533.
    [17] ZHANG T W, ZHANG X L, LI J W, et al. SAR ship detection dataset (SSDD): official release and comprehensive data analysis[J]. Remote Sensing, 2021, 13(18): 3690.
    [18] REN Y M, YANG J, ZHANG Q N, et al. Ship recognition based on Hu invariant moments and convolutional neural network for video surveillance[J]. Multimedia Tools and Applications, 2021, 80(1): 1343-1373.
    [19] ER M J, ZHANG Y N, CHEN J, et al. Ship detection with deep learning: a survey[J]. Artificial Intelligence Review, 2023, 56(10): 11825-11865.
    [20] SALI S M, MANISHA N L, KING G, et al. A review on object detection algorithms for ship detection[C]//Proceedings of the 7th International Conference on Advanced Computing and Communication Systems. Piscataway: IEEE Press, 2021: 1-5.
    [21] MA J Y, JIANG J J, ZHOU H B, et al. Guided locality preserving feature matching for remote sensing image registration[J]. IEEE Transactions on Geoscience and Remote Sensing, 2018, 56(8): 4435-4447.
    [22] AYMERICH F, STASZEWSKI W J. Experimental study of impact-damage detection in composite laminates using a cross-modulation vibro-acoustic technique[J]. Structural Health Monitoring, 2010, 9(6): 541-553.
    [23] KLUS S, NÜSKE F, HAMZI B. Kernel-based approximation of the Koopman generator and Schrödinger operator[J]. Entropy, 2020, 22(7): 722.
    [24] QIN Z Q, ZHANG P Y, WU F, et al. FcaNet: frequency channel attention networks[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE Press, 2022: 763-772.
    [25] LI J X, GUO X B, LU G M, et al. DRPL: deep regression pair learning for multi-focus image fusion[J]. IEEE Transactions on Image Processing, 2020, 29: 4816-4831.
    [26] LI J Y, ZHA S, CHEN C, et al. Attention guided global enhancement and local refinement network for semantic segmentation[J]. IEEE Transactions on Image Processing, 2022, 31: 3211-3223.
    [27] JOHNSON J, ALAHI A, LI F F. Perceptual losses for real-time style transfer and super-resolution[C]//Proceedings of the Computer Vision – ECCV. Berlin: Springer, 2016: 694-711.
    [28] SONG H H, XU W J, LIU D, et al. Multi-stage feature fusion network for video super-resolution[J]. IEEE Transactions on Image Processing, 2021, 30: 2923-2934.
    [29] SIMONYAN K, ZISSERMAN A. Very deep convolutional networks for large-scale image recognition[EB/OL]. (2015-04-10)[2024-03-01]. https://arxiv.org/abs/1409.1556.
    [30] RUSSAKOVSKY O, DENG J, SU H, et al. ImageNet large scale visual recognition challenge[J]. International Journal of Computer Vision, 2015, 115(3): 211-252.
  • 加载中
图(10) / 表(4)
计量
  • 文章访问数:  21
  • HTML全文浏览量:  5
  • PDF下载量:  11
  • 被引次数: 0
出版历程
  • 收稿日期:  2024-05-07
  • 录用日期:  2024-06-21
  • 网络出版日期:  2024-08-15
  • 整期出版日期:  2026-06-30

目录

    /

    返回文章
    返回
    常见问答