留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

基于可见光图像引导和递归融合的红外图像超分辨率重建

张艳 孙明磊 刘紫阳 孙叶美 刘树东

张艳,孙明磊,刘紫阳,等. 基于可见光图像引导和递归融合的红外图像超分辨率重建[J]. 北京航空航天大学学报,2025,51(11):3662-3673 doi: 10.13700/j.bh.1001-5965.2023.0590
引用本文: 张艳,孙明磊,刘紫阳,等. 基于可见光图像引导和递归融合的红外图像超分辨率重建[J]. 北京航空航天大学学报,2025,51(11):3662-3673 doi: 10.13700/j.bh.1001-5965.2023.0590
ZHANG Y,SUN M L,LIU Z Y,et al. Infrared image super-resolution reconstruction based on visible image guidance and recursive fusion[J]. Journal of Beijing University of Aeronautics and Astronautics,2025,51(11):3662-3673 (in Chinese) doi: 10.13700/j.bh.1001-5965.2023.0590
Citation: ZHANG Y,SUN M L,LIU Z Y,et al. Infrared image super-resolution reconstruction based on visible image guidance and recursive fusion[J]. Journal of Beijing University of Aeronautics and Astronautics,2025,51(11):3662-3673 (in Chinese) doi: 10.13700/j.bh.1001-5965.2023.0590

基于可见光图像引导和递归融合的红外图像超分辨率重建

doi: 10.13700/j.bh.1001-5965.2023.0590
基金项目: 

天津市科技计划(22YDTPJC00840)

详细信息
    通讯作者:

    E-mail:sunyemei1216@163.com

  • 中图分类号: TP391.41

Infrared image super-resolution reconstruction based on visible image guidance and recursive fusion

Funds: 

Tianjin Scientific and Technological Projects (22YDTPJC00840)

More Information
  • 摘要:

    由于硬件设备的局限,红外图像在获取时普遍存在分辨率低、细节模糊等问题。尽管可见光图像能为红外图像的超分辨率重建提供有效指导,但因两者成像原理不同,图像细节信息的差异易导致重建出现模糊和鬼影等问题。基于此,提出一种基于可见光图像引导和递归融合的红外图像超分辨率重建网络。在该网络中,设计流动傅里叶残差模块,利用不同深度的模块提取可见光图像和红外图像中不同频率的信息,使每个模块关注适当的频率信息。同时,利用混合注意力模块,从通道和空间角度获取多模态图像中的细节信息,并以互补的方式进行融合,有助于消除伪影的生成。在此基础上,设计全局递归融合分支,考虑多层特征之间的相关性,自适应地融合多层特征,生成更加清晰的高分辨率红外图像。实验结果表明:所提方法与Deep-ISTA、PAG-SR等其他方法相比,在客观评价指标上表现出了较好的水平,在主观视觉比较方面,重建图像具有更清晰的纹理和更少的伪影,复杂环境中具有更佳的物体区分度。

     

  • 图 1  不同图像的二维离散傅里叶变换频谱图及幅度相减绝对值

    Figure 1.  2D discrete Fourier transform spectra of different images and absolute values of magnitude subtraction

    图 2  VGRFSR整体网络结构

    Figure 2.  Overall network structure of VGRFSR

    图 3  流动傅里叶残差模块结构

    Figure 3.  FFRM structure

    图 4  傅里叶残差卷积结构

    Figure 4.  Fourier residual convolution structure

    图 5  混合注意力模块结构

    Figure 5.  MAM structure

    图 6  全局融合模块结构

    Figure 6.  GFM structure

    图 7  各模型的视觉效果对比

    Figure 7.  Visual comparison of different models

    图 8  本文方法与其他方法的视觉效果对比(尺度因子为4)

    Figure 8.  Visual comparison between the proposed algorithm and other methods (scale factor is 4)

    图 9  本文方法与其他方法的视觉效果对比(尺度因子为8)

    Figure 9.  Visual comparison between the proposed algorithm and other methods (scale factor is 8)

    图 10  红外图像对应二维离散傅里叶变换的大小及有关幅度值差距

    Figure 10.  2D discrete Fourier transform magnitudes of infrared images and absolute values of magnitude subtraction

    表  1  消融实验结果

    Table  1.   Ablation experiments results

    方法 FFRM MAM GFM PSNR/dB SSIM LPIPS
    Model 1 × × × 30.916 0.918 0.189
    Model 2 × × 31.032 0.921 0.189
    Model 3 × 31.039 0.920 0.189
    Model 4 × 31.038 0.920 0.194
    Model 5 31.048 0.921 0.189
    下载: 导出CSV

    表  2  本文方法在不同损失函数下的定量对比

    Table  2.   Quantitative comparison of the proposed algorithm with different loss functions

    损失函数 PSNR/dB SSIM LPIPS
    $ {L_{\rm{total}}} = {L_1} $ 30.58 0.919 0.268
    $ {L_{\rm{total}}} = {L_1} + {L_{\rm{FR}}} $ 30.66 0.920 0.283
    下载: 导出CSV

    表  3  不同方法的客观评价指标对比

    Table  3.   Comparison of objective evaluation metrics with different algorithms

    方法是否引导PSNR/dBSSIMLPIPS
    尺度因子为4尺度因子为8尺度因子为4尺度因子为8尺度因子为4尺度因子为8
    RDN[14]非引导29.2826.800.9060.8330.2820.389
    RCAN[33]非引导29.1822.350.9080.7580.2280.414
    SAN[34]非引导26.4725.380.8590.8110.2290.536
    TGV2-L2[35]引导28.7726.420.8920.8210.4220.399
    FBS[36]引导25.4825.030.7870.7700.3870.476
    Joint-BU[37]引导27.7725.610.8740.8030.2840.406
    Infrared SR[23]引导28.2126.030.8890.8170.4050.521
    SDF[38]引导28.7026.720.8750.8190.3210.363
    MSF-SR[13]引导29.2127.920.9010.8350.2000.249
    MSG-Net[39]引导29.4627.290.8970.8270.1840.296
    PixTransform[40]引导24.8423.310.7870.8360.3290.371
    Deep-ISTA[41]引导25.8625.560.8280.7780.5290.598
    PAG-SR[24]引导29.5628.770.9120.9190.1470.214
    本文引导31.0530.660.9210.9200.1890.283
    下载: 导出CSV
  • [1] GOLDBERG A C, FISCHER T, DERZKO Z I. Application of dual-band infrared focal plane arrays to tactical and strategic military problems[C]//Proceedings of the Infrared Technology and Applications XXVIII. Bellingham: SPIE, 2003.
    [2] 熊光明, 罗震, 孙冬, 等. 基于红外相机和毫米波雷达融合的烟雾遮挡目标检测与跟踪技术[J]. 兵工学报, 2024, 45(3): 893-906.

    XIONG G M, LUO Z, SUN D, et al. Detection and tracking technology of smoke occlusion target based on infrared camera and millimeter wave radar fusion[J]. Acta Armamentarii, 2024, 45(3): 893-906(in Chinese).
    [3] QI H, DIAKIDES N A. Thermal infrared imaging in early breast cancer detection-a survey of recent research[C]//Proceedings of the 25th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. Piscataway: IEEE Press, 2004: 1109-1112.
    [4] 张倩, 杨颖, 刘刚, 等. 融合数据增强与改进ResNet34的奶牛热红外图像乳腺炎检测[J]. 光谱学与光谱分析, 2023, 43(1): 280-288. doi: 10.3964/j.issn.1000-0593(2023)01-0280-09

    ZHANG Q, YANG Y, LIU G, et al. Detection of dairy cow mastitis from thermal images by data enhancement and improved ResNet34[J]. Spectroscopy and Spectral Analysis, 2023, 43(1): 280-288(in Chinese). doi: 10.3964/j.issn.1000-0593(2023)01-0280-09
    [5] 侯义锋, 丁畅, 刘海, 等. 逆光海况下低质量红外目标的增强与识别[J]. 光学学报, 2023, 43(6): 0612003. doi: 10.3788/AOS221387

    HOU Y F, DING C, LIU H, et al. Enhancement and recognition of infrared target with low quality under backlight maritime condition[J]. Acta Optica Sinica, 2023, 43(6): 0612003(in Chinese). doi: 10.3788/AOS221387
    [6] 翁静, 袁盼, 王铭赫, 等. 基于支持向量机的泄漏气体云团热成像检测方法[J]. 光学学报, 2022, 42(9): 0911002. doi: 10.3788/AOS202242.0911002

    WENG J, YUAN P, WANG M H, et al. Thermal imaging detection method of leak gas clouds based on support vector machine[J]. Acta Optica Sinica, 2022, 42(9): 0911002(in Chinese). doi: 10.3788/AOS202242.0911002
    [7] XIONG K N, JIANG J B, PAN Y Y, et al. Deep learning approach for detection of underground natural gas micro-leakage using infrared thermal images[J]. Sensors, 2022, 22(14): 5322. doi: 10.3390/s22145322
    [8] ZHANG L, WU X L. An edge-guided image interpolation algorithm via directional filtering and data fusion[J]. IEEE Transactions on Image Processing, 2006, 15(8): 2226-2238. doi: 10.1109/TIP.2006.877407
    [9] PAPYAN V, ELAD M. Multi-scale patch-based image restoration[J]. IEEE Transactions on Image Processing, 2016, 25(1): 249-261. doi: 10.1109/TIP.2015.2499698
    [10] ZHANG K, VAN GOOL L, TIMOFTE R. Deep unfolding network for image super-resolution[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2020: 3214-3223.
    [11] SUN Y M, ZHANG Y, LIU S D, et al. Image super-resolution using supervised multi-scale feature extraction network[J]. Multimedia Tools and Applications, 2021, 80(2): 1995-2008. doi: 10.1007/s11042-020-09488-z
    [12] ZHANG X D, ZENG H, ZHANG L. Edge-oriented convolution block for real-time super resolution on mobile devices[C]//Proceedings of the 29th ACM International Conference on Multimedia. New York: ACM, 2021: 4034-4043.
    [13] ALMASRI F, DEBEIR O. Multimodal sensor fusion in single thermal image super-resolution[C]//Proceedings of the 14th Asian Conference on Computer Vision. Berlin: Springer, 2019: 418-433.
    [14] ZHANG Y L, TIAN Y P, KONG Y, et al. Residual dense network for image super-resolution[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2018: 2472-2481.
    [15] ZHANG Y, LIU Z Y, LIU S D, et al. Frequency aggregation network for blind super-resolution based on degradation representation[J]. Digital Signal Processing, 2023, 133: 103837. doi: 10.1016/j.dsp.2022.103837
    [16] KIM J, LEE J K, LEE K M. Deeply-recursive convolutional network for image super-resolution[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2016: 1637-1645.
    [17] LIM B, SON S, KIM H, et al. Enhanced deep residual networks for single image super-resolution[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops. Piscataway: IEEE Press, 2017: 1132-1140.
    [18] ZHANG Y, XU F, SUN Y, et al. Spatial and frequency information fusion transformer for image super-resolution[J]. Neural Networks, 2025, 187: 107351.
    [19] QIU Y J, WANG R X, TAO D P, et al. Embedded block residual network: a recursive restoration model for single-image super-resolution[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE Press, 2019: 4179-4188.
    [20] CHOI Y, KIM N, HWANG S, et al. Thermal image enhancement using convolutional neural network[C]//Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems. Piscataway:IEEE Press, 2016: 223-230.
    [21] ALMASRI F, DEBEIR O. Multimodal sensor fusion in single thermal image super-resolution[C]//Proceedings of the Asian Conference on Computer Vision. Berlin: Springer, 2018: 418-433.
    [22] LEE K, LEE J, LEE J, et al. Brightness-based convolutional neural network for thermal image enhancement[J]. IEEE Access, 2017, 5: 26867-26879. doi: 10.1109/ACCESS.2017.2769687
    [23] HAN T Y, KIM Y J, SONG B C. Convolutional neural network-based infrared image super resolution under low light environment[C]//Proceedings of the 25th European Signal Processing Conference. Piscataway: IEEE Press, 2017: 803-807.
    [24] GUPTA H, MITRA K. Pyramidal edge-maps and attention based guided thermal super-resolution[C]//Proceedings of the European Conference on Computer Vision. Berlin: Springer, 2020: 698-715.
    [25] MAO X, LIU Y, SHEN W, et al. Deep residual fourier transformation for single image deblurring[EB/OL]. (2022-11-29)[2023-09-01]. https://arxiv.org/abs/2111.11745.
    [26] BRIGHAM E O, MORROW R E. The fast Fourier transform[J]. IEEE Spectrum, 1967, 4(12): 63-70. doi: 10.1109/MSPEC.1967.5217220
    [27] CAO J M, LI Y Y, SUN M C, et al. DO-Conv: depthwise over-parameterized convolutional layer[J]. IEEE Transactions on Image Processing, 2022, 31: 3726-3736. doi: 10.1109/TIP.2022.3175432
    [28] JI S W, XU W, YANG M, et al. 3D convolutional neural networks for human action recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013, 35(1): 221-231. doi: 10.1109/TPAMI.2012.59
    [29] NIU B, WEN W, REN W, et al. Single image super-resolution via a holistic attention network[C]//Proceedings of the 16th European Conference on Computer Vision. Berlin: Springer, 2020: 191-207.
    [30] ZHANG R, ISOLA P, EFROS A A, et al. The unreasonable effectiveness of deep features as a perceptual metric[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2018: 586-595.
    [31] LEWIS J. FLIR releases machine learning thermal dataset for advanced driver assistance systems[J]. Vision Systems Design, 2018, 23(9): 1-10.
    [32] ZHANG K, ZUO W M, ZHANG L. Learning a single convolutional super-resolution network for multiple degradations[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2018: 3262-3271.
    [33] ZHANG Y L, LI K P, LI K, et al. Image super-resolution using very deep residual channel attention networks[C]//Proceedings of the European Conference on Computer Vision. Berlin: Springer, 2018: 294-310.
    [34] DAI T, CAI J, ZHANG Y, et al. Second-order attention network for single image super-resolution[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2019: 11065-11074.
    [35] FERSTL D, REINBACHER C, RANFTL R, et al. Image guided depth upsampling using anisotropic total generalized variation[C]//Proceedings of the IEEE International Conference on Computer Vision. Piscataway: IEEE Press, 2013: 993-1000.
    [36] BARRON J T, POOLE B. The fast bilateral solver[C]//Proceedings of the 14th European Conference on Computer Vision. Berlin: Springer, 2016: 617-632.
    [37] KOPF J, COHEN M F, LISCHINSKI D, et al. Joint bilateral upsampling[J]. ACM Transactions on Graphics, 2007, 26(99): 96. doi: 10.1145/1276377.1276497
    [38] HAM B, CHO M, PONCE J. Robust image filtering using joint static and dynamic guidance[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2015: 4823-4831.
    [39] HUI T W, LOY C C, TANG X O. Depth map super-resolution by deep multi-scale guidance[C]//Proceedings of the 14th European Conference on Computer Vision. Berlin: Springer, 2016: 353-369.
    [40] DE LUTIO R, D’ARONCO S, WEGNER J D, et al. Guided super-resolution as pixel-to-pixel transformation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. Piscataway: IEEE Press, 2019: 8828-8836.
    [41] DENG X, DRAGOTTI P L. Deep coupled ISTA network for multi-modal image super-resolution[J]. IEEE Transactions on Image Processing, 2019, 29: 1683-1698.
  • 加载中
图(10) / 表(3)
计量
  • 文章访问数:  45
  • HTML全文浏览量:  11
  • PDF下载量:  5
  • 被引次数: 0
出版历程
  • 收稿日期:  2023-09-15
  • 录用日期:  2024-01-09
  • 网络出版日期:  2024-02-06
  • 整期出版日期:  2025-11-25

目录

    /

    返回文章
    返回
    常见问答