
显著性引导的低光照人脸检测

李可夫 钟汇才 高兴宇 翁超群 陈振宇 李勇周 王师峥

引用本文: 李可夫, 钟汇才, 高兴宇, 等. 显著性引导的低光照人脸检测[J]. 北京航空航天大学学报, 2021, 47(3): 572-584. doi: 10.13700/j.bh.1001-5965.2020.0469
Citation: LI Kefu, ZHONG Huicai, GAO Xingyu, et al. Saliency guided low-light face detection[J]. Journal of Beijing University of Aeronautics and Astronautics, 2021, 47(3): 572-584. doi: 10.13700/j.bh.1001-5965.2020.0469 (in Chinese)


doi: 10.13700/j.bh.1001-5965.2020.0469
基金项目: 

国家自然科学基金 61702491

国家自然科学基金 61802390

北京市自然科学基金 4194095

茂名市科技计划 2020028

Detailed information
    About the authors:

    LI Kefu  Male, M.S. candidate. Main research interests: artificial intelligence and computer vision

    ZHONG Huicai  Male, Ph.D., research fellow. Main research interests: artificial intelligence

    GAO Xingyu  Male, Ph.D., research fellow. Main research interests: artificial intelligence

    Corresponding author:

    GAO Xingyu, E-mail: gaoxingyu@ime.ac.cn

  • CLC number: TP391

Saliency guided low-light face detection

Funds: 

National Natural Science Foundation of China 61702491

National Natural Science Foundation of China 61802390

Beijing Natural Science Foundation 4194095

Maoming Science and Technology Plan 2020028

  • Abstract:

    To address the difficulty that convolutional neural networks have in detecting faces in images captured in low-light environments, an algorithm combining image saliency detection with deep learning is proposed and applied to low-light face detection. The proposed algorithm fuses the saliency information of an image with its original RGB channels and uses the fused result for neural network training. Extensive experiments on the low-light face dataset DARK FACE show that the proposed method achieves better detection accuracy than current mainstream face detection algorithms, which verifies the effectiveness of the proposed algorithm.
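    The channel-fusion step described in the abstract (and sketched in Fig. 9) can be illustrated with a short example: the snippet below stacks a saliency map onto the original RGB channels to build a 4-channel training input. It is a minimal sketch assuming OpenCV-style image arrays; the function name fuse_rgb_with_saliency, the choice of OpenCV's spectral-residual saliency detector, and the 0-255 scaling are illustrative assumptions rather than the authors' exact implementation.

```python
import cv2
import numpy as np


def fuse_rgb_with_saliency(image_bgr: np.ndarray) -> np.ndarray:
    """Stack a single-channel saliency map onto a color image as a 4th channel.

    Illustrative sketch only: the saliency detector (OpenCV's spectral-residual
    method, which requires opencv-contrib-python) and the 0-255 scaling are
    assumptions, not the paper's exact pipeline.
    """
    # Compute a saliency map in [0, 1] with the same spatial size as the image.
    detector = cv2.saliency.StaticSaliencySpectralResidual_create()
    ok, saliency_map = detector.computeSaliency(image_bgr)
    if not ok:
        raise RuntimeError("Saliency computation failed")

    # Bring the saliency map into the same 0-255 uint8 range as the color channels.
    saliency_u8 = (saliency_map * 255).astype(np.uint8)

    # Concatenate along the channel axis: (H, W, 3) + (H, W) -> (H, W, 4).
    return np.dstack([image_bgr, saliency_u8])


# Usage sketch: a detector trained on this input (e.g., a RetinaNet variant) would
# need its first convolution widened from 3 to 4 input channels.
# img = cv2.imread("dark_face_example.png")    # hypothetical file name
# four_channel = fuse_rgb_with_saliency(img)   # shape (H, W, 4)
```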

     

  • 图 1  Retinex原理示意图

    Figure 1.  Schematic of Retinex principle

    图 2  RetinaNet网络结构

    Figure 2.  Network structure of RetinaNet

    图 3  WIDER FACE数据集图像示例

    Figure 3.  Image examples of WIDER FACE dataset

    图 4  DARK FACE数据集原图

    Figure 4.  Original images of DARK FACE dataset

    图 5  采用MSRCR算法的增强图像

    Figure 5.  Images enhanced using MSRCR algorithm

    图 6  DARK FACE数据集图像的显著图

    Figure 6.  Saliency maps of DARK FACE dataset images

    图 7  显著性引导增强的DARK FACE图像(r=0.1)

    Figure 7.  DARK FACE images enhanced using saliency guidance (r=0.1)

    图 8  显著性引导增强的DARK FACE图像(r=0.2)

    Figure 8.  DARK FACE images enhanced using saliency guidance (r=0.2)

    图 9  原图融合显著图四通道训练示意图

    Figure 9.  Schematic of training with a 4-channel input fused from the original image and its saliency map

    图 10  查准率-召回率曲线

    Figure 10.  Precision-recall curves

    图 11  检测结果可视化

    Figure 11.  Visualization of detection results

    表  1  DARK FACE数据集发布的检测精度(使用图像增强算法)

    Table  1.   Detection accuracies published with the DARK FACE dataset (with image enhancement)

    Detection algorithm  Accuracy
    DSFD+MF 0.414
    DSFD+MSRCR 0.413
    DSFD+LIME 0.403
    DSFD+BIMEF 0.402
    DSFD+Dehazing 0.365
    DSFD+RetinexNet 0.332
    DSFD+JED 0.179
    PyramidBox+MF 0.263
    PyramidBox+Dehazing 0.249
    PyramidBox+LIME 0.248
    PyramidBox+MSRCR 0.246
    PyramidBox+BIMEF 0.245
    PyramidBox+RetinexNet 0.207
    PyramidBox+JED 0.146

    表  2  DARK FACE数据集发布的检测精度(未使用图像增强算法)

    Table  2.   Detection accuracies published with the DARK FACE dataset (without image enhancement)

    Detection algorithm  Accuracy
    DSFD 0.153
    Faster R-CNN 0.017
    PyramidBox 0.132
    SSH 0.076

    表  3  CVPR 2019 UG2+国际竞赛发布的基准检测精度(使用图像增强算法)

    Table  3.   Benchmark accuracies published by the CVPR 2019 UG2+ competition (with image enhancement)

    Detection algorithm  Accuracy
    DSFD+MF 0.393
    DSFD+MSRCR 0.393
    DSFD+BIMEF 0.383
    DSFD+LIME 0.383
    DSFD+Dehazing 0.348
    DSFD+RetinexNet 0.316
    DSFD+JED 0.170
    PyramidBox+MF 0.251
    PyramidBox+Dehazing 0.237
    PyramidBox+LIME 0.237
    PyramidBox+MSRCR 0.235
    PyramidBox+BIMEF 0.234
    PyramidBox+RetinexNet 0.199
    PyramidBox+JED 0.138

    表  4  CVPR 2019 UG2+国际竞赛发布的基准检测精度(未使用图像增强算法)

    Table  4.   Benchmark accuracies published by the CVPR 2019 UG2+ competition (without image enhancement)

    Detection algorithm  Accuracy
    DSFD 0.136
    Faster R-CNN 0.125
    PyramidBox 0.069
    SSH 0.017

    表  5  本文实验模型检测精度

    Table  5.   Detection accuracy of experimental models from this paper

    Training method  Accuracy
    Trained on original DARK FACE images 0.504
    Trained on MSRCR-enhanced DARK FACE images 0.522
    Trained on saliency-guided enhanced DARK FACE images (r=0.1) 0.540
    Trained on saliency-guided enhanced DARK FACE images (r=0.2) 0.533
    Trained on 4-channel fusion of original DARK FACE images and saliency maps 0.560
Publication history
  • Received: 2020-08-27
  • Accepted: 2020-10-21
  • Published online: 2021-03-20
