Target detection method for region proposal network with variable anchor box

LI Chenghao; RU Le; HE Linyuan; CHI Wensheng

doi:10.13700/j.bh.1001-5965.2019.0531

Volume 46 Issue 8

Aug. 2020

Turn off MathJax

Article Contents

Journal of Beijing University of Aeronautics and Astronautics > 2020 > 46(8): 1610-1617.

LI Chenghao, RU Le, HE Linyuan, et al. Target detection method for region proposal network with variable anchor box[J]. Journal of Beijing University of Aeronautics and Astronautics, 2020, 46(8): 1610-1617. doi: 10.13700/j.bh.1001-5965.2019.0531(in Chinese)

Citation:

LI Chenghao, RU Le, HE Linyuan, et al. Target detection method for region proposal network with variable anchor box[J]. Journal of Beijing University of Aeronautics and Astronautics, 2020, 46(8): 1610-1617. doi: 10.13700/j.bh.1001-5965.2019.0531(in Chinese)

Citation:

PDF( 3612 KB)

Target detection method for region proposal network with variable anchor box

doi: 10.13700/j.bh.1001-5965.2019.0531

LI Chenghao¹,
RU Le^{2
,
,},
HE Linyuan¹,
CHI Wensheng^{2, 3}

1.
Aeronautics Engineering Institute, Air Force Engineering University, Xi'an 710038, China
2.
Equipment Management and Unmanned Aerial Vehicle Engineering College, Air Force Engineering University, Xi'an 710038, China
3.
School of Mechano-Electronic Engineering, Xidian University, Xi'an 710126, China

Funds:

National Natural Science Foundation of China 61701524

China Postdoctoral Science Foundation 2019M653742

Natural Science Basic Research Program of Shaanxi 2017JM6071

More Information

Corresponding author: RU Le. E-mail:ru-le@163.com
Received Date: 29 Sep 2019
Accepted Date: 01 Nov 2019
Publish Date: 20 Aug 2020

Abstract

Abstract

Object detection is a hot topic in the field of computer vision. At present, object detection methods based on deep learning can be divided into two categories: two-stage detection and one-stage detection. The former has higher accuracy, while the latter has better speed. In order to improve the performance of detection, anchor mechanism is introduced in both categories. In this paper, Attention Guidance (AG) module is introduced in the two-stage detection method based on the deep convolutional neural network, which guides the anchor mechanism of Region Proposal Network (RPN), making the selection of preselected box shape more diversified. At the same time, to solve the problem of false detection and missed detection in the traditional post-processing Non-Maximum Suppression (NMS) algorithm, a Confidence factor NMS (Cf-NMS) method is proposed, which makes a great contribution to the overall performance of the model. Experiment results showed that, although it has a slight decrease in speed performance, the proposed method has an improvement in accuracy in both the RPN variant and the existing advanced method.
- computer vision,
- object detection,
- deep learning,
- Region Proposal Network (RPN),
- Non-Maximum Suppression (NMS)

FullText(HTML)

References(19)

References

[1]	方路平, 何杭江, 周国民.目标检测算法研究综述[J].计算机工程与应用, 2018, 54(13):11-18. doi: 10.3778/j.issn.1002-8331.1804-0167 FANG L P, HE H J, ZHOU G M.Research overview of object detection methods[J].Computer Engineering and Applications, 2018, 54(13):11-18(in Chinese). doi: 10.3778/j.issn.1002-8331.1804-0167
[2]	刘栋, 李素, 曹志冬.深度学习及其在图像物体分类与检测中的应用综述[J].计算机科学, 2016, 43(12):13-23. doi: 10.11896/j.issn.1002-137X.2016.12.003 LIU D, LI S, CAO Z D.State-of-the-art on deep learning and its application in image object classification and detection[J].Computer Science, 2016, 43(12):13-23(in Chinese). doi: 10.11896/j.issn.1002-137X.2016.12.003
[3]	REDMON J, DIVVALA S, GIRSHICK R, et al.You only look once: Unified, real-time object detection[C]//IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Piscataway: IEEE Press, 2016: 779-788.
[4]	LIU W, ANGUELOV D, ERHAN D, et al.SSD: Single shot multibox detector[C]//European Conference on Computer Vision.Berlin: Springer, 2016: 21-37.
[5]	CHEN C, LIU M Y, TUZEL O, et al.R-CNN for small object detection[C]//Asian Conference on Computer Vision.Berlin: Springer, 2016: 214-230
[6]	GIRSHICK R.Fast R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision.Piscataway: IEEE Press, 2015: 1440-1448.
[7]	REN S, HE K, GIRSHICK R, et al.Faster R-CNN:Towards real-time object detection with region proposal networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031
[8]	CAI Z, VASCONCELOS N.Cascade R-CNN: Delving into high quality object detection[C]//IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Piscataway: IEEE Press, 2018: 6154-6162.
[9]	HE K, GKIOXARI G, DOLLÁR P, et al.Mask R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision.Piscataway: IEEE Press, 2017: 2961-2969.
[10]	LIN T Y, DOLLÁR P, GIRSHICK R, et al.Feature pyramid networks for object detection[C]//IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Piscataway: IEEE Press, 2017: 2117-2125.
[11]	YAN Q, GONG D, SHI Q, et al.Attention-guided network for ghost-free high dynamic range imaging[EB/OL].(2019-04-23)[2019-09-20].https://arxiv.org/abs/1904.10293.
[12]	VELIČKOVIĆ P, CUCURULL G, CASANOVA A, et al.Graph attention networks[EB/OL].(2018-02-04)[2019-09-20].https://arxiv.org/abs/1710.10903.
[13]	GUAN Q, HUANG Y, ZHONG Z, et al.Diagnose like a radiologist: Attention guided convolutional neural network for thorax disease classification[EB/OL].(2018-01-30)[2019-09-20].https://arxiv.org/abs/1801.09927.
[14]	LIN T Y, MAIRE M, BELONGIE S, et al.Microsoft COCO: Common objects in context[C]//European Conference on Computer Vision.Berlin: Springer, 2014: 740-755.
[15]	DALAL N, TRIGGS B.Histograms of oriented gradients for human detection[C]//IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Piscataway: IEEE Press, 2005: 886-893.
[16]	LOWE D G.Distinctive image features from scale-invariant keypoints[J].International Journal of Computer Vision, 2004, 60(2):91-110. doi: 10.1023/B:VISI.0000029664.99615.94
[17]	张强, 张陈斌, 陈宗海.一种改进约束条件的简化非极大值抑制[J].中国科学技术大学学报, 2016, 46(1):6-11. http://www.cqvip.com/QK/94257X/201601/669806218.html ZHANG Q, ZHANG C B, CHEN Z H.A simplified non-maximum suppression with improved constraints[J].Journal University of Science and Technology of China, 2016, 46(1):6-11(in Chinese). http://www.cqvip.com/QK/94257X/201601/669806218.html
[18]	赵文清, 严海, 邵绪强.改进的非极大值抑制算法的目标检测[J].中国图象图形学报, 2018, 23(11):1676-1685. doi: 10.11834/jig.180275 ZHAO W Q, YAN H, SHAO X Q.Object detection based on improved non-maximums suppression algorithm[J].Journal of Image and Graphics, 2018, 23(11):1676-1685(in Chinese). doi: 10.11834/jig.180275
[19]	KRIZHEVSKY A, SUTSKEVER I, HINTON G E.ImageNet classification with deep convolutional neural networks[C]//Advances in Neural Information Processing Systems.Red Hook: Curran Associates Inc., 2012: 1097-1105.

Relative Articles

Supplements(0)

Cited By

Proportional views

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(4) / Tables(3)

Get Citation

PDF

XML

Article Metrics

Article views(681) PDF downloads(184)

Target detection method for region proposal network with variable anchor box

doi: 10.13700/j.bh.1001-5965.2019.0531

Abstract

References

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Target detection method for region proposal network with variable anchor box

doi: 10.13700/j.bh.1001-5965.2019.0531

Abstract

References

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Export File

Citation

Format

Content