A lightweight multi-target real-time detection model

QIU Bo; LIU Xiang; SHI Yunyu; SHANG Yanfeng

doi:10.13700/j.bh.1001-5965.2020.0066

Volume 46 Issue 9

Sep. 2020

Turn off MathJax

Article Contents

Journal of Beijing University of Aeronautics and Astronautics > 2020 > 46(9): 1778-1785.

QIU Bo, LIU Xiang, SHI Yunyu, et al. A lightweight multi-target real-time detection model[J]. Journal of Beijing University of Aeronautics and Astronautics, 2020, 46(9): 1778-1785. doi: 10.13700/j.bh.1001-5965.2020.0066(in Chinese)

Citation:

QIU Bo, LIU Xiang, SHI Yunyu, et al. A lightweight multi-target real-time detection model[J]. Journal of Beijing University of Aeronautics and Astronautics, 2020, 46(9): 1778-1785. doi: 10.13700/j.bh.1001-5965.2020.0066(in Chinese)

Citation:

PDF( 4333 KB)

A lightweight multi-target real-time detection model

doi: 10.13700/j.bh.1001-5965.2020.0066

1.
School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
2.
Internet of Things Technology R & D Center, The Third Research Institute of the Ministry of Public Security, Shanghai 200031, China

Funds:

National Key R & D Program of China 2016YFC0801304

Shanghai Science and Technology Innovation Action Plan in Hi-tech Field 17511106803

More Information

Corresponding author: LIU Xiang, E-mail:xliu@sues.edu.cn
Received Date: 02 Mar 2020
Accepted Date: 18 Apr 2020
Publish Date: 20 Sep 2020

Abstract

Abstract

For the public security monitoring system, a lightweight multi-target real-time detection algorithm is proposed in order to realize the accurate intelligence of the content analysis and improve the actual service ability. First, the multi-fusion gradient cascade structure of CBNet is added based on CenterNet detection network, which effectively solves the problem of insufficient feature extraction capability of the backbone network in daily monitoring videos. Second, the number of parameters is reduced through the model pruning and compression, which can speed up the analysis speed of monitoring videos. During the experiments, the dataset for training and testing consists of a part of COCO datasets and a number of field data collected by ourselves. The ablation experiments are conducted with other mainstream detection algorithms (YOLO, Faster-RCNN, SSD, etc.). The experimental results show that the presented model can effectively balance the speed and precision in the analysis of monitoring videos for public security and has stronger universality.
- target detection,
- deep learning,
- model compression,
- model distillation,
- cascade fusion

FullText(HTML)

References(19)

References

[1]	KALIA R, LEE K D, SAMIR B V R, et al.An analysis of the effect of different image preprocessing techniques on the performance of SURF: Speeded up robust features[C]//Workshop on Frontiers of Computer Vision.Piscataway: IEEE Press, 2011: 1-6.
[2]	LOWE D G.Distinctive image features from scale-invariant keypoints[J].International Journal of Computer Vision, 2004, 60(2):91-110. doi: 10.1023/B:VISI.0000029664.99615.94
[3]	MUNRO S, THOMAS K L, ABU-SHAAR M.Molecular characterization of a peripheral receptor for cannabinoids[J].Nature, 1993, 365(6441):61-65. doi: 10.1038/365061a0
[4]	PLATT J C.A fast algorithm for training support vector machines[J].Journal of Information Technology, 1998, 2(5):1-28. http://www.researchgate.net/publication/242613062_A_fast_algorithm_for_training_support_vector_machines
[5]	FREUND Y, SCHAPIRE R E.A decision-theoretic generalization of on-line learning and an application to boosting[C]//Proceedings of the 2nd European Conference on Computational Learning Theory.Berlin: Springer, 1995: 22-37. https://www.researchgate.net/publication/225540813_Lecture_Notes_in_Computer_Science
[6]	GIRSHICK R, DONAHUE J, DARRELL T, et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Piscataway: IEEE Press, 2014: 580-587.
[7]	GIRSHICK R.Fast-RCNN[C]//Proceedings of 2015 IEEE In-ternational Conference on Computer Vision.Piscataway: IEEE Press, 2015: 10-15.
[8]	REN S, HE K, GIRSHICK R, et al.Faster R-CNN: Towards real-time object detection with region proposal networks[C]//Proceedings of the 28th International Conference on Neural Information Processing Systems.Cambridge: MIT Press, 2015: 1-15.
[9]	REDMON J, FARHADI A.YOLO9000: Better, faster, stronger[EB/OL].(2016-12-25)[2020-02-27].https://arxiv.org/abs/1612.08242.
[10]	LIU W, ANGUELOV D, ERHAN D, et al.SSD: Single shot multibox detector[C]//Proceedings of 2016 European Conference on Computer Vision and Pattern Recognition.Berlin: Springer, 2016: 13-17. https://www.researchgate.net/publication/286513835_SSD_Single_Shot_MultiBox_Detector
[11]	LAW H, DENG J.CornerNet:Detecting objects as paired keypoints[J].International Journal of Computer Vision, 2018, 128:642-656. doi: 10.1007/s11263-019-01204-1
[12]	KONG T, SUN F, LIU H, et al.FoveaBox: Beyond anchor-based object detector[EB/OL].(2019-04-08)[2020-02-27].https://arxiv.org/abs/1904.03797.
[13]	ZHOU X, WANG D, KRÄHENBVHL P.Objects as points[EB/OL].(2019-04-16)[2020-02-27].https://arxiv.org/abs/1904.07850.
[14]	HE K M, ZHANG X Y, REN S Q.Deep residual learning for image recognition[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Piscataway: IEEE Press, 2016: 770-778.
[15]	NEWELL A, YANG K, JIA D.Stacked hourglass networks for human pose estimation[EB/OL].(2016-03-22)[2020-02-27].https://arxiv.org/abs/1603.06937.
[16]	LIU Y, WANG Y, WANG S, et al.CBNet: A novel composite backbone network architecture for object detection[EB/OL].(2019-09-09)[2020-02-27].https://arxiv.org/abs/1909.03625.
[17]	CHOLLET F.Xception: Deep learning with depthwise separable convolutions[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Piscataway: IEEE Press, 2017: 1800-1807.
[18]	TAN M, PANG R, LE Q V.EfficientDet: Scalable and efficient object detection[EB/OL].(2019-11-20)[2020-02-27].https://arxiv.org/abs/1911.09070.
[19]	HE Y, ZHANG X Y, SUN J.Channel pruning for accelerating very deep neural networks[EB/OL].(2017-08-21)[2020-02-27].https://arxiv.org/abs/1707.06168.

Relative Articles

Supplements(0)

Cited By

Proportional views

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(7) / Tables(3)

Get Citation

PDF

XML

Article Metrics

Article views(565) PDF downloads(83)

A lightweight multi-target real-time detection model

doi: 10.13700/j.bh.1001-5965.2020.0066

Abstract

References

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

A lightweight multi-target real-time detection model

doi: 10.13700/j.bh.1001-5965.2020.0066

Abstract

References

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Export File

Citation

Format

Content