Citation: | QIU Bo, LIU Xiang, SHI Yunyu, et al. A lightweight multi-target real-time detection model[J]. Journal of Beijing University of Aeronautics and Astronautics, 2020, 46(9): 1778-1785. doi: 10.13700/j.bh.1001-5965.2020.0066(in Chinese) |
For the public security monitoring system, a lightweight multi-target real-time detection algorithm is proposed in order to realize the accurate intelligence of the content analysis and improve the actual service ability. First, the multi-fusion gradient cascade structure of CBNet is added based on CenterNet detection network, which effectively solves the problem of insufficient feature extraction capability of the backbone network in daily monitoring videos. Second, the number of parameters is reduced through the model pruning and compression, which can speed up the analysis speed of monitoring videos. During the experiments, the dataset for training and testing consists of a part of COCO datasets and a number of field data collected by ourselves. The ablation experiments are conducted with other mainstream detection algorithms (YOLO, Faster-RCNN, SSD, etc.). The experimental results show that the presented model can effectively balance the speed and precision in the analysis of monitoring videos for public security and has stronger universality.
[1] |
KALIA R, LEE K D, SAMIR B V R, et al.An analysis of the effect of different image preprocessing techniques on the performance of SURF: Speeded up robust features[C]//Workshop on Frontiers of Computer Vision.Piscataway: IEEE Press, 2011: 1-6.
|
[2] |
LOWE D G.Distinctive image features from scale-invariant keypoints[J].International Journal of Computer Vision, 2004, 60(2):91-110. doi: 10.1023/B:VISI.0000029664.99615.94
|
[3] |
MUNRO S, THOMAS K L, ABU-SHAAR M.Molecular characterization of a peripheral receptor for cannabinoids[J].Nature, 1993, 365(6441):61-65. doi: 10.1038/365061a0
|
[4] |
PLATT J C.A fast algorithm for training support vector machines[J].Journal of Information Technology, 1998, 2(5):1-28. http://www.researchgate.net/publication/242613062_A_fast_algorithm_for_training_support_vector_machines
|
[5] |
FREUND Y, SCHAPIRE R E.A decision-theoretic generalization of on-line learning and an application to boosting[C]//Proceedings of the 2nd European Conference on Computational Learning Theory.Berlin: Springer, 1995: 22-37. https://www.researchgate.net/publication/225540813_Lecture_Notes_in_Computer_Science
|
[6] |
GIRSHICK R, DONAHUE J, DARRELL T, et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Piscataway: IEEE Press, 2014: 580-587.
|
[7] |
GIRSHICK R.Fast-RCNN[C]//Proceedings of 2015 IEEE In-ternational Conference on Computer Vision.Piscataway: IEEE Press, 2015: 10-15.
|
[8] |
REN S, HE K, GIRSHICK R, et al.Faster R-CNN: Towards real-time object detection with region proposal networks[C]//Proceedings of the 28th International Conference on Neural Information Processing Systems.Cambridge: MIT Press, 2015: 1-15.
|
[9] |
REDMON J, FARHADI A.YOLO9000: Better, faster, stronger[EB/OL].(2016-12-25)[2020-02-27].https://arxiv.org/abs/1612.08242.
|
[10] |
LIU W, ANGUELOV D, ERHAN D, et al.SSD: Single shot multibox detector[C]//Proceedings of 2016 European Conference on Computer Vision and Pattern Recognition.Berlin: Springer, 2016: 13-17. https://www.researchgate.net/publication/286513835_SSD_Single_Shot_MultiBox_Detector
|
[11] |
LAW H, DENG J.CornerNet:Detecting objects as paired keypoints[J].International Journal of Computer Vision, 2018, 128:642-656. doi: 10.1007/s11263-019-01204-1
|
[12] |
KONG T, SUN F, LIU H, et al.FoveaBox: Beyond anchor-based object detector[EB/OL].(2019-04-08)[2020-02-27].https://arxiv.org/abs/1904.03797.
|
[13] |
ZHOU X, WANG D, KRÄHENBVHL P.Objects as points[EB/OL].(2019-04-16)[2020-02-27].https://arxiv.org/abs/1904.07850.
|
[14] |
HE K M, ZHANG X Y, REN S Q.Deep residual learning for image recognition[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Piscataway: IEEE Press, 2016: 770-778.
|
[15] |
NEWELL A, YANG K, JIA D.Stacked hourglass networks for human pose estimation[EB/OL].(2016-03-22)[2020-02-27].https://arxiv.org/abs/1603.06937.
|
[16] |
LIU Y, WANG Y, WANG S, et al.CBNet: A novel composite backbone network architecture for object detection[EB/OL].(2019-09-09)[2020-02-27].https://arxiv.org/abs/1909.03625.
|
[17] |
CHOLLET F.Xception: Deep learning with depthwise separable convolutions[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Piscataway: IEEE Press, 2017: 1800-1807.
|
[18] |
TAN M, PANG R, LE Q V.EfficientDet: Scalable and efficient object detection[EB/OL].(2019-11-20)[2020-02-27].https://arxiv.org/abs/1911.09070.
|
[19] |
HE Y, ZHANG X Y, SUN J.Channel pruning for accelerating very deep neural networks[EB/OL].(2017-08-21)[2020-02-27].https://arxiv.org/abs/1707.06168.
|