Citation: LIU Fang, SUN Yanan, WANG Hongjuan, et al. Adaptive UAV target tracking algorithm based on residual learning[J]. Journal of Beijing University of Aeronautics and Astronautics, 2020, 46(10): 1874-1882. doi: 10.13700/j.bh.1001-5965.2019.0551 (in Chinese)
UAVs are widely used in military and civilian applications, and target tracking is one of the key technologies for UAV applications. To address the scale change and occlusion that targets are prone to during UAV tracking, an adaptive UAV video target tracking algorithm based on residual learning is proposed. First, a deep network that combines the advantages of residual learning and dilated convolution is constructed to extract target features and overcome network degradation. Second, the extracted features are fed into the kernel correlation filtering algorithm, and a positioning filter is constructed to determine the center position of the target. Finally, adaptive segmentation is performed according to the appearance characteristics of the target, and the scaling coefficient of the target scale is calculated. Simulation results show that the algorithm effectively handles the influence of scale change and occlusion on tracking performance and achieves a higher tracking success rate and accuracy than the comparison algorithms.
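The abstract does not give the layer configuration of the feature network, so the following is only a minimal sketch of the two ideas it combines: a dilated convolution (which widens the receptive field without adding weights) wrapped in a residual connection (output = input + learned residual). The function names `dilated_conv1d` and `residual_block`, the 1-D single-channel setting, and the ReLU activation are assumptions made for illustration, not the authors' architecture.

```python
def dilated_conv1d(x, w, dilation=1):
    """Same-size 1-D cross-correlation with zero padding.

    Kernel taps are spaced `dilation` samples apart, so a 3-tap kernel
    with dilation 2 covers a span of 5 input samples.
    """
    n, k = len(x), len(w)
    center = k // 2
    out = []
    for i in range(n):
        acc = 0.0
        for j in range(k):
            idx = i + (j - center) * dilation
            if 0 <= idx < n:  # positions outside the signal count as zero
                acc += w[j] * x[idx]
        out.append(acc)
    return out


def relu(values):
    """Element-wise rectified linear unit."""
    return [max(0.0, v) for v in values]


def residual_block(x, w, dilation=1):
    """Illustrative residual block: y = x + ReLU(dilated_conv(x)).

    The skip connection means the block only has to learn a residual;
    with all-zero weights it reduces to the identity mapping.
    """
    residual = relu(dilated_conv1d(x, w, dilation))
    return [a + b for a, b in zip(x, residual)]


if __name__ == "__main__":
    impulse = [0, 0, 0, 1, 0, 0, 0]
    # A unit impulse shows which output positions "see" a given input.
    print(dilated_conv1d(impulse, [1, 1, 1], dilation=2))
    # Zero weights leave the input unchanged.
    print(residual_block([1.0, -2.0, 3.0], [0, 0, 0], dilation=2))
```

The zero-weight case illustrates why residual connections help against degradation in deep networks: an extra block can always fall back to the identity, so adding depth should not make the representable mapping worse.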