Citation: | SUN Yibo, ZHANG Wenjing, WANG Rong, et al. Pedestrian re-identification method based on channel attention mechanism[J]. Journal of Beijing University of Aeronautics and Astronautics, 2022, 48(5): 881-889. doi: 10.13700/j.bh.1001-5965.2020.0684(in Chinese) |
To address the problem of insufficient expression of pedestrian characteristics, we propose a pedestrian re-identification method based on channel attention mechanism. The channel attention mechanism named SE module is embedded in the backbone network ResNet50 to weight and strengthen the key feature information. The dynamic activation function is used to dynamically adjust the parameters of ReLU according to the input characteristics, and enhance the nonlinear expression ability of the network model. The gradient centralization algorithm is introduced into the Adam optimizer to improve the training speed and generalization ability of the network model. Experiments on the three mainstream datasets: Market1501, DukeMTMC-ReID and CUHK03 show that Rank-1 is increased by 2.17%, 2.38%, and 3.50% respectively, and mAP is increased by 3.07%, 3.39%, and 4.14% respectively. The results indicate that our approach can extract more robust pedestrian expression features and achieve higher recognition accuracy.
[1] |
YE M, SHEN J, LIN G, et al. Deep learning for person re-identification: A survey and outlook[J/OL]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021(2021-01-26)[2021-02-01]. https://ieeexplore.ieee.org/document/9336268.
|
[2] |
ZHENG Z, ZHENG L, YANG Y. A discriminatively learned CNN embedding for person reidentification[J]. ACM Transactions on Multimedia Computing, Communications, and Applications, 2017, 14(1): 1-20.
|
[3] |
WANG F, ZUO W, LIN L, et al. Joint learning of single-image and cross-image representations for person re-identification[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2016: 1288-1296.
|
[4] |
SUN Y, ZHENG L, YANG Y, et al. Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline)[C]//European Conference on Computer Vision. Berlin: Springer, 2018: 480-496.
|
[5] |
ZHAO H, TIAN M, SUN S, et al. Spindle Net: Person re-identification with human body region guided feature decomposition and fusion[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2017: 1077-1085.
|
[6] |
MIAO J, WU Y, LIU P, et al. Pose-guided feature alignment for occluded person re-identification[C]//Proceedings of the IEEE International Conference on Computer Vision. Piscataway: IEEE Press, 2019: 542-551.
|
[7] |
LIN Y, ZHENG L, ZHENG Z, et al. Improving person re-identification by attribute and identity learning[J]. Pattern Recognition, 2019, 95: 151-161. doi: 10.1016/j.patcog.2019.06.006
|
[8] |
FAN X, LUO H, ZHANG X, et al. SCPNet: Spatial-channel parallelism network for joint holistic and partial person re-identification[C]//Asian Conference on Computer Vision. Berlin: Springer, 2018: 19-34.
|
[9] |
HERMANS A, BEYER L, LEIBE B. In defense of the triplet loss for person re-identification[EB/OL]. (2017-11-21)[2020-12-01]. https://arxiv.org/abs/1703.07737.
|
[10] |
HU J, SHEN L, SUN G. Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2018: 7132-7141.
|
[11] |
CHEN Y, DAI X, LIU M, et al. Dynamic ReLU[C]//European Conference on Computer Vision. Berlin: Springer, 2020: 351-367.
|
[12] |
IOFFE S, SZEGEDY C. Batch normalization: Accelerating deep network training by reducing internal covariate shift[C]//International Conference on Machine Learning. New York: ACM, 2015: 448-456.
|
[13] |
QIAO S, WANG H, LIU C, et al. Weight standardization[EB/OL]. (2020-08-09)[2020-12-01]. https://arxiv.org/abs/1903.10520.
|
[14] |
YONG H, HUANG J, HUA X, et al. Gradient centralization: A new optimization technique for deep neural networks[C]//European Conference on Computer Vision. Berlin: Springer, 2020: 635-652.
|
[15] |
ZHENG L, SHEN L, TIAN L, et al. Scalable person re-identification: A benchmark[C]//Proceedings of the IEEE International Conference on Computer Vision. Piscataway: IEEE Press, 2015: 1116-1124.
|
[16] |
ZHENG Z, ZHENG L, YANG Y. Unlabeled samples generated by GAN improve the person re-identification baseline in vitro[C]// Proceedings of the IEEE International Conference on Computer Vision. Piscataway: IEEE Press, 2017: 3754-3762.
|
[17] |
LI W, ZHAO R, XIAO T, et al. DeepReID: Deep filter pairing neural network for person re-identification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2014: 152-159.
|
[18] |
HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2016: 770-778.
|
[19] |
CHEN Y C, ZHENG W S, LAI J H, et al. An asymmetric distance model for cross-view feature mapping in person reidentification[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2016, 27(8): 1661-1675.
|
[20] |
SUN Y, ZHENG L, DENG W, et al. Svdnet for pedestrian retrieval[C]//Proceedings of the IEEE International Conference on Computer Vision. Piscataway: IEEE Press, 2017: 3800-3808.
|
[21] |
ZHENG Z, ZHENG L, YANG Y. Pedestrian alignment network for large-scale person re-identification[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2018, 29(10): 3037-3045.
|
[22] |
SU C, LI J, ZHANG S, et al. Pose-driven deep convolutional model for person re-identification[C]//Proceedings of the IEEE International Conference on Computer Vision. Piscataway: IEEE Press, 2017: 3960-3969.
|
[23] |
XU J, ZHAO R, ZHU F, et al. Attention-aware compositional network for person re-identification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2018: 2119-2128.
|
[24] |
WEI L, ZHANG S, YAO H, et al. GLAD: Global-local-alignment descriptor for pedestrian retrieval[C]//Proceedings of the 25th ACM International Conference on Multimedia. New York: ACM, 2017: 420-428.
|
[25] |
LI W, ZHU X, GONG S. Harmonious attention network for person re-identification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE Press, 2018: 2285-2294.
|