Human action recognition based on locality-constrained linear coding

BAI Chen; SUN Junhua

doi:10.13700/j.bh.1001-5965.2014.0414

Volume 41 Issue 6

Jun. 2015

Turn off MathJax

Article Contents

Abstract

References

Journal of Beijing University of Aeronautics and Astronautics > 2015 > 41(6): 1122-1127.

WU Li-cheng, LU Zhen, YU Shou-qian, et al. Novel Neural Networks Control Approach of Flexible Manipulators[J]. Journal of Beijing University of Aeronautics and Astronautics, 2002, 28(3): 316-318. (in Chinese)

Citation:

BAI Chen, SUN Junhua. Human action recognition based on locality-constrained linear coding[J]. Journal of Beijing University of Aeronautics and Astronautics, 2015, 41(6): 1122-1127. doi: 10.13700/j.bh.1001-5965.2014.0414(in Chinese)

Citation:

PDF( 2096 KB)

Human action recognition based on locality-constrained linear coding

doi: 10.13700/j.bh.1001-5965.2014.0414

BAI Chen,
SUN Junhua^,

School of Instrumentation Science and Opto-electronics Engineering, Beijing University of Aeronautics and Astronautics, Beijing 100191, China

Received Date: 10 Jul 2014
Publish Date: 20 Jun 2015

Abstract

Abstract

Large intra-class variations of action features lead to low classification accuracy of action recognition, on the other hand, current algorithms exist drawbacks in computational complexity and extension of recognizable action classes. A method based on locality-constrained linear coding (LLC) for action recognition from depth images was proposed. In order to reduce the intra-class variations and increase classification accuracy, joints' positions, velocities and acceleration features were concatenated to form local action features, then LLC was used to calculate sparse representations of local action features. Analytical solution of LLC ensures computational speed of our method is up to 760 frames per second. Dictionary is composed by sub-dictionaries learned by K-means from features of each class separately, so global optimization is avoided during extending recognizable action classes. Moreover, to avoid classifier to be over-fitting, a dimensionality reduction method based on labels of dictionary items was proposed. The proposed method was evaluated on MSR-Action3D dataset captured by depth cameras. The experimental results show that the proposed approach achieves classification accuracy of 85.7%.
- action recognition,
- locality-constrained linear coding,
- dictionary learning,
- temporal pyramid matching,
- depth images

FullText(HTML)

References(22)

References

[1]	郑韡, 沈旭昆.基于连续数据流的动态手势识别算法[J].北京航空航天大学学报, 2012, 38(2):273-279. Zheng W, Shen X K.Algorithm based on continuous data stream for dynamic gesture recognition[J].Journal of Beijing University of Aeronautics and Astronautics, 2012, 38(2):273-279(in Chinese).
[2]	史骏, 陈才扣.基于马氏距离的半监督鉴别分析及人脸识别[J].北京航空航天大学学报, 2011, 37(12):1589-1593. Shi J, Chen C K.Mahalanobis distance-based semi-supervised discriminant analysis for face recognition[J].Journal of Beijing University of Aeronautics and Astronautics, 2011, 37(12):1589-1593(in Chinese).
[3]	Weinland D, Ronfard R, Boyer E.A survey of vision-based methods for action representation, segmentation and recognition[J].Computer Vision and Image Understanding, 2011, 115(2):224-241.
[4]	Shotton J, Girshick R, Fitzgibbon A, et al.Efficient human pose estimation from single depth images[J].IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013, 35(12):2821-2840.
[5]	Jhuang H, Gall J, Zuffi S, et al.Towards understanding action recognition[C]//Proceedings of IEEE International Conference on Computer Vision (ICCV).Piscataway, NJ:IEEE Press, 2013:3192-3199.
[6]	Xia L, Chen C C, Aggarwal J K.View invariant human action recognition using histograms of 3d joints[C]//Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops(CVPRW).Piscataway, NJ:IEEE Press, 2012:20-27.
[7]	Yang X, Tian Y L.Eigenjoints-based action recognition using naïve bayes nearest neighbor[C]//Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops(CVPRW).Piscataway, NJ:IEEE Press, 2012:14-19.
[8]	Wang J, Liu Z, Wu Y, et al.Mining actionlet ensemble for action recognition with depth cameras[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition(CVPR).Piscataway, NJ:IEEE Press, 2012:1290-1297.
[9]	Zanfir M, Leordeanu M, Sminchisescu C.The moving pose:An efficient 3D kinematics descriptor for low-latency action recognition and detection[C]//Proceedings of IEEE International Conference on Computer Vision(ICCV).Piscataway, NJ:IEEE Press, 2013:2752-2759.
[10]	Luo J, Wang W, Qi H.Group sparsity and geometry constrained dictionary learning for action recognition from depth maps[C]//Proceedings of IEEE International Conference on Computer Vision(ICCV).Piscataway, NJ:IEEE Press, 2013:1089-1816.
[11]	Wang J, Yang J, Yu K, et al.Locality-constrained linear coding for image classification[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition(CVPR).Piscataway, NJ:IEEE Press, 2010:3360-3367.
[12]	Yu K, Zhang T, Gong Y.Nonlinear learning using local coordinate coding[C]//Advances in Neural Information Processing Systems.La Jolla, CA:Neural Information Processing Systems Foundation, 2009:1-9.
[13]	Chang C C, Lin C J.LIBSVM:A library for support vector machines[J].ACM Transactions on Intelligent Systems and Technology(TIST), 2011, 2(3):27.
[14]	Martens J, Sutskever I.Learning recurrent neural networks with Hessian-free optimization[C]//Proceedings of the 28th International Conference on Machine Learning(ICML).New York:International Machine Learning Society(IMLS), 2011:1033-1040.
[15]	Müller M, Röder T.Motion templates for automatic classification and retrieval of motion capture data[C]//Proceedings of the ACM SIGGRAPH.New York:ACM, 2006:137-146.
[16]	Lv F, Nevatia R.Recognition and segmentation of 3-d human action using hmm and multi-class adaboost[C]//Proceedings of European Conference on Computer Vision(ECCV).Berlin, Heidelberg:Springer, 2006:359-372.
[17]	Morency L, Quattoni A, Darrell T.Latent-dynamic discriminative models for continuous gesture recognition[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Piscataway, NJ:IEEE Press, 2007:1-8.
[18]	Li W, Zhang Z, Liu Z.Action recognition based on a bag of 3d points[C]//Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops(CVPRW).Piscataway, NJ:IEEE Press, 2010:9-14.
[19]	Vieira A W, Nascimento E R, Oliveira G L, et al.Stop:space-time occupancy patterns for 3d action recognition from depth map sequences[C]//Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications.Berlin, Heidelberg:Springer, 2012:252-259.
[20]	Wang J, Liu Z, Chorowski J, et al.Robust 3d action recognition with random occupancy patterns[C]//Proceedings of European Conference on Computer Vision(ECCV).Berlin, Heidelberg:Springer, 2012:872-885.
[21]	Mairal J, Bach F, Ponce J, et al.Online dictionary learning for sparse coding[C]//Proceedings of the 26th Annual International Conference on Machine Learning.New York:ACM, 2009:689-696.
[22]	Lee H, Battle A, Raina R, et al.Efficient sparse coding algorithms[C]//Advances in Neural Information Processing Systems.La Jolla, CA:Neural Information Processing Systems Foundation, 2006:801-808.

Relative Articles

[1]	MA J，LI R Y，ZHANG Q Y，et al. Network time reliability evaluation method based on uncertainty theory[J]. Journal of Beijing University of Aeronautics and Astronautics，2025，51（4）：1267-1276 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2023.0191.
[2]	WANG D W，LIU W，FANG J，et al. Low illumination image enhancement algorithm for UAV aerial photography with color consistency[J]. Journal of Beijing University of Aeronautics and Astronautics，2025，51（4）：1096-1106 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2023.0172.
[3]	XIE Xiaoyan, ZHANG Heng, CHEN Yuxin. Micro-expression recognition method based on capsule network in coding domain[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2024.0841
[4]	TIAN S，WANG Z W，CAO X P，et al. Identification of pulsatile tinnitus and visualization of high pathogenic regions based on CT images[J]. Journal of Beijing University of Aeronautics and Astronautics，2025，51（2）：625-632 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2023.0074.
[5]	REN Liqiang, WANG Haipeng, PAN Xinlong, WAN Bing, TANG Tiantian. A complex maneuver recognition method based on wavelet time-frequency image and lightweight CNN-Transformer hybrid neural network[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2024.0745
[6]	HOU Z Q，MA J Y，HAN R X，et al. A fast long-term visual tracking algorithm based on deep learning[J]. Journal of Beijing University of Aeronautics and Astronautics，2024，50（8）：2391-2403 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2022.0645.
[7]	CHAI G Q，BO X S，LIU H J，et al. Self-supervised scene depth estimation for monocular images based on uncertainty[J]. Journal of Beijing University of Aeronautics and Astronautics，2024，50（12）：3780-3787 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2022.0943.
[8]	CHEN Hao, LIU Tong, ZHANG Yanxin. Satellite platform classification based on deep neural network using Photometric data[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2024.0319
[9]	LI H G，WANG Y F，YANG L C. Meta-learning-based few-shot object detection for remote sensing images[J]. Journal of Beijing University of Aeronautics and Astronautics，2024，50（8）：2503-2513 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2022.0637.
[10]	LI Bowen, LEI Xiaoyong. Flight task recognition and action segmentation based on SVM[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2024.0635
[11]	MA Jin-lin, HE Kang-kang, MA Zi-ping, OUYANG Ke. Weighted bidirectional feature pyramid fusion-based liver tumor detection method for CT images[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2023.0636
[12]	HU Jianping, GAO Zhipeng, MOU Yang, XIE Qi. Deep learning-based image watermarking combining attack classification and multi-channel embedding[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2024.0552
[13]	LI Y H，ZHU M Y，REN J，et al. Text-to-image synthesis based on modified deep convolutional generative adversarial network[J]. Journal of Beijing University of Aeronautics and Astronautics，2023，49（8）：1875-1883 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2021.0588.
[14]	ZHANG Yun-zuo, YANG Yue-hui, DONG Xu, KANG Wei-li, BAI Jing. Dual-stream Deep Network Infrared Gait Recognition Based on Residual Multi-scale Fusion[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2023.0605
[15]	SUN X T，CHENG W，CHEN W J，et al. A visual detection and grasping method based on deep learning[J]. Journal of Beijing University of Aeronautics and Astronautics，2023，49（10）：2635-2644 （in Chinese）. doi: 10.13700/j.bh.1001-5965.2022.0130.
[16]	WANG Zai-sheng, WANG Xiao-feng, SHEN Guo-dong, ZHANG Zeng-jie, QUAN Da-ying. Self-Supervised Learning for Community Detection Based on Deep Graph Convolutional Networks[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2023.0408
[17]	CHEN Yong, ZHOU Fang-chun, ZHANG Jiao-jiao. Railway panoramic segmentation based on recursive gating enhancement and pyramid prediction[J]. Journal of Beijing University of Aeronautics and Astronautics. doi: 10.13700/j.bh.1001-5965.2023.0492
[18]	XIONG Junyao, WANG Rong, SUN Yibo. Appearance and action adaptive target tracking method[J]. Journal of Beijing University of Aeronautics and Astronautics, 2022, 48(8): 1525-1533. doi: 10.13700/j.bh.1001-5965.2021.0597
[19]	SONG Yuqin, GAO Shijie, ZENG Hedong, XIONG Gaoqiang. Multi-scale depthwise separable convolution facial expression recognition embedded in attention mechanism[J]. Journal of Beijing University of Aeronautics and Astronautics, 2022, 48(12): 2381-2387. doi: 10.13700/j.bh.1001-5965.2021.0114
[20]	SU Kaiqi, YAN Weiqing, XU Jindong. 3D object detection based on multi-path feature pyramid network for stereo images[J]. Journal of Beijing University of Aeronautics and Astronautics, 2022, 48(8): 1487-1494. doi: 10.13700/j.bh.1001-5965.2021.0525

Supplements(0)

Cited By

Proportional views

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Get Citation

PDF

XML

Article Metrics

Article views(1200) PDF downloads(558)

Human action recognition based on locality-constrained linear coding

doi: 10.13700/j.bh.1001-5965.2014.0414

Abstract

References

Relative Articles

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Human action recognition based on locality-constrained linear coding

doi: 10.13700/j.bh.1001-5965.2014.0414

Abstract

References

Relative Articles

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Export File

Citation

Format

Content