[1]Sipiran I, Bustos B. Harris 3D: A robust extension of the Harris operator for interest point detection on 3D meshes[J]. The Visual Computer, 2011,27(11):963-976.〖HJ0.9mm〗
[2]Piotr D, Rabaud V, Cottrell G, et al. Behavior recognition via sparse spatio-temporal features[C]//IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance. 2005:65-72.
[3]Geert W, Tuytelaars T, Luc V G. An efficient dense and scale-invariant spatio-temporal interest point detector[C]//European Conference on Computer Vision. 2008: 650-663.
[4]Christian T, Hlavác V. Pose primitive based human action recognition in videos or still images[C]//IEEE Conference on Computer Vision and Pattern Recognition. 2008:1-8.
[5]Wang Heng, Klser A. Action recognition by dense trajectories[C]//IEEE Conference on Computer Vision and Pattern Recognition. 2011: 3169-3176.
[6]Zhang Zhengyou. Microsoft Kinect sensor and its effect[J]. IEEE Multimedia, 2012,19(2): 4-10.
[7]LeCun Y, Boser B E, Denker J S, et al. Handwritten digit recognition with a back-propagation network[C]//Advances in Neural Information Processing Systems. 1989.
[8]LeCun Y, Bottou L, Bengio Y, et al. Gradient-based learning applied to document recognition[J]. Proceedings of the IEEE, 1998, 86(11): 2278-2324.
[9]Krizhevsky A, Sutskever I, Hinton G E. Imagenet classification with deep convolutional neural networks[C]//Advances in Neural Information Processing Systems. 2012:1097-1105.
[10]Tran D, Bourdev L, Fergus R, et al. Learning spatiotemporal features with 3D convolutional networks[C]//IEEE International Conference on Computer Vision (ICCV). 2015:4489-4497.
[11]Donahue J, Hendricks L A, Guadarrama S, et al. Long-term recurrent convolutional networks for visual recognition and description[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2015:2626-2634.
[12]Simonyan K, Zisserman A. Two-stream convolutional networks for action recognition in videos[C]//Advances in Neural Information Processing Systems. 2014:568-576.
[13]Laptev I, Lindeberg T. On space-time interest points[J]. International Journal of Computer Vision, 2005,64(2-3):107-123.
[14]Lowe D G. Distinctive image features from scale-invariant keypoints[J]. International Journal of Computer Vision, 2004,60(2):91-110.
[15]Dalal N, Triggs B. Histograms of oriented gradients for human detection[C]//IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05). 2005:886-893.
[16]Scovanner P, Ali S, Shah M. A 3-dimensional sift descriptor and its application to action recognition[C]//Proceedings of the 15th ACM International Conference on Multimedia. 2007:357-360.
[17]Klaser A, Marszalek M, Schmid C. A spatio-temporal descriptor based on 3d-gradients[C]//BMVC 2008-19th British Machine Vision Conference. 2008:275.
[18]Karpathy A, Toderici G, Shetty S, et al. Large-scale video classification with convolutional neural networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2014:1725-1732.
[19]Jordan M I. Serial order: A parallel distributed processing approach[J]. Advances in Psychology, 1997,121:471-495.
[20]Rabiner L R. A tutorial on hidden Markov models and selected applications in speech recognition[J]. Proceedings of the IEEE, 1989,77(2):257-286.
[21]Hochreiter S, Schmidhuber J. Long short-term memory[J]. Neural Computation, 1997,9(8):1735-1780.
[22]Chollet F. Keras[DB/OL].https://github.com/fchollet/keras, 2015-01-01.
[23]Yu Gang, Liu Zicheng, Yuan Junsong. Discriminative orderlet mining for real-time recognition of human-object interaction[C]//Asian Conference on Computer Vision. 2014:50-65.
[24]Xia Lu, Aggarwal J K. Spatio-temporal depth cuboid similarity feature for activity recognition using depth camera[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2013:2834-2841.
[25]Wang Jiang, Liu Zicheng, Wu Ying, et al. Mining actionlet ensemble for action recognition with depth cameras[C]//IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2012:1290-1297.
[26]Yang Xiaodong, Tian Yingli. Eigenjoints-based action recognition using naive-bayes-nearest-neighbor[C]//IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. 2012:14-19. |