[1] LAPTEV I, LINDEBERG T. Space-time interest points[J]. International Journal of Computer Vision, 2005,64(2-3):107-123.
[2] BOIMAN O, IRANI M. Detecting irregularities in images and in video[J]. International Journal of Computer Vision, 2007,74(1):17-31.
[3] KITANI K M, ZIEBART B D, BAGNELL J A, et al. Activity forecasting[C]// European Conference on Computer Vision. Springer, Berlin, Heidelberg, 2012:201-214.
[4] WANG H , SCHMID C. Action recognition with improved trajectories[C]// IEEE International Conference on Computer Vision. 2014:3551-3558.
[5] LE Q V, ZOU W Y, YEUNG S Y, et al. Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis[C]// Proceedings of 2011 IEEE Conference on Computer Vision and Pattern Recognition. 2011:3361-3368.
[6] NG Y H, HAUSKNECHT M, VIJAYANARASIMHAN S, et al. Beyond short snippets: Deep networks for video classification[C]// 2015 IEEE Conference on Computer Vision and Pattern Recognition. 2015:4694-4702.
[7] DIBA A, SHARMA V, GOOL L V. Deep temporal linear encoding networks[C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition. 2017:430-443.
[8] TRAN D, BOURDEV L, FERGUS R, et al. Learning spatiotemporal features with 3D convolutional networks[C]// 2015 IEEE International Conference on Computer Vision. 2015:4489-4497.
[9] XU H, DAS A, SAENKO K. R-C3D: Region convolutional 3D network for temporal activity detection[C]// Proceedings of International Conference on Computer Vision. 2017:761-774.
[10]黄凯奇,陈晓棠,康运锋,等. 智能视频监控技术综述[J]. 计算机学报, 2015,20(6):1093-1118.
[11]王松,党建武,王阳萍,等. 实时动作识别方法研究[J]. 计算机工程与应用, 2017,53(3):28-31.
[12]JI S, XU W, YANG M, et al. 3D convolutional neural networks for human action recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013,35(1):221-231.
[13]YAN Z, ZHANG H, PIRAMUTHU R, et al. HD-CNN: Hierarchical deep convolutional neural networks for large scale visual recognition[C]// Proceedings of International Conference on Computer Vision. 2015:2740-2748.
[14]KARPATHY A, TODERICI G, SHETTY S , et al. Large-scale video classication with convolutional neural networks[C]// International Conference on Computer Vision and Pattern Recognition. 2014:1-6.
[15]HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C]// IEEE Conference on Computer Vision and Pattern Recognition. 2016:770-778.
[16]HOWARD A G, ZHU M, CHEN B, et al. MobileNets: Efficient convolutional neural networks for mobile vision applications[C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition. 2017:21-26.
[17]ZHENG Q, ZHANG Z, ZHANG S, et al. Merging-and-evolution networks for mobile vision applications[J]. IEEE Access, 2018(99):1.
[18]YANG Z, MOCZULSKI M, DENIL M, et al. Deep fried convnets[J]. Computer Science, 2015(12):7-13.
[19]SIGURDSSON G A, DIVVALA S, FARHADI A, et al. Asynchronous temporal fields for action recognition[C]// IEEE Conference on Computer Vision and Pattern Recognition. 2017:5650-5659.
[20]LI X, DING L, WANG L, et al. FPGA accelerates deep residual learning for image recognition[C]// IEEE Information Technology, Networking, Electronic and Automation Control Conference. 2017:834-840. |