Person Search System by Enhanced Deep Feature Fusion

doi:10.3969/j.issn.1006-2475.2019.08.005

Computer and Modernization, 2019, Vol. 0, Issue (08): 23-.


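(1. National Joint Engineering Laboratory of Flat Panel Display Technology, Fuzhou University, Fuzhou, Fujian 350116, China;
2. College of Physics and Information Engineering, Fuzhou University, Fuzhou, Fujian 350116, China)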

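Received: 2019-01-18 Online: 2019-08-15 Published: 2019-08-16
About the authors: MEI Wen-xin (1995-), female, from Ningde, Fujian, master's student; research interests: digital image processing, machine learning, deep learning; E-mail: 17759371365@163.com. LIN Zhi-xian (1975-), male, professor, Ph.D.; research interests: information display technology, flat panel display device driving, image processing technology; E-mail: lzx2005000@163.com.
Funding:
National Key R&D Program of China (2016YFB0401503); Guangdong Provincial Major Science and Technology Project (2016B090906001); Fujian Provincial Major Science and Technology Project (2014HZ0003-1); Special Project Funded by Fujian Province for Provincial Universities (JK2014002)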


Abstract

Abstract: Deep features of pedestrian images lack descriptions of local detail and are not fully invariant to factors such as scale, rotation, translation, and illumination changes, which leads to low person search accuracy. To address this problem, this paper proposes a person search system with enhanced deep feature fusion. The system integrates and jointly optimizes two parts, a pedestrian candidate network and a pedestrian identification network, into a unified framework. The pedestrian candidate network obtains and calibrates the pedestrian boxes, while the pedestrian identification network, on top of the learned deep features, incorporates traditional features with geometric invariance, establishing a network model with enhanced deep feature fusion. Experimental results show that the enhanced deep feature fusion model detects and boxes the pedestrians in images from the SSM dataset and reaches a Top-1 recognition accuracy of 80.7%, outperforming a model that uses deep features alone.

Key words: deep feature, pedestrian search, feature fusion, pedestrian boxes, geometric invariance
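
The abstract states that deep features are combined with geometrically invariant traditional features and that the result is scored by Top-1 accuracy, but this page does not give the fusion rule. The following is a minimal sketch of one possible reading, assuming a weighted concatenation of L2-normalized descriptors and cosine-similarity ranking against a gallery. The names fuse_features, weight, and top1_accuracy are hypothetical and introduced only for illustration; this is not the authors' confirmed method.

```python
import math
from typing import List, Sequence, Tuple


def l2_normalize(vec: Sequence[float]) -> List[float]:
    """Scale a vector to unit L2 norm; an all-zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        return list(vec)
    return [x / norm for x in vec]


def fuse_features(deep: Sequence[float], handcrafted: Sequence[float],
                  weight: float = 0.5) -> List[float]:
    """Fuse two descriptors by weighted concatenation (an assumed fusion rule).

    Each part is normalized separately so neither dominates because of its
    raw scale; `weight` (hypothetical) balances deep vs. handcrafted parts.
    """
    d = [weight * x for x in l2_normalize(deep)]
    h = [(1.0 - weight) * x for x in l2_normalize(handcrafted)]
    return l2_normalize(d + h)


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two unit-norm vectors (reduces to a dot product)."""
    return sum(x * y for x, y in zip(a, b))


def top1_accuracy(queries: List[Tuple[str, List[float]]],
                  gallery: List[Tuple[str, List[float]]]) -> float:
    """Fraction of queries whose most similar gallery entry has the same identity."""
    hits = 0
    for query_id, query_vec in queries:
        best_id, _ = max(gallery, key=lambda item: cosine(query_vec, item[1]))
        if best_id == query_id:
            hits += 1
    return hits / len(queries)


# Toy data: two identities, each with a 2-D "deep" and a 2-D "handcrafted" vector.
gallery = [("a", fuse_features([1.0, 0.0], [1.0, 0.0])),
           ("b", fuse_features([0.0, 1.0], [0.0, 1.0]))]
queries = [("a", fuse_features([0.9, 0.1], [0.8, 0.2])),
           ("b", fuse_features([0.2, 0.8], [0.1, 0.9]))]
print(top1_accuracy(queries, gallery))
```

Normalizing each descriptor before concatenation is a design choice of this sketch: it keeps a high-dimensional or large-magnitude feature from swamping the other, so the hypothetical weight parameter alone controls their relative influence.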

CLC number: TP111

Cite this article: MEI Wen-xin (1,2), LIN Zhi-xian (1,2), GUO Tai-liang (1,2). Person Search System by Enhanced Deep Feature Fusion[J]. Computer and Modernization, 2019, 0(08): 23-.


