基于DCN-SERes-YOLOv3的人脸佩戴口罩检测算法

摘要/Abstract

摘要： 2020年新冠疫情爆发，佩戴口罩是有效抑制疫情反弹的重要措施之一，研究利用机器视觉技术检测人脸是否佩戴口罩有重要的现实意义。本文针对视频图像中人脸佩戴口罩时存在遮挡、检测目标较小、特征信息不明显、目标靠近群体不易识别等问题，提出一种基于DCN-SERes-YOLOv3的人脸佩戴口罩检测算法。首先，采用ResNet50与YOLOv3相结合的方式，将主干网络替换为ResNet50残差网络，为了平衡模型的精度与速度，对残差块中的卷积层改进并加入平均池化层，降低模型的损失与复杂度，提高检测速度；其次，将ResNet50残差网络中第4个残差块的常规卷积替换为DCN可变形卷积，提高模型适应人脸佩戴口罩时发生几何形变的能力；最后，引入SENet通道注意力机制，增强特征信息的表达能力。实验结果表明，本文算法的平均精度值高达95.36%，比传统YOLOv3算法提高了约4.1个百分点，且检测速度提高了11.7 fps，本文算法提高了检测人脸佩戴口罩任务的精度与速度，有较好的应用前景。

关键词: 口罩佩戴, YOLOv3算法, ResNet50残差网络, 通道注意力机制, 可变形卷积, 疫情防控

Abstract: With the outbreak of the COVID-19 epidemic in 2020, wearing mask is one of the important measures to effectively suppress the rebound of the epidemic. It is of great practical significance to study the use of machine vision technology to detect whether face masks are worn or not. This paper proposes a face mask detection algorithm based on DCN-SERes-YOLOv3 to solve the problems of occlusion, small detection targets, unobvious feature information, and difficult identification of the target group when wearing masks in video image. Firstly, the algorithm uses the combination of ResNet50 and YOLOv3 to replace the backbone network with the ResNet50 residual network. In order to balance the accuracy and speed of the model, the convolutional layer in the residual block is improved and the average pooling layer is added to reduce the model’s loss and complexity, improve the detection speed. Secondly, the conventional convolution of the fourth residual block in the ResNet50 residual network is replaced with DCN deformable convolution to improve the model’s ability to adapt to geometric deformation when wearing masks. Finally, the SENet channel attention mechanism is introduced to enhance the ability to express characteristic information. The experimental results show that the average accuracy of the algorithm proposed in this paper is as high as 95.36%, which is about 4.1 percent point higher than the traditional YOLOv3 algorithm, and the detection speed is increased by 11.7 fps. The proposed algorithm improves the precision and the speed of the task of detecting faces wearing masks and has high application prospect.

Key words: mask wearing, YOLOv3 algorithm, ResNet50 residual network, channel attention mechanism, deformable convolution, epidemic prevention and control

李国进, 荣誉. 基于DCN-SERes-YOLOv3的人脸佩戴口罩检测算法[J]. 计算机与现代化, 2021, 0(09): 12-20.

LI Guo-jin, RONG Yu. Face Mask Detection Algorithm Based on DCN-SERes-YOLOv3[J]. Computer and Modernization, 2021, 0(09): 12-20.

参考文献

［1］彭麟,王卫红. 新冠肺炎(COVID-19)的致死率分析与治疗对策［J］. 基因组学与应用生物学, 2020,39(9):4405-4408.
［2］徐丹. 新冠肺炎疫情对经济社会发展可能产生影响的预判［J］. 今日科苑, 2020(2):4-6.
［3］张吉明. 我国抗击新冠疫情彰显出的优势［J］. 中华魂, 2020(6):13-19.
［4］谢晶仁. 国外传染病疫情防控对中国应对突发疫情的启示［J］. 中国公共安全(学术版), 2020(1):1-4.
［5］多米尼克·吉尔伯特. 新冠病毒传染性有多强［J］. 养猪, 2020(2):7.
［6］致真. 飞沫传播与口罩［J］. 家教世界, 2020(10):44-45.
［7］仁民. 科学戴口罩让疫情防控更精准［N］. 四平日报, 2020-03-20(006).
［8］牛作东,覃涛,李捍东等. 改进RetinaFace的自然场景口罩佩戴检测算法［J］. 计算机工程与应用, 2020,56(12):1-7.
［9］邓黄潇. 基于迁移学习与RetinaNet的口罩佩戴检测的方法［J］. 电子技术与软件工程, 2020(5):209-211.
［10］曹城硕,袁杰. 基于YOLO-Mask算法的口罩佩戴检测方法［J/OL］. 激光与光电子学进展:1-13(2020-12-04)［2020-12-10］. http://kns.cnki.net/kcms/detail/31.1690.TN.20201009.1330.006.html.
［11］王艺皓,丁洪伟,李波等. 复杂场景下基于改进YOLOv3的口罩佩戴检测算法［J］. 计算机工程, 2020,46(11):12-22.

［12］管军霖,智鑫. 基于YOLOv4卷积神经网络的口罩佩戴检测方法［J］. 现代信息科技, 2020,4(11):9-12.

［13］王沣. 改进YOLOv5的口罩和安全帽佩戴人工智能检测识别算法［J］. 建筑与预算, 2020(11):67-69.
［14］GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation［C］// Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. 2014:580-587.
［15］GIRSHICK R. Fast R-CNN［C］// 2015 IEEE International Conference on Computer Vision (ICCV). 2015:1440-1448.
［16］REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017,39(6):1137-1149.
［17］LIU W, ANGUELOV D, ERHAN D, et al. SSD: Single shot multi-box detector［C］// Proceedings of the 2016 European Conference on Computer Vision. 2016:21-37.
［18］REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: Unified, real-time object detection［C］// 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2016:779-788.
［19］REDMON J, FARHADI A. YOLO9000: Better, faster, stronger［C］// 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2017:6517-6525.
［20］REDMON J, FARHADI A. YOLOv3: An incremental improvement［C］// 2018 IEEE Conference on Computer Vision and Pattern Recogniton (CVPR). 2018:1-6.
［21］HE K M, ZHANG X Y, REN S Q, et al. Deep residual learning for image recognition ［C］// 2016 IEEE Conference on Computer Vision and Pattern Recognition. 2016:770-778.
［22］DAI J F, QI H Z, XIONG Y W. Deformable convolutional networks［C］// Proceedings of the IEEE International Conference on Computer Vision. 2017:764-773.
［23］HU J, LI S, SUN G. Squeeze and excitation networks［C］// IEEE Conference on Computer Vision and Pattern Recognition. 2018:7132-7141.

[1]	马永, 王俊, 张子健, 赵煜阳, 张靖, 周明. 面向智慧运维系统的改进YOLOv8行为检测算法[J]. 计算机与现代化, 2024, 0(08): 43-48.
[2]	张浩洋, 尹梓名, 乐珺怡, 沈达聪, 束翌俊, 杨自逸, 孔祥勇, 龚伟. 3D-SPRNet: 一种基于并行解码器和双注意力机制的胆囊癌分割模型[J]. 计算机与现代化, 2023, 0(12): 59-66.
[3]	李燕, 卢峥松, 李青云, 杨世海, 张小龙. 基于轻量级结构重参数化网络的口罩检测算法[J]. 计算机与现代化, 2022, 0(07): 40-46.
[4]	梁正友, 耿经邦, 孙宇. 基于改进残差网络的交通标志识别算法[J]. 计算机与现代化, 2022, 0(04): 52-57.
[5]	金鑫, 曾思轲, 刘阳, 武楚涵. 基于改进YOLOv4的口罩佩戴检测算法[J]. 计算机与现代化, 2022, 0(01): 85-90.
[6]	李传栋, 邱磊, 于雁. 基于改进残差密集网络的心律失常自动分类[J]. 计算机与现代化, 2021, 0(11): 106-111.
[7]	吴水明, 朱燕, 王芳, 景栋盛. 一种基于Cascade R-CNN的电子器件容器质检方法[J]. 计算机与现代化, 2020, 0(11): 33-38.
[8]	苏军雄，见雪婷，刘玮，华俊达，张胜祥. 基于可变形卷积神经网络的手势识别方法[J]. 计算机与现代化, 2018, 0(04): 62-.