基于多尺度ResNet融合注意力机制的麦冬细粒度识别

doi:10.3969/j.issn.1006-2475.2023.07.018

计算机与现代化 ›› 2023, Vol. 0 ›› Issue (07): 105-111.doi: 10.3969/j.issn.1006-2475.2023.07.018

基于多尺度ResNet融合注意力机制的麦冬细粒度识别

（1.北京中医药大学管理学院，北京 102488； 2.北京中医药大学中药学院，北京102488）

出版日期:2023-07-26 发布日期:2023-07-27
作者简介:秦竹媛（2002—），女，重庆江北人，本科生，研究方向:医学图像处理，深度学习，E-mail： qzy18723491285@163.com；吴浩忠（1984—），男，北京人，实验师，本科，研究方向:中药鉴定，E-mail： wuhaozhong@126.com；通信作者:唐燕（1977—），女，副教授，硕士，研究方向:医学图像处理，深度学习，E-mail： tangyan97_1017@sina.com。
基金资助:
022教育部产学合作协同育人项目（220500643305240）

Fine-grained Identification of Maidong Based on Multi-scale ResNet Combining Attention Mechanism

（1.School of Management， Beijing University of Chinese Medicine， Beijing 102488， China；
2. School of Chinese Materia Medica， Beijing University of Chinese Medicine， Beijing 102488， China）

Online:2023-07-26 Published:2023-07-27

摘要/Abstract

摘要： 中药材鉴别依赖于中药师的经验，效率低且没有统一的量化标准。针对川麦冬、山麦冬和浙麦冬3类易混淆中药饮片图像细粒度分类问题，本文提出一种基于ResNet-152残差神经网络的改进模型MARNet-152（Multiscale-Attention Residual Network-152），辅助人工自动辨识3种易混淆的麦冬饮片。基于ResNet-152残差神经网络构建改进的模型MARNet-152，对ResNet-152网络结构中Bottleneck的3×3卷积核进行分组卷积以提取和表示多尺度特征；引入结合空间和通道的卷积注意力机制模块（Convolutional Block Attention Module， CBAM），使模型更关注识别目标物体细节并具有更好的解释性。改进后的网络模型在麦冬图像细粒度识别时达到91.42%的分类精度，相较于基础模型提高了6.62个百分点，可为麦冬识别提供参考。MARNet-152模型具有更高的泛化能力，识别效果较原始ResNet-152模型提升非常明显。

关键词: 中药饮片辨识, 图像分类, 深度学习, 残差网络, 注意力机制

Abstract: The identification of traditional Chinese medicinal materials depends on the experience of Chinese pharmacists， with low efficiency and no unified quantitative criteria. Aiming at the fine granularity classification problem of Sichuan Ophiopogon japonicus， Liriope spicata and Zhejiang Ophiopogon japonicus， an improved MARNet-152（Multiscale-Attention Residual Network-152） model based on ResNet-152 neural network is proposed， which assists artificial identification of three easily-confused maidong decoction pieces automatically. An improved model， MARNet-152 is constructed based on ResNet-152 residual neural network， with group convolution of 3×3 convolutional kernels in the Bottleneck of the ResNet-152 network structure to extract and represent multi-scale features. The convolution attention mechanism module（CBAM） combining space and channel is introduced to make the model pay more attention to the recognition of target object details and have better interpretation. The classification accuracy of the improved network model reached 91.42% in the fine grained recognition of maidong image， which is 6.62 percentage points higher than that of the basic model， and could provide reference for the recognition of maidong image. The improved MARNet-152 model has higher generalization ability， and the recognition effect is significantly improved compared with the original ResNet-152 model.

Key words: Chinese medicine tablets identification, image classification, deep learning, residual networks, attention mechanism

中图分类号:

TP183
R2

秦竹媛, 吴浩忠, 谭代庆, 韩爱庆, 臧昊, 王选, 唐燕. 基于多尺度ResNet融合注意力机制的麦冬细粒度识别[J]. 计算机与现代化, 2023, 0(07): 105-111.

QIN Zhu-yuan, WU Hao-zhong, TAN Dai-qing, HAN Ai-qing, ZANG Hao, WANG Xuan, TANG Yan. Fine-grained Identification of Maidong Based on Multi-scale ResNet Combining Attention Mechanism[J]. Computer and Modernization, 2023, 0(07): 105-111.

[1]	何思达, 陈平华. 基于意图的轻量级自注意力序列推荐模型[J]. 计算机与现代化, 2024, 0(12): 1-9.
[2]	赵晨阳, 薛涛, 刘俊华. 基于改进Stable Diffusion的时尚服饰图案生成[J]. 计算机与现代化, 2024, 0(12): 15-23.
[3]	黄庭培1, 马禄彪1, 李世宝2, 刘建航1. 基于WiFi和原型网络的手势识别方法[J]. 计算机与现代化, 2024, 0(12): 34-39.
[4]	张晓东1, 白广芝1, 李敏1, 李昊洋2. 基于经验小波变换的油气井产量预测模型 [J]. 计算机与现代化, 2024, 0(12): 53-58.
[5]	刘云海1, 冯广1, 吴晓婷2, 杨群2. 复杂施工场景下的安全帽佩戴检测算法[J]. 计算机与现代化, 2024, 0(12): 66-71.
[6]	谷岳, 邓松峰, 沈霁, 穆文涛, 赵恩棋. 基于改进YOLOv8的SAR舰船目标检测算法[J]. 计算机与现代化, 2024, 0(12): 78-83.
[7]	王艳媛, 茅正冲. 中英文场景文本图像的检测和识别算法[J]. 计算机与现代化, 2024, 0(12): 84-90.
[8]	李钧超1, 尤菲1, 张超2, 苏乐乐2, 龚龑2. 基于新型多目标浣熊优化算法的BiLSTM-Attention#br# 预测模型及误差分析[J]. 计算机与现代化, 2024, 0(11): 70-76.
[9]	张宇1, 2, 黎靖1, 2, 马铭1, 2, 王众祥1, 2, 孙妍1, 2. YOLOLW:一个新的轻量级目标检测模型[J]. 计算机与现代化, 2024, 0(11): 91-98.
[10]	祁贤, 刘大铭, 常佳鑫. 基于改进自注意力机制的多视图三维重建[J]. 计算机与现代化, 2024, 0(11): 106-112.
[11]	陈凯1, 李宜汀1, 2, 全华凤1 . 基于改进YOLOv8的河道废弃瓶检测方法[J]. 计算机与现代化, 2024, 0(11): 113-120.
[12]	杨骏1, 胡为1, 朱文福2. 基于改进MobileNetV3的视觉SLAM回环检测算法[J]. 计算机与现代化, 2024, 0(10): 21-26.
[13]	魏学诚1, 江凌云1, 李研2, 何非2. 改进YOLOv5的路侧单目视角小目标检测算法[J]. 计算机与现代化, 2024, 0(10): 27-34.
[14]	杜猛俊1, 李昂1, 童俊1, 钱锦1, 康恺1, 王若丁1, 靳文星2. 基于改进极限学习算法的电力信息数据融合模型[J]. 计算机与现代化, 2024, 0(10): 61-64.
[15]	王莹莹, 郝潇. 基于Res2Net和递归门控卷积的细粒度图像分类[J]. 计算机与现代化, 2024, 0(10): 74-79.

基于多尺度ResNet融合注意力机制的麦冬细粒度识别

Fine-grained Identification of Maidong Based on Multi-scale ResNet Combining Attention Mechanism

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价