计算机与现代化 ›› 2023, Vol. 0 ›› Issue (07): 105-111.doi: 10.3969/j.issn.1006-2475.2023.07.018

• 图像处理 • 上一篇    下一篇

基于多尺度ResNet融合注意力机制的麦冬细粒度识别

  

  1. (1.北京中医药大学管理学院,北京 102488; 2.北京中医药大学中药学院,北京102488)
  • 出版日期:2023-07-26 发布日期:2023-07-27
  • 作者简介:秦竹媛(2002—),女,重庆江北人,本科生,研究方向:医学图像处理,深度学习,E-mail: qzy18723491285@163.com; 吴浩忠(1984—),男,北京人,实验师,本科,研究方向:中药鉴定,E-mail: wuhaozhong@126.com; 通信作者:唐燕(1977—),女,副教授,硕士,研究方向:医学图像处理,深度学习,E-mail: tangyan97_1017@sina.com。
  • 基金资助:
    022教育部产学合作协同育人项目(220500643305240)

Fine-grained Identification of Maidong Based on Multi-scale ResNet Combining Attention Mechanism

  1. (1.School of Management, Beijing University of Chinese Medicine, Beijing 102488, China;
    2. School of Chinese Materia Medica, Beijing University of Chinese Medicine, Beijing 102488, China)
  • Online:2023-07-26 Published:2023-07-27

摘要: 中药材鉴别依赖于中药师的经验,效率低且没有统一的量化标准。针对川麦冬、山麦冬和浙麦冬3类易混淆中药饮片图像细粒度分类问题,本文提出一种基于ResNet-152残差神经网络的改进模型MARNet-152(Multiscale-Attention Residual Network-152),辅助人工自动辨识3种易混淆的麦冬饮片。基于ResNet-152残差神经网络构建改进的模型MARNet-152,对ResNet-152网络结构中Bottleneck的3×3卷积核进行分组卷积以提取和表示多尺度特征;引入结合空间和通道的卷积注意力机制模块(Convolutional Block Attention Module, CBAM),使模型更关注识别目标物体细节并具有更好的解释性。改进后的网络模型在麦冬图像细粒度识别时达到91.42%的分类精度,相较于基础模型提高了6.62个百分点,可为麦冬识别提供参考。MARNet-152模型具有更高的泛化能力,识别效果较原始ResNet-152模型提升非常明显。

关键词: 中药饮片辨识, 图像分类, 深度学习, 残差网络, 注意力机制

Abstract: The identification of traditional Chinese medicinal materials depends on the experience of Chinese pharmacists, with low efficiency and no unified quantitative criteria. Aiming at the fine granularity classification problem of Sichuan Ophiopogon japonicus, Liriope spicata and Zhejiang Ophiopogon japonicus, an improved MARNet-152(Multiscale-Attention Residual Network-152) model based on ResNet-152 neural network is proposed, which assists artificial identification of three easily-confused maidong decoction pieces automatically. An improved model, MARNet-152 is constructed based on ResNet-152 residual neural network, with group convolution of 3×3 convolutional kernels in the Bottleneck of the ResNet-152 network structure to extract and represent multi-scale features. The convolution attention mechanism module(CBAM) combining space and channel is introduced to make the model pay more attention to the recognition of target object details and have better interpretation. The classification accuracy of the improved network model reached 91.42% in the fine grained recognition of maidong image, which is 6.62 percentage points higher than that of the basic model, and could provide reference for the recognition of maidong image. The improved MARNet-152 model has higher generalization ability, and the recognition effect is significantly improved compared with the original ResNet-152 model.

Key words: Chinese medicine tablets identification, image classification, deep learning, residual networks, attention mechanism

中图分类号: