计算机与现代化 ›› 2023, Vol. 0 ›› Issue (12): 41-47.doi: 10.3969/j.issn.1006-2475.2013.12.008

• 算法设计与分析 • 上一篇    下一篇

基于双重特征注意力的多标签图像分类模型

  

  1. (广东工业大学计算机学院,广东 广州 510006)
  • 出版日期:2023-12-24 发布日期:2024-01-24
  • 作者简介::邱凯星(1998—),男,广东四会人,硕士研究生,研究方向:图像处理,深度学习,E-mail: 569172944@qq.com; 通信作者:冯广(1973—),男,广东云浮人,教授级高级实验师,博士,研究方向:网络控制,机器学习,大数据,E-mail: von@gdut.edu.cn。
  • 基金资助:
    国家自然科学基金资助项目(62237001); 中国高校产学研创新基金-新一代信息技术创新项目(2020ITA02013)

A Multi-label Image Classification Model Based on Dual Feature Attention

  1. (School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China)
  • Online:2023-12-24 Published:2024-01-24

摘要: 摘要:针对目前多标签图像分类任务中存在的图像多区域特征信息提取不足、图像特征与标签语义关系构建难等问题,提出一种基于双重特征注意力的多标签图像分类模型。首先,构建图像特征注意力模块对图像信息进行全局多区域特征的注意力关联,增强图像特征提取能力;其次,通过构建联合特征注意力模块对图像特征信息和标签嵌入进行相关性表示,从而使标签与图像区域之间进行跨模态融合得到更优的映射关系。实验结果表明,该模型在VOC2007和COCO2014多标签图像分类数据集中均取得了较好的分类效果,其性能指标相比于现有算法有较大的提升,验证了该模型的有效性。

关键词: 关键词:图像分类, 多标签, 注意力机制, 深度学习, 特征关联

Abstract: Abstract: A multi-label image classification model based on dual feature attention is proposed to address the current problems of insufficient extraction of feature information from multiple image regions and difficulty in constructing semantic relationships between image features and labels in multi-label image classification tasks. Firstly, the image feature attention module is constructed to correlate the attention of image information with global multi-region features to enhance image feature extraction. Secondly, a combined feature attention module is constructed to perform correlation representation of image feature information and label embedding, thus enabling cross-modal fusion between labels and image regions to obtain a better mapping relationship. The experimental results show that the model achieves better classification results in both the VOC2007 and COCO2014 multi-label image classification datasets, and its performance metrics have improved significantly compared with existing algorithms, verifying the effectiveness of the model.

Key words: Key words: image classification, multi-label, attention mechanisms, deep learning, feature association

中图分类号: