计算机与现代化 ›› 2023, Vol. 0 ›› Issue (05): 26-31.

• 人工智能 • 上一篇    下一篇

融合注意力机制的非对称深度监督哈希

  

  1. (中北大学软件学院,山西 太原 030051)
  • 出版日期:2023-06-06 发布日期:2023-06-06
  • 作者简介:王欣怡(1997—),女,山西运城人,硕士研究生,研究方向:深度学习,图像检索,E-mail: m18435221863@qq.com; 尹四清(1964—),男,副教授,硕士生导师,E-mail: yinsq@nuc.edu.cn; 洪军(1979—),男,副教授,博士,E-mail: hongjun@nuc.ed.cn。
  • 基金资助:
    山西省自然科学基金资助项目(201901D111149)

Asymmetric Deep Supervised Hashing with Attention Mechanism

  1. (School of Software, North University of China, Taiyuan 030051, China)
  • Online:2023-06-06 Published:2023-06-06

摘要: 随着大数据时代的到来,互联网上的信息数据呈指数级增长。在这些数据中,图像资源占比巨大,因此如何在海量图像中进行准确而高效的图像检索成为当今的重要研究课题之一。目前大多数方法提取到的特征信息含有大量冗余信息,使得在图像检索中不能有效关注到图像的重点区域而导致检索性能差、准确度低等问题。基于以上不足,本文提出一种融合注意力机制的非对称深度哈希算法。以卷积神经网络为基础,对现有的由语义特征引导的混合注意力机制进行改进,将其嵌入进网络中,使得哈希网络将全局语义信息和局部语义信息共同分析。同时设计新的量化函数来减少量化误差,从而增强哈希编码的特征表达能力。并采用mAP作为评价指标,在数据集CIFAR-10和NUS-WIDE数据集上将本文方法与其他哈希方法进行比较,结果表明本文设计的网络模型能很好地结合全局和局部的特征信息,提高图像检索性能。

关键词: 图像检索, 注意力机制, 深度哈希, 卷积神经网络, 特征提取

Abstract: With the advent of the era of big data, the information data on the Internet is growing exponentially. Among these data, image resource accounts for a very large proportion, so how to carry out accurate and efficient image retrieval from massive images has become one of the important research topics today. At present, there are some problems in large-scale image retrieval, such as poor retrieval performance and low accuracy due to the inability to effectively focus on the key areas of the image. Based on the above shortcomings, an asymmetric deep hash algorithm that integrates the attention mechanism is proposed, which is modified based on convolutional neural network. The existing mixed attention mechanism guided by semantic features is improved and embedded into the network, so that the hash network can analyze the global semantic information and local semantic information together. At the same time, a new quantization function is designed to reduce quantization error, so as to enhance the feature expression ability of hash coding. This method is compared with other hashing methods on the CIFAR-10 and NUS-WIDE datasets with evaluation standard mAP. The results show that the proposed network model can combine global and local spatial features well, and improve the image retrieval performance.

Key words: image retrieval, attention mechanism, deep hashing, convolutional neural network, feature extraction