计算机与现代化

• 人工智能 • 上一篇    下一篇

基于Attention-based C-GRU神经网络的文本分类

  

  1. (北京交通大学计算机与信息技术学院,北京100044)
  • 收稿日期:2017-05-22 出版日期:2018-03-08 发布日期:2018-03-09
  • 作者简介:杨东(1991-),男,河北张家口人,北京交通大学计算机与信息技术学院硕士研究生,研究方向:移动与互联网; 王移芝(1953-),女,教授,研究方向:计算机网络与数据库技术
  • 基金资助:
    国家自然科学基金“面上”项目(K13A300050)

An Attention-based C-GRU Neural Network for Text Classification

  1. (School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China)
  • Received:2017-05-22 Online:2018-03-08 Published:2018-03-09

摘要: 文本分类是自然语言处理中一个经典的研究方向,在信息处理中扮演着重要的角色。目前深度学习已经在图像识别、机器翻译等领域取得了突破性的进展,而且它也被证明在自然语言处理任务中拥有着提取句子或文本更高层次表示的能力。本文提出一种新颖的深度学习混合模型Attention-based C-GRU用于文本分类,该模型结合CNN中的卷积层和GRU,通过引入Attention机制,突出关键词和优化特征提取过程。利用该模型去学习文本语义并且在主题分类、问题分类及情感分类等任务上对其做出评估。通过与对比模型和表现最优方法做比较,表明本文模型的有效性。

关键词: 文本分类, 深度学习, Attention机制

Abstract: Text classification is the classical research direction in NLP and plays an important role in information processing. At present, deep learning network has achieved the remarkable performance in image recognition, machine translation and other fields and it also has been proved to be capable of learning higher-level sentences and document representation in NLP tasks. In this paper, based on GRU model and the convolutional layer in CNN, we propose a novel hybrid text classification model called Attention-based C-GRU. Moreover, we introduce Attention model in our model, which effectively highlights the role of key words and optimizing the extraction of features. We leverage the model to learn the meaning of text and evaluate it on topic classification, question classification and sentiment classification tasks. The experiment demonstrates the effectiveness of our approach in comparison with baseline models and state-of-art methods.

Key words: text classification, deep learning, Attention model

中图分类号: