基于Attention-based C-GRU神经网络的文本分类

doi:10.3969/j.issn.1006-2475.2018.02.020

计算机与现代化 ›› 2018, Vol. 0 ›› Issue (02): 96-.doi: 10.3969/j.issn.1006-2475.2018.02.020

基于Attention-based C-GRU神经网络的文本分类

(北京交通大学计算机与信息技术学院，北京100044)

收稿日期:2017-05-22 出版日期:2018-03-08 发布日期:2018-03-09
作者简介:杨东(1991-),男,河北张家口人,北京交通大学计算机与信息技术学院硕士研究生,研究方向：移动与互联网； 王移芝(1953-),女,教授,研究方向：计算机网络与数据库技术
基金资助:
国家自然科学基金“面上”项目(K13A300050)

An Attention-based C-GRU Neural Network for Text Classification

(School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China)

Received:2017-05-22 Online:2018-03-08 Published:2018-03-09

摘要/Abstract

摘要： 文本分类是自然语言处理中一个经典的研究方向，在信息处理中扮演着重要的角色。目前深度学习已经在图像识别、机器翻译等领域取得了突破性的进展，而且它也被证明在自然语言处理任务中拥有着提取句子或文本更高层次表示的能力。本文提出一种新颖的深度学习混合模型Attention-based C-GRU用于文本分类，该模型结合CNN中的卷积层和GRU，通过引入Attention机制，突出关键词和优化特征提取过程。利用该模型去学习文本语义并且在主题分类、问题分类及情感分类等任务上对其做出评估。通过与对比模型和表现最优方法做比较，表明本文模型的有效性。

关键词: 文本分类, 深度学习, Attention机制

Abstract: Text classification is the classical research direction in NLP and plays an important role in information processing. At present, deep learning network has achieved the remarkable performance in image recognition, machine translation and other fields and it also has been proved to be capable of learning higher-level sentences and document representation in NLP tasks. In this paper, based on GRU model and the convolutional layer in CNN, we propose a novel hybrid text classification model called Attention-based C-GRU. Moreover, we introduce Attention model in our model, which effectively highlights the role of key words and optimizing the extraction of features. We leverage the model to learn the meaning of text and evaluate it on topic classification, question classification and sentiment classification tasks. The experiment demonstrates the effectiveness of our approach in comparison with baseline models and state-of-art methods.

Key words: text classification, deep learning, Attention model

中图分类号:

TP391

杨东，王移芝. 基于Attention-based C-GRU神经网络的文本分类[J]. 计算机与现代化, 2018, 0(02): 96-.

YANG Dong, WANG Yi-zhi. An Attention-based C-GRU Neural Network for Text Classification[J]. Computer and Modernization, 2018, 0(02): 96-.

参考文献

［1］ Bengio Y, Dvcharme R, Vincent P, et al. A neural probabilistic language model［J］. Journal of Machine Learning Research, 2003,3(6):1137-1155.〖HJ1.09mm〗
［2］ Graves A, Mohamed A R, Hinton G. Speech recognition with deep recurrent neural networks［C］//2013 IEEE International Conference on Acoustics, Speech and Signal Processing. 2013:6645-6649.
［3］ Sutskever I, Vinyals O, Le Q V. Sequence to sequence learning with neural networks［C］// Proceedings of the 27th International Conference on Neural Information Processing Systems. 2014:3104-3112.
［4］〖JP2〗Lecun Y, Bottou L, Bengio Y, et al. Gradient-based learning applied to document recognition［J］. Proceedings of the IEEE, 1998,86(11):2278-2324.
［5］ Mikolov T, Sutskever I, Chen Kai, et al. Distributed representations of words and phrases and compositionality［C］// Proceedings of the 26th International Conference on Neural Information Processing Systems. 2013:3111-3119.
［6］ Lai Siwei, Xu Liheng, Liu Kang, et al. Recurrent Convolutional neural network for text classification［C］// The 29th AAAI Conference on Artifical Intelligence. 2015,333:2267-2273.
［7］ Kim Y. Convolutional neural networks for sentence classification［J］. Computer Science, 2014: arXiv:1408.5882.
［8］ Mou Lili, Peng Hao, Li Ge, et al. Discriminative neural sentence modeling by tree-based convolution［J］. Computer Science, 2015: arXiv:1504.01106.
［9］ Koutnik J, Greff K, Gomez F, et al. A clockwork RNN［L］. Computer Science, 2014: arXiv:1402.3511.
［10］Cho K, Merrienboer B V, Gulcehre C, et al. Learning Phrase representations using RNN encoder-decoder for statistical machine translation［J］. Computer Science, 2014: arXiv:1406.1078.
［11］Luong M T, Pham H, Manning C D. Effective approaches to attention-based neural machine translation［J］. Computer Science, 2015: arXiv:1508.04025.
［12］Bahdanau D, Cho K, Bengio Y. Neural machine translation by jointly learning to align and translate［J］. Computer Science, 2014: arXiv:1409.0473.
［13］Mendes A C, Wichert A. From symbolic to sub-symbolic information in question classification［J］. Artificial Intelligence Review, 2011,35(2):137-154.
［14］Socher R, Perelygin A, Wu J, et al. Recursive deep models for semantic compositionality over a sentiment treebank［C］// Conference on Empirical Methods in Natural Language Processing (EMNLP 2013). 2013:1631-1642.
［15］Collobert R, Weston J, Bottou L, et al. Natural language processing(almost)from scratch［J］. Journal of Machine Learning Research, 2011,12(1):2493-2537.
［16］张冲. 基于Attention-Based LSTM模型的文本分类技术的研究［D］. 南京:南京大学, 2016.
［17］Yan Yan, Yin Xu-Cheng, Li Sujian, et al. Hybrid deep belief network［J］. Computational Intelligence and Neuroscience, 2015(5):650527:1-650527:9.
［18］Zhu Xiaodan, Sobhani P, Guo Hongyu. Learning document Semantic representation with long short-term memory over recursive structures［C］// Proceedings of the 32nd International Conference on International Conference on Machine Learning. 2015:1604-1612.

[1]	祁贤, 刘大铭, 常佳鑫. 基于改进自注意力机制的多视图三维重建[J]. 计算机与现代化, 2024, 0(11): 106-112.
[2]	陈凯1, 李宜汀1, 2, 全华凤1 . 基于改进YOLOv8的河道废弃瓶检测方法[J]. 计算机与现代化, 2024, 0(11): 113-120.
[3]	杨骏1, 胡为1, 朱文福2. 基于改进MobileNetV3的视觉SLAM回环检测算法[J]. 计算机与现代化, 2024, 0(10): 21-26.
[4]	王莹莹, 郝潇. 基于Res2Net和递归门控卷积的细粒度图像分类[J]. 计算机与现代化, 2024, 0(10): 74-79.
[5]	史星宇1, 李强2, 庄莉3, 梁懿3, 王秋琳3, 陈锴3, 伍臣周3, 常胜1. 一种面向工业部署的目标检测模型蒸馏技术[J]. 计算机与现代化, 2024, 0(10): 93-99.
[6]	张泽1, 张建权2, 3, 周国鹏2, 3. 基于改进YOLOv8s的摄像头模组缺陷检测[J]. 计算机与现代化, 2024, 0(09): 107-113.
[7]	程亚子1, 雷亮1, 2, 陈瀚1, 赵毅然1. 基于转置注意力的多尺度深度融合单目深度估计[J]. 计算机与现代化, 2024, 0(09): 121-126.
[8]	程萌, 李浩. 改进YOLOv5s的落叶树鸟巢检测方法[J]. 计算机与现代化, 2024, 0(08): 24-29.
[9]	王梦溪, 李峻. 老年人跌倒检测技术研究综述[J]. 计算机与现代化, 2024, 0(08): 30-36.
[10]	时现伟1, 范鑫2. 基于轻量化的视频帧场景语义分割方法[J]. 计算机与现代化, 2024, 0(08): 49-53.
[11]	徐新爱, 李钢. 基于DCGAN的课堂表情图像生成方法[J]. 计算机与现代化, 2024, 0(08): 88-91.
[12]	高帅鹏, 王怡凡. 基于图像的群体情绪识别综述[J]. 计算机与现代化, 2024, 0(08): 98-107.
[13]	周宪溪, 牟莉. 基于改进TF-IDF和AGLCNN的新闻长文本分类模型[J]. 计算机与现代化, 2024, 0(08): 120-126.
[14]	黄文栋, 王怡凡. 基于模态类别的多模态信息处理与融合综述[J]. 计算机与现代化, 2024, 0(07): 47-62.
[15]	武丽1, 张征浩2, 葛彩成2, 俞俊2. 基于改进SCNN网络的车道线检测算法[J]. 计算机与现代化, 2024, 0(07): 87-92.

基于Attention-based C-GRU神经网络的文本分类

An Attention-based C-GRU Neural Network for Text Classification

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价