基于混合神经网络的问题分类方法

doi:10.3969/j.issn.1006-2475.2018.09.001

计算机与现代化 ›› 2018, Vol. 0 ›› Issue (09): 1-.doi: 10.3969/j.issn.1006-2475.2018.09.001

• 人工智能 • 下一篇

基于混合神经网络的问题分类方法

(1.中国科学院大学电子电气与通信工程学院,北京100049; 2.中国科学院电子学研究所,北京100190; 3.中国科学院空间信息处理与应用系统技术重点实验室，北京100190)

收稿日期:2018-03-09 出版日期:2018-09-29 发布日期:2018-09-30
作者简介:陈柯锦(1993-)，男，重庆人，中国科学院大学电子电气与通信工程学院、中国科学院电子学研究所硕士研究生，研究方向：问答系统，知识图谱; 许光銮(1978-)，男，研究员，博士，研究方向：地理空间信息挖掘与应用; 郭智(1975-)，男，研究员，博士，研究方向：数据挖掘，知识工程; 梁霄(1981-)，男，助理研究员，博士，研究方向：复杂网络，知识工程，问答系统。
基金资助:
国家自然科学基金资助项目(61725105, 61331017)

Question Classification Based on Hybrid Neural Network Model

(1. School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 100049, China;
2. Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China;
3. Key Laboratory of Technology in Geo-Spatial Information Processing and Application System,
Chinese Academy of Sciences, Beijing 100190, China)

Received:2018-03-09 Online:2018-09-29 Published:2018-09-30

摘要/Abstract

摘要： 自动问答系统对用户自然语言方式提出的问题，给出快速准确的答案，引起了学术界与工业界的广泛关注。问题分类任务通过自动判断问题类型，对提高问答系统回答问题的准确率具有重要意义。本文利用问题和答案的上下文信息，结合卷积神经网络和循环神经网络各自的优势，提出一种混合深度学习模型。除此之外，为了增强问题特征的表达能力，该模型引入注意力机制，提升模型的泛化能力。在360问答数据集进行对比实验验证，实验表明，本文模型相比于传统方法提升了1.6%~5.6%。

关键词: 问题分类, 联合表示, 深度学习, 注意力机制

Abstract: The automatic question answering system gives fast and accurate answers to the questions proposed by the users in natural language, arousing widespread concern in academia and industry. By automatically determining the type of question, question classification task is of great significance to improve the accuracy of the question answering system. Based on the contextual information of the question and answer, combined with the respective advantages of convolutional neural networks and recurrent neural networks, this paper proposes a hybrid deep learning model. In addition, in order to strengthen the representation capacity of the question, this model adopts attention mechanism and enhances the generalization ability of the model. In this paper, we conduct a comparative experiment on 360 QA datasets, results show that this model has improved 1.6%~5.6% compared with the traditional method.

Key words: question classification, joint representation, deep learning, attention mechanism

中图分类号:

TP391

陈柯锦1,2,3，许光銮2,3，郭智2,3，梁霄2,3. 基于混合神经网络的问题分类方法[J]. 计算机与现代化, 2018, 0(09): 1-.

CHEN Ke-jin1,2,3, XU Guang-luan2,3, GUO Zhi2,3, LIANG Xiao2,3. Question Classification Based on Hybrid Neural Network Model[J]. Computer and Modernization, 2018, 0(09): 1-.

参考文献

［1］ Joachims T. Text categorization with support vector machines: Learning with many relevant features［C］// Proceedings of the 10th European Conference on Machine Learning. 1998:137-142.
［2］ Aikawa N, Sakai T, Yamana H. Community QA question classification: Is the asker looking for subjective answers or not?［J］. IPSJ Online Transactions, 2011,4:160-168.
［3］ Li Xin, Roth D. Learning question classifiers［C］// Proceedings of the 19th International Conference on Computational Linguistics. 2002,1:556-562.
［4］ Zhang D, Lee W S. Question classification using support vector machines［C］// Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 2003:26-32.
［5］ Kim Y. Convolutional neural networks for sentence classification［C］// Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. 2014:1746-1751.
［6］ Lecun Y, Bottou L, Bengio Y, et al. Gradient-based learning applied to document recognition［J］. Proceedings of the IEEE, 1998,86(11):2278-2324.
［7］ Ma Mingbo, Huang Liang, Xiang Bing, et al. Group sparse CNNs for question classification with answer sets［C］// Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. 2017,2:335-340.
［8］ Gers F A, Schmidhuber J, Cummins F. Learning to forget: Continual prediction with LSTM［J］. Neural Computation, 2000,12(10):2451-2471.
［9］ Bahdanau D, Cho K, Bengio Y. Neural Machine Translation by Jointly Learning to Align and Translate［DB/OL］. https://arxiv.org/pdf/1409.0473v7.pdf, 2016-05-19.
［10］Ding Zixiang, Xia Rui, Yu Jianfei, et al. Densely Connected Bidirectional LSTM with Applications to Sentence Classification［DB/OL］. https://arxiv.org/pdf/1802.00889v1.pdf, 2018-02-03.
［11］Graves A, Schmidhuber J. Framewise phoneme classification with bidirectional LSTM and other neural network architectures［J］. Neural Networks, 2005,18(5-6):602-610.
［12］Yang Zichao, Yang Diyi, Dyer C, et al. Hierarchical attention networks for document classification［C］// Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2016:1480-1489.
［13］Jain L C, Medsker L R. Recurrent Neural Networks: Design and Applications［M］. CRC Press, 1999.
［14］Hochreiter S, Schmidhuber J. Long short-term memory［J］. Neural Computation, 1997,9(8):1735-1780.
［15］Graves A. Generating Sequences with Recurrent Neural Networks［DB/OL］. https://arxiv.org/pdf/1308.0850v5.pdf, 2014-06-05.
［16］张栋,李寿山,王晶晶. 基于问题与答案联合表示学习的半监督问题分类方法［J］. 中文信息学报, 2017,31(1):1-7.
［17］Mikolov T, Sutskever I, Chen Kai, et al. Distributed representations of words and phrases and their compositionality［C］// Proceedings of the 2013 Advances in Neural Information Processing Systems. 2013:3111-3119.
［18］Prechelt L. Early stopping-but when?［M］// Neural Networks: Tricks of the Trade. Springer, 1998:55-69.
［19］Srivastava N, Hinton G, Krizhevsky A, et al. Dropout: A simple way to prevent neural networks from overfitting［J］. Journal of Machine Learning Research, 2014,15(1):1929-1958.

[1]	何思达, 陈平华. 基于意图的轻量级自注意力序列推荐模型[J]. 计算机与现代化, 2024, 0(12): 1-9.
[2]	赵晨阳, 薛涛, 刘俊华. 基于改进Stable Diffusion的时尚服饰图案生成[J]. 计算机与现代化, 2024, 0(12): 15-23.
[3]	黄庭培1, 马禄彪1, 李世宝2, 刘建航1. 基于WiFi和原型网络的手势识别方法[J]. 计算机与现代化, 2024, 0(12): 34-39.
[4]	张晓东1, 白广芝1, 李敏1, 李昊洋2. 基于经验小波变换的油气井产量预测模型 [J]. 计算机与现代化, 2024, 0(12): 53-58.
[5]	刘云海1, 冯广1, 吴晓婷2, 杨群2. 复杂施工场景下的安全帽佩戴检测算法[J]. 计算机与现代化, 2024, 0(12): 66-71.
[6]	谷岳, 邓松峰, 沈霁, 穆文涛, 赵恩棋. 基于改进YOLOv8的SAR舰船目标检测算法[J]. 计算机与现代化, 2024, 0(12): 78-83.
[7]	王艳媛, 茅正冲. 中英文场景文本图像的检测和识别算法[J]. 计算机与现代化, 2024, 0(12): 84-90.
[8]	李钧超1, 尤菲1, 张超2, 苏乐乐2, 龚龑2. 基于新型多目标浣熊优化算法的BiLSTM-Attention#br# 预测模型及误差分析[J]. 计算机与现代化, 2024, 0(11): 70-76.
[9]	张宇1, 2, 黎靖1, 2, 马铭1, 2, 王众祥1, 2, 孙妍1, 2. YOLOLW:一个新的轻量级目标检测模型[J]. 计算机与现代化, 2024, 0(11): 91-98.
[10]	祁贤, 刘大铭, 常佳鑫. 基于改进自注意力机制的多视图三维重建[J]. 计算机与现代化, 2024, 0(11): 106-112.
[11]	陈凯1, 李宜汀1, 2, 全华凤1 . 基于改进YOLOv8的河道废弃瓶检测方法[J]. 计算机与现代化, 2024, 0(11): 113-120.
[12]	杨骏1, 胡为1, 朱文福2. 基于改进MobileNetV3的视觉SLAM回环检测算法[J]. 计算机与现代化, 2024, 0(10): 21-26.
[13]	魏学诚1, 江凌云1, 李研2, 何非2. 改进YOLOv5的路侧单目视角小目标检测算法[J]. 计算机与现代化, 2024, 0(10): 27-34.
[14]	杜猛俊1, 李昂1, 童俊1, 钱锦1, 康恺1, 王若丁1, 靳文星2. 基于改进极限学习算法的电力信息数据融合模型[J]. 计算机与现代化, 2024, 0(10): 61-64.
[15]	王莹莹, 郝潇. 基于Res2Net和递归门控卷积的细粒度图像分类[J]. 计算机与现代化, 2024, 0(10): 74-79.

基于混合神经网络的问题分类方法

Question Classification Based on Hybrid Neural Network Model

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价