[1] Joachims T. Text categorization with support vector machines: Learning with many relevant features[C]// Proceedings of the 10th European Conference on Machine Learning. 1998:137-142.
[2] Aikawa N, Sakai T, Yamana H. Community QA question classification: Is the asker looking for subjective answers or not?[J]. IPSJ Online Transactions, 2011,4:160-168.
[3] Li Xin, Roth D. Learning question classifiers[C]// Proceedings of the 19th International Conference on Computational Linguistics. 2002,1:556-562.
[4] Zhang D, Lee W S. Question classification using support vector machines[C]// Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 2003:26-32.
[5] Kim Y. Convolutional neural networks for sentence classification[C]// Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. 2014:1746-1751.
[6] Lecun Y, Bottou L, Bengio Y, et al. Gradient-based learning applied to document recognition[J]. Proceedings of the IEEE, 1998,86(11):2278-2324.
[7] Ma Mingbo, Huang Liang, Xiang Bing, et al. Group sparse CNNs for question classification with answer sets[C]// Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. 2017,2:335-340.
[8] Gers F A, Schmidhuber J, Cummins F. Learning to forget: Continual prediction with LSTM[J]. Neural Computation, 2000,12(10):2451-2471.
[9] Bahdanau D, Cho K, Bengio Y. Neural Machine Translation by Jointly Learning to Align and Translate[DB/OL]. https://arxiv.org/pdf/1409.0473v7.pdf, 2016-05-19.
[10]Ding Zixiang, Xia Rui, Yu Jianfei, et al. Densely Connected Bidirectional LSTM with Applications to Sentence Classification[DB/OL]. https://arxiv.org/pdf/1802.00889v1.pdf, 2018-02-03.
[11]Graves A, Schmidhuber J. Framewise phoneme classification with bidirectional LSTM and other neural network architectures[J]. Neural Networks, 2005,18(5-6):602-610.
[12]Yang Zichao, Yang Diyi, Dyer C, et al. Hierarchical attention networks for document classification[C]// Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2016:1480-1489.
[13]Jain L C, Medsker L R. Recurrent Neural Networks: Design and Applications[M]. CRC Press, 1999.
[14]Hochreiter S, Schmidhuber J. Long short-term memory[J]. Neural Computation, 1997,9(8):1735-1780.
[15]Graves A. Generating Sequences with Recurrent Neural Networks[DB/OL]. https://arxiv.org/pdf/1308.0850v5.pdf, 2014-06-05.
[16]张栋,李寿山,王晶晶. 基于问题与答案联合表示学习的半监督问题分类方法[J]. 中文信息学报, 2017,31(1):1-7.
[17]Mikolov T, Sutskever I, Chen Kai, et al. Distributed representations of words and phrases and their compositionality[C]// Proceedings of the 2013 Advances in Neural Information Processing Systems. 2013:3111-3119.
[18]Prechelt L. Early stopping-but when?[M]// Neural Networks: Tricks of the Trade. Springer, 1998:55-69.
[19]Srivastava N, Hinton G, Krizhevsky A, et al. Dropout: A simple way to prevent neural networks from overfitting[J]. Journal of Machine Learning Research, 2014,15(1):1929-1958. |