[1] 罗永莲,赵昌垣,贾玉芳,等. 基于朴素贝叶斯Web新闻内容的抽取方法[J]. 计算机与现代化, 2016(1):59-63.
[2] 张剑,吴青,羊昕旖,等. 基于条件随机场的农业命名实体识别[J]. 计算机与现代化, 2018(1):123-126.
[3] Kalchbrenner N, Grefenstette E, Blunsom P. A convolutional neural network for modelling sentences[C]// Proceedings of the 52nd Annual Meeting of the Association for Computational Linguisties. 2014:655-665.〖HJ1.4mm〗
[4] Yin Wenpeng, Schütze H. Convolutional neural network for paraphrase identification[C]// Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2015:901-911.
[5] Zhang Ye, Wallace B C. A Sensitivity Analysis of (and Practitioners’ Guide to) Convolutional Neural Networks for Sentence Classification[DB/OL].https://arxiv.org/pdf/1510.03820v4.pdf, 2016-04-06.
[6] Sun Shichang, Liu Hongbo, Lin Hongfei, et al. Twitter part-of-speech tagging using pre-classification hidden Markov model[C]// Proceedings of the 2012 IEEE International Conference on Systems, Man, and Cybernetics. 2012:1118-1123.
[7] Mikolov T, Sutskever I, Chen Kai, et al. Distributed representations of words and phrases and their compositionality[C]// Proceedings of the 2013 International Conference on Neural Information Processing Systems. 2013:3111-3119.
[8] Zhiyuli A, Liang Xun, Xu Zhiming. Learning distributed representations for large-scale dynamic social networks[C]// Proceedings of the 2017 IEEE Conference on Computer Communications. 2017, doi: 10.1109/INFOCOM. 2017.8057104.
[9] Mikolov T, Chen Kai, Corrado G, et al. Efficient Estimation of Word Representations in Vector Space[DB/OL].https://arxiv.org/pdf/1301.3781v3.pdf, 2013-09-07.
[10]Rong Xin. Word2vec Parameter Learning Explained[DB/OL]. https://arxiv.org/pdf/1411.2738v4.pdf, 2016-06-05.
[11]Goldberg Y, Levy O. Word2vec Explained: Deriving Mikolov et al.’s Negative-sampling Word-embedding method[DB/OL]. https://arxiv.org/pdf/1402.3722v1.pdf, 2014-02-15.
[12]Kim Y. Convolutional neural networks for sentence classification[C]// Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. 2014:1746-1751.
[13]Oyama Y, Nomura A, Sato I, et al. Predicting statistics of asynchronous SGD parameters for a large-scale distributed deep learning system on GPU supercomputers[C]// Proceedings of the 2016 IEEE International Conference on Big Data. 2016:66-75.
[14]Zeiler M D, Taylor G W, Fergus R. Adaptive deconvolutional networks for mid and high level feature learning[C]// Proceedings of the 2011 International Conference on Computer Vision. 2011:2018-2025.
[15]Hinton G E, Srivastava N, Krizhevsky A, et al. Improving Neural Networks by Preventing Co-adaptation of Feature Detectors[DB/OL]. https://arxiv.org/pdf/1207.0580v1.pdf, 2012-07-03.
[16]Kone〖XCC2.TIF〗ny J, Liu Jie, Richtárik P, et al. Mini-batch semi-stochastic gradient descent in the proximal setting[J]. IEEE Journal of Selected Topics in Signal Processing, 2016,10(2):242-255.
[17]黄卫春,范少帅,熊李艳,等. 基于特征选择的人物关系抽取方法[J]. 科学技术与工程, 2015,15(3):254-259. |