融合FGM和指针标注的实体关系联合抽取方法

doi:10.3969/j.issn.1006-2475.2023.11.001

摘要/Abstract

摘要： 摘要：实体关系联合抽取是信息抽取的一项重要任务。由于传统的实体关系联合抽取方法把实体之间的关系建模为离散类型，因此不能很好地解决重叠三元组的问题。为了解决难以抽取重叠三元组的问题，本文提出一种融合FGM和指针标注的实体关系联合抽取BERT-FGM模型。该模型将实体之间的关系建模为函数，通过在BERT训练词向量的过程中融入FGM提高模型的鲁棒性。模型首先通过指针标注策略抽取头实体，然后将头实体与句子向量进行融合作为一个新向量，最终将其在预定义的关系条件下抽取头实体对应的尾实体。实验使用的是公开数据集WebNLG，实验结果表明该模型F1值达到90.7%，有效地解决了三元组重叠问题。

关键词: 关键词：实体关系联合抽取, 重叠三元组, BERT, FGM, 指针标注

Abstract: Abstract: Joint extraction of entities and relations is an important task of information extraction. The traditional entity relationship joint extraction method cannot solve the problem of overlapping triples well， because it models the relationship between entities as discrete types. In order to solve the problem that it is difficult to extract overlapping triples， this paper proposes a BERT-FGM model for entity relationship joint extraction， which combines FGM and pointer annotation. In this model， the relationship between entities is modeled as a function， and the robustness of the model is improved by incorporating FGM into the process of BERT training word vector. The model firstly extracts the subjects through the pointer annotation strategy， then fuses the subjects into a sentence vector as a new vector， and finally uses it to extract objects under a predefined relationship condition. Experiments are carried out on public dataset WebNLG， the experimental result shows that the F1 value of the model is 90.7%， it can effectively solve the problem of relationship triples overlapping.

Key words: Key words: joint extraction of entities and relations, overlapping triples, BERT, FGM, pointer annotation

中图分类号:

TP391

刘玉鹏, 葛艳, 杜军威, 陈卓. 融合FGM和指针标注的实体关系联合抽取方法[J]. 计算机与现代化, 2023, 0(11): 1-5.

LIU Yu-peng, GE Yan, DU Jun-wei, CHEN Zhuo. Joint Extraction Method of Entities and Relations Based on FGM and Pointer Annotation[J]. Computer and Modernization, 2023, 0(11): 1-5.

参考文献

［1］ ZHANG L， ZHAO H. Named entity recognition for Chinese microblog with convolutional neural network［C］// 2017 13th International Conference on Natural Computation， Fuzzy Systems and Knowledge Discovery （ICNC-FSKD）. IEEE， 2017:87-92.
［2］陈宇，郑德权，赵铁军. 基于Deep Belief Nets的中文名实体关系抽取［J］. 软件学报， 2012，23（10）:2572-2585.
［3］ CUCERZAN S， YAROWSKY D. Language independent named entity recognition combining morphological and contextual evidence［C］// 1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora. 1999:90-99.
［4］ ZHOU G D， SU J. Named entity recognition using an HMM-based chunk tagger［C］// Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics. 2002:473-480.
［5］ LIU X H， ZHANG S D， WEI F R， et al. Recognizing named entities in tweets［C］// Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. 2011:359-367.
［6］ LAMPLE G， BALLESTEROS M， SUBRAMANIAN S， et al. Neural architectures for named entity recognition ［C］// Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2016:260-270.
［7］柏兵，侯霞，石松. 基于CRF和BI-LSTM的命名实体识别方法［J］. 北京信息科技大学学报（自然科学版）， 2018，33（6）:27-33.
［8］ MA X Z， HOVY E. End-to-end sequence labeling via bi-directional LSTM-CNNS-CRF［C］// Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. 2016:1064-1074.
［9］李明扬，孔芳. 融入自注意力机制的社交媒体命名实体识别［J］. 清华大学学报（自然科学版）， 2019，59（6）:461-467.
［10］ VASWANI A， SHAZEER N， PARMAR N， et al. Attention is all you need［C］// Proceedings of the Advances in Neural Information Processing Systems. 2017:5998-6008.
［11］秦娅，申国伟，赵文波，等. 基于深度神经网络的网络安全实体识别方法［J］. 南京大学学报（自然科学版）， 2019，55（1）:29-40.
［12］ SOCHER R， HUVAL B， MANNING C D， et al. Semantic compositionality through recursive matrix-vector spaces［C］// Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. 2012:1201-1211.
［13］ ZENG D J， LIU K， CHEN Y B， et al. Distant supervision for relation extraction via piecewise convolutional neural networks［C］// Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. 2015:1753-1762.
［14］ ZHOU P， SHI W， TIAN J， et al. Attention-based bidirectional long short-term memory networks for relation classification［C］// Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. 2016:207-212.
［15］鄂海红，张文静，肖思琪，等. 深度学习实体关系抽取研究综述［J］. 软件学报， 2019，30（6）:1793-1818.
［16］ MIWA M， BANSAL M. End-to-end relation extraction using LSTMs on sequences and tree structures［C］// Proceedings of the Meeting of the Association for Computational Linguistics. 2016:1105-1116.
［17］ LI F， ZHANG M S， FU G H， et al. A neural joint model for extracting bacteria and their locations［C］// Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer， 2017:15-26.
［18］ KATIYAR A， CARDIE C. Going out on a limb: Joint extraction of entity mentions and relations without dependency trees［C］// Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. 2017:917-928.
［19］ ZHENG S C， WANG F， BAO H Y， et al. Joint extraction of entities and relations based on a novel tagging scheme［C］// Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics（ACL 2017）. 2017:1227-1236.
［20］ ZENG X R， ZENG D J， HE S Z， et al. Extracting relational facts by an end-to-end neural model with copy mechanism［C］// Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. 2018:506-514.
［21］ FU T J， LI P H， MA W Y. Graphrel: Modeling text as relational graphs for joint entity and relation extraction［C］// Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 2019:1409-1418.
［22］ ZENG X R， HE S Z， ZENG D J， et al. Learning the extraction order of multiple relational facts in a sentence with reinforcement learning［C］// Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing（EMNLP-IJCNLP）. 2019:367-377.
［23］ HANG T T， FENG J， WU Y R， et al. Joint extraction of entities and overlapping relations using source-target entity labeling［J］. Expert Systems with Applications， 2021， 177: 114853.1-114853.15.
［24］ YE H B， ZHANG N Y， DENG S M， et al. Contrastive triple extraction with generative transformer［C］// Proceedings of the AAAI Conference on Artificial Intelligence. 2021:14257-14265.
［25］ WEI Z P， SU J L， WANG Y， et al. A novel cascade binary tagging framework for relational triple extraction［C］// Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020:1476-1488.
［26］ DEVLIN J， CHANG M W， LEE K， et al. BERT: Pretraining of deep bidirectional transformers for language understanding［C］// Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2019:4171-4186.
［27］ GOODFELLOW I J，SHLENS J，SZEGEDY C. Explaining and harnessing adversarial examples［J］. arXiv preprint arXiv:1412.6572， 2014.
［28］ ZENG X R， ZENG D J， HE S E， et al. Extracting relational facts by an end-to-end neural model with copy mechanism［C］// Proceedings of the 56th Annual Meeting of the ACL. 2018:506-514.

[1]	郑久超, 赵新元. 基于主题与描述信息的实体链接方法[J]. 计算机与现代化, 2024, 0(12): 10-14.
[2]	马钰, 杨勇, 任鸽, 帕力旦·吐尔逊. 基于GCN和微调BERT的作文自动评分方法[J]. 计算机与现代化, 2024, 0(09): 33-37.
[3]	赵盾1, 佘学兵2, 邬昌兴3. 基于BERT-BiLSTM-CRF党建领域命名实体识别[J]. 计算机与现代化, 2024, 0(09): 91-94.
[4]	王谭, 陈金广, 马丽丽. 融合词典信息和句子语义的中文命名实体识别[J]. 计算机与现代化, 2024, 0(03): 24-28.
[5]	郑立瑞, 肖晓霞, 邹北骥, 刘彬, 周展. 基于BERT的电子病历命名实体识别[J]. 计算机与现代化, 2024, 0(01): 87-91.
[6]	唐诗琪, 周瑞平, 谢仕斌, 刘梦赤, 肖文, . 基于栈式降噪编码器的跨语言多标签情感分类[J]. 计算机与现代化, 2023, 0(11): 6-12.
[7]	李诗月, 孟佳娜, 于玉海, 李雪莹, 许英傲. 基于知识增强的方面级情感分析方法[J]. 计算机与现代化, 2023, 0(10): 1-8.
[8]	谢世超, 黄蔚, 任祥辉. 一种基于BERT的文本实体链接方法[J]. 计算机与现代化, 2023, 0(02): 58-61.
[9]	于清, 马志龙, 徐春. 基于BERT和非自回归的医疗知识抽取[J]. 计算机与现代化, 2023, 0(01): 120-126.
[10]	朱亚军, 拥措, 尼玛扎西, . 基于藏文BERT的藏医药医学实体识别[J]. 计算机与现代化, 2023, 0(01): 43-48.
[11]	黄忠祥, 李明. ALBERT结合双向网络的文本分类[J]. 计算机与现代化, 2022, 0(10): 8-12.
[12]	陈钢. 融合RoBERTa和特征提取的政务热线工单分类[J]. 计算机与现代化, 2022, 0(06): 21-26.
[13]	张军, 邱龙龙. 一种基于BERT和池化操作的文本分类模型[J]. 计算机与现代化, 2022, 0(06): 1-7.
[14]	樊海玮, 秦佳杰, 孙欢, 张丽苗, 鲁芯丝雨. 基于BERT与BiGRU-CRF的交通事故文本信息提取模型[J]. 计算机与现代化, 2022, 0(05): 10-15.
[15]	刘梦颖, 王勇. 基于文本双表示模型的微博热点话题发现[J]. 计算机与现代化, 2021, 0(12): 110-115.