计算机与现代化 ›› 2022, Vol. 0 ›› Issue (05): 10-15.

• 算法设计与分析 • 上一篇    下一篇

基于BERT与BiGRU-CRF的交通事故文本信息提取模型

  

  1. (长安大学信息工程学院,陕西西安710064)
  • 出版日期:2022-06-08 发布日期:2022-06-08
  • 作者简介:樊海玮(1974—),男,陕西西安人,副教授,硕士生导师,研究方向:软件系统设计,机器学习,E-mail: fanhaiwei@chd.edu.com; 通信作者:秦佳杰(1995—),男,江苏南通人,硕士研究生,研究方向:深度学习,自然语言处理,E-mail: 1920909528@qq.com; 孙欢(1995—),男,硕士研究生,研究方向:深度学习,E-mail: 346371539@qq.com; 张丽苗(1996—),女,硕士研究生,研究方向:深度学习,E-mail: 346371539@qq.com; 鲁芯丝雨(1997—),女,硕士研究生,研究方向:自然语言处理,E-mail: 1065472693@qq.com。
  • 基金资助:
    国家自然科学基金资助项目(52172325); 国家自然科学基金青年科学基金资助项目(61702050)

 Traffic Accident Text Information Extraction Model Based on BERT and BiGRU-CRF Fusion

  1. (Institute of Information Engineering, Chang’an University, Xi’an710064, China)

  • Online:2022-06-08 Published:2022-06-08

摘要: 针对现存交通事故文本信息中存在的大量时间、地点、伤亡损失等关键异构数据难以有效提取,以及用静态词向量深度学习模型提取交通事故文本信息精确度较低的问题,本文利用BERT(Bidirectional Encoder Representations from Transformers)对文本字符进行动态向量映射,从数据表达源头解决一词多义、上下文依赖不充分等问题;利用BiGRU(Bi-Gate Recurrent Unit)提取文本向量化后的特征,输出高特征的文本序列;利用CRF(Conditional Random Fields)计算全局最优输出节点的概率优势,优化文本序列特征结果,提出一种基于动态字向量的BERT-BiGRU-CRF融合模型,用于交通事故文本关键信息提取。通过对比实验表明,该模型在交通事故文本信息提取中平均准确率为0.952,F1为0.925,比基于静态词向量Word2Vec模型的精确率与F1值分别提高了6.3个百分点和7.9个百分点。

关键词: 深度学习, 文本信息提取, 异构数据, BERT, BiGRU, CRF

Abstract: Aiming at existing traffic accident text data has difficulties in effectively extracting a large number of key heterogeneous data such as time, place and casualty loss, and the accuracy of traffic accident text information extraction methods based on static word vector deep learning model is low. The BERT (Bidirectional Encoder Representations from Transformers) is used for a dynamic vector mapping of the text characters in order to resolve the problem of ambiguity and context dependence insufficient from the source of data representation. The vectored features of text are extracted by using BiGRU(Bi-Gate Recurrent Unit) and text sequences with high features are output. Based on CRF (Conditional Random Fields), the probabilistic advantage of the global optimal output node is calculated to optimize the feature results of text sequence, and a BERT-BiGRU-CRF fusion model based on dynamic word vector is proposed forextracting the key information of traffic accident text. The comparison experiment shows that the average accuracy of the model in traffic accident text information extraction is 0.952 and F1 is 0.925, and 6.3 percentage points and 7.9 percentage points higher respectively than those of the model based on static word vector Word2Vec.

Key words: deep learning, text information extraction, heterogeneous information, BERT, BiGRU, CRF