基于BERT与BiGRU-CRF的交通事故文本信息提取模型

计算机与现代化 ›› 2022, Vol. 0 ›› Issue (05): 10-15.

基于BERT与BiGRU-CRF的交通事故文本信息提取模型

(长安大学信息工程学院,陕西西安710064)

出版日期:2022-06-08 发布日期:2022-06-08
作者简介:樊海玮（1974—），男，陕西西安人，副教授，硕士生导师，研究方向：软件系统设计，机器学习，E-mail: fanhaiwei@chd.edu.com；通信作者：秦佳杰（1995—），男，江苏南通人，硕士研究生，研究方向：深度学习，自然语言处理，E-mail: 1920909528@qq.com；孙欢（1995—），男，硕士研究生，研究方向：深度学习，E-mail: 346371539@qq.com；张丽苗（1996—），女，硕士研究生，研究方向：深度学习，E-mail: 346371539@qq.com；鲁芯丝雨（1997—），女，硕士研究生,研究方向：自然语言处理，E-mail: 1065472693@qq.com。
基金资助:
国家自然科学基金资助项目（52172325）；国家自然科学基金青年科学基金资助项目(61702050)

Traffic Accident Text Information Extraction Model Based on BERT and BiGRU-CRF Fusion

(Institute of Information Engineering， Chang’an University， Xi’an710064, China)

Online:2022-06-08 Published:2022-06-08

摘要/Abstract

摘要： 针对现存交通事故文本信息中存在的大量时间、地点、伤亡损失等关键异构数据难以有效提取，以及用静态词向量深度学习模型提取交通事故文本信息精确度较低的问题，本文利用BERT(Bidirectional Encoder Representations from Transformers)对文本字符进行动态向量映射，从数据表达源头解决一词多义、上下文依赖不充分等问题；利用BiGRU(Bi-Gate Recurrent Unit)提取文本向量化后的特征，输出高特征的文本序列；利用CRF(Conditional Random Fields)计算全局最优输出节点的概率优势，优化文本序列特征结果，提出一种基于动态字向量的BERT-BiGRU-CRF融合模型，用于交通事故文本关键信息提取。通过对比实验表明，该模型在交通事故文本信息提取中平均准确率为0.952，F1为0.925，比基于静态词向量Word2Vec模型的精确率与F1值分别提高了6.3个百分点和7.9个百分点。

关键词: 深度学习, 文本信息提取, 异构数据, BERT, BiGRU, CRF

Abstract: Aiming at existing traffic accident text data has difficulties in effectively extracting a large number of key heterogeneous data such as time, place and casualty loss, and the accuracy of traffic accident text information extraction methods based on static word vector deep learning model is low. The BERT (Bidirectional Encoder Representations from Transformers) is used for a dynamic vector mapping of the text characters in order to resolve the problem of ambiguity and context dependence insufficient from the source of data representation. The vectored features of text are extracted by using BiGRU(Bi-Gate Recurrent Unit) and text sequences with high features are output. Based on CRF (Conditional Random Fields), the probabilistic advantage of the global optimal output node is calculated to optimize the feature results of text sequence, and a BERT-BiGRU-CRF fusion model based on dynamic word vector is proposed forextracting the key information of traffic accident text. The comparison experiment shows that the average accuracy of the model in traffic accident text information extraction is 0.952 and F1 is 0.925, and 6.3 percentage points and 7.9 percentage points higher respectively than those of the model based on static word vector Word2Vec.

Key words: deep learning, text information extraction, heterogeneous information, BERT, BiGRU, CRF

樊海玮, 秦佳杰, 孙欢, 张丽苗, 鲁芯丝雨. 基于BERT与BiGRU-CRF的交通事故文本信息提取模型[J]. 计算机与现代化, 2022, 0(05): 10-15.

FAN Hai-wei, QIN Jia-jie, SUN Huan, ZHANG Li-miao, LU Xin-siyu. Traffic Accident Text Information Extraction Model Based on BERT and BiGRU-CRF Fusion[J]. Computer and Modernization, 2022, 0(05): 10-15.

参考文献

［1］张亚丽. 世界卫生组织发布《2018年全球道路安全现状报告》［J］. 中华灾害救援医学, 2019,7(2):48-49.
［2］曾祥坤,张俊辉,石拓. 基于主题提取模型的交通违法行为文本数据的挖掘［J］. 电子技术应用, 2019,45(6):41-45.
［3］FELDMAN R, DAGAN I. Knowledage discovery in textual databases(KDT)［C］// Proceedings of the 1st International Conference on Knowledge Discovery and Data Mining (KDD-95). 1995,95:112-117.
［4］FRANKS B. Taming the Big Data Tidal Wave: Finding Opportunities in Huge Data Streams with Advanced Analytics［M］.Wiley, 2012.
［5］NAYAK R, PIYATRAPOOMI N, WELIGAMAGE J. Application of text mining in analysing road crashes for road asset management［M］// Engineering Asset Lifecycle Management. Springer, 2010:49-58.
［6］GAO L, WU H. Verb-based text mining of road crash report［C］// Transportation Research Board the 92nd Annual Meeting. 2013:174-181.
［7］YOU J R, SANG H B. Analysis of the unstructured traffic report from traffic broadcasting network by adapting the text mining methodology［J］. The Journal of the Korea Institute of Intelligent Transport Systems, 2018,17(3):87-97.
［8］GOPALAKRISHNAN K, KHAITAN S K. Text mining transportation research grant big data: Knowledge extraction and predictive modeling using fast neural nets［J］. International Journal for Traffic and Transport Engineering (IJTTE), 2017,7(3):354-367.
［9］GASMI H, LAVAL J, BOURAS A. Information extraction of cybersecurity concepts: An LSTM approach［J］. Applied Sciences, 2019,9(19):39-45.
［10］GRAVES A, JAITLY N, MOHAMEDA R. Hybrid speech recognition with deep bidirectional LSTM［C］// 2013 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). IEEE, 2013. DOI:10.1109/ASRU.2013.6707742.
［11］赵瑞晨. 基于深度学习的铁路设备事故数据挖掘与分析［D］. 北京:北京交通大学, 2020.
［12］NGUYEN N, GUO Y. Comparisons of sequence labeling algorithms and extensions［C］// International Conference on Machine Learning. ACM, 2007:681-688.
［13］余同瑞,金冉,韩晓臻,等. 自然语言处理预训练模型的研究综述［J］. 计算机工程用, 2020,56(23):12-18.
［14］PETERS M, NEUMANN M, IYYER M, et al. Deep contextualized word representations［C］// Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2018:253-261.
［15］VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need［C］// Proceedings of the 31st International Conference on NeuralInformation Processing Systems. 2017:6000-6010.
［16］北京创想安科科技有限公司. 安全管理网［EB/OL］. ［2021-09-21］. http://www.safehoo.com/NewsSpecial/Traffic/.
［17］贾熹滨,叶颖婕,陈军成. 基于关联规则的交通事故影响因素的挖掘［J］. 计算机科学, 2018,45(S1):447-452.
［18］张振宇. 基于自然语言理解的安全事故信息处理系统的设计与实现［D］. 北京:华北电力大学(北京), 2017.
［19］PENNINGTON J, SOCHER R, MANNING C D. Glove: Global vectors for word representation［C］// Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing(EMNLP). 2014:1532-1543.
［20］ABEGAZ T, BERHANE Y, WORKUA, et al. Effects of excessive speeding and falling asleep while driving on crash injury severity in Ethiopia: A generalized ordered logit model analysis［J］. Accident Analysis & Prevention, 2014,71:15-21.
［21］WU Q, CHEN F, ZHANG G H, et al. Mixed logit model-based driver injury severity investigations in single- and multi-vehicle crashes on rural two-lane highways［J］. Accident Analysis & Prevention, 2014,72:105-115.
［22］PENG Y Y, BOYLE L N. Commercial driver factors in run-off-road crashes［J］. Transportation Research Record Journal of the Transportation Research Board, 2012,2281:128-132.
［23］DELEN D, SHARDA R, BESSONOV M. Identifying significant predictors of injury severity in traffic accidents using a series of artificial neural networks［J］. Ergonomics, 2001,44(1):107-117.
［24］蔡娜. 女性驾驶员道路交通事故影响因素分析［D］. 北京:北京工业大学, 2010.
［25］韦瑜佳,陈梦甜. 基于网络节点重要性排序的较大交通事故影响因素分析［J］. 科技创新与应用, 2019(21):44-47.
［26］黄合来,周汉楚,潘震宇,等. 一种文本挖掘应用于道路交通事故数据处理的方法［P］. 湖南省：CN110134963A, 2019-08-16.
［27］王莉. 基于知识图谱的城市轨道交通建设安全管理智能知识支持研究［D］. 徐州:中国矿业大学, 2019.

[1]	郑久超, 赵新元. 基于主题与描述信息的实体链接方法[J]. 计算机与现代化, 2024, 0(12): 10-14.
[2]	祁贤, 刘大铭, 常佳鑫. 基于改进自注意力机制的多视图三维重建[J]. 计算机与现代化, 2024, 0(11): 106-112.
[3]	陈凯1, 李宜汀1, 2, 全华凤1 . 基于改进YOLOv8的河道废弃瓶检测方法[J]. 计算机与现代化, 2024, 0(11): 113-120.
[4]	杨骏1, 胡为1, 朱文福2. 基于改进MobileNetV3的视觉SLAM回环检测算法[J]. 计算机与现代化, 2024, 0(10): 21-26.
[5]	杜猛俊1, 李昂1, 童俊1, 钱锦1, 康恺1, 王若丁1, 靳文星2. 基于改进极限学习算法的电力信息数据融合模型[J]. 计算机与现代化, 2024, 0(10): 61-64.
[6]	王莹莹, 郝潇. 基于Res2Net和递归门控卷积的细粒度图像分类[J]. 计算机与现代化, 2024, 0(10): 74-79.
[7]	史星宇1, 李强2, 庄莉3, 梁懿3, 王秋琳3, 陈锴3, 伍臣周3, 常胜1. 一种面向工业部署的目标检测模型蒸馏技术[J]. 计算机与现代化, 2024, 0(10): 93-99.
[8]	马钰, 杨勇, 任鸽, 帕力旦·吐尔逊. 基于GCN和微调BERT的作文自动评分方法[J]. 计算机与现代化, 2024, 0(09): 33-37.
[9]	赵盾1, 佘学兵2, 邬昌兴3. 基于BERT-BiLSTM-CRF党建领域命名实体识别[J]. 计算机与现代化, 2024, 0(09): 91-94.
[10]	张泽1, 张建权2, 3, 周国鹏2, 3. 基于改进YOLOv8s的摄像头模组缺陷检测[J]. 计算机与现代化, 2024, 0(09): 107-113.
[11]	程亚子1, 雷亮1, 2, 陈瀚1, 赵毅然1. 基于转置注意力的多尺度深度融合单目深度估计[J]. 计算机与现代化, 2024, 0(09): 121-126.
[12]	程萌, 李浩. 改进YOLOv5s的落叶树鸟巢检测方法[J]. 计算机与现代化, 2024, 0(08): 24-29.
[13]	王梦溪, 李峻. 老年人跌倒检测技术研究综述[J]. 计算机与现代化, 2024, 0(08): 30-36.
[14]	时现伟1, 范鑫2. 基于轻量化的视频帧场景语义分割方法[J]. 计算机与现代化, 2024, 0(08): 49-53.
[15]	徐新爱, 李钢. 基于DCGAN的课堂表情图像生成方法[J]. 计算机与现代化, 2024, 0(08): 88-91.