计算机与现代化 ›› 2023, Vol. 0 ›› Issue (04): 7-14.

• 人工智能 • 上一篇    下一篇

基于CNN-BiLSTM网络的数控机床故障文本自动分类

  

  1. (1.河海大学商学院,江苏 南京 211100; 2.常州市工业大数据挖掘与知识管理重点实验室,江苏 常州 213022)
  • 出版日期:2023-05-09 发布日期:2023-05-09
  • 作者简介:徐涯昕(2000—),女,四川乐山人,本科生,研究方向:文本挖掘,深度学习,E-mail: 1963810115@hhu.edu.cn; 何泽恩(2001—),男,广西平南人,本科生,研究方向:机器学习,文本挖掘,E-mail: hezeen@hhu.edu.cn; 通信作者:徐绪堪(1976—),男,湖北武汉人,博士生导师,博士,研究方向:情报挖掘,E-mail: xxkwh@hhu.edu.cn。
  • 基金资助:
    国家社科重大基金资助项目(20&ZD125); 国家级大学生创新创业训练计划项目(202110294082)

Automatic Classification Method of CNC Machine Tool Fault Text Based on CNN-BiLSTM

  1. (1. Business School of Hohai University, Nanjing 211100, China;
    2. Changzhou Key Laboratory of Industrial Big Data Mining and Knowledge Management, Changzhou 213022, China)
  • Online:2023-05-09 Published:2023-05-09

摘要: 中小数控机床企业在运营维护中积累了大量以人工文本记录的故障维修数据。为了实现高效精准分类,帮助维修人员高效开展工作,本文提出一种基于卷积神经网络和双向长短时记忆网络的故障文本分类预测方法。首先通过构建专业特征词库完成预处理,并采用Word2Vec训练词向量;其次CNN层提取文本向量的局部特征后,前后向LSTM提取上下文特征;全连接层对CNN和BiLSTM层进行特征融合和加权后,全连接层通过Softmax激活函数找到概率最大的输出作为预测结果,并用混淆矩阵呈现各个类别预测准确率。本文依据长三角某企业的故障数据进行实验分析,并与单个CNN和BiLSTM模型对比,实验结果表明新方法预测精度可达94%,平均准确率提高11个百分点,P值、R值和F值均达95%,可作为在小数据量故障文本分类领域的有效方法。

关键词: 文本分类, 数控机床故障, 卷积神经网络, 双向长短时记忆网络

Abstract: Small and medium-sized CNC machine tool firms have accumulated a large amount of fault maintenance data recorded in manual text during operation and maintenance. In order to accomplish efficient and accurate classification and help maintenance personnel carry out their work efficiently, this paper proposes a fault text classification and prediction approach based on convolution neural network and bi-directional long-short-term memory network. Firstly, the pre-processing is completed by creating a professional feature word database, and Word2Vec is used to train the word vector. Secondly, after the CNN layer extracts local features from the text vector, context features are extracted from the forward and backward LSTM. After the feature fusion and weighting of CNN and BiLSTM layers in the full connected layer, the full connected layer finds the output with the highest probability as the prediction result through the Softmax activation function, and presents the prediction accuracy of each category with the confusion matrix. Based on the fault data of an enterprise in the Yangtze River Delta, this paper makes an experimental analysis, and compares it with a single CNN and BiLSTM model. The experimental results indicate that the prediction accuracy of the new method is up to 94%, the average accuracy is increased by 11 percentage points, and the P value, R value and F value are all up to 95%, which can be used as an effective method in the field of small data volume fault text classification.

Key words: text classification, CNC machine tool fault, convolutional neural network, bi-directional long short-term memory