Computer and Modernization

Previous Articles     Next Articles

biRNN-based Method for Processing Unbalanced Text Data Sets of Naval Ordnance

  

  1. (1. Naval Aeronautical University, Yantai 264001, China; 2. Naval 92665th Troop, Zhangjiajie 427000, China)
  • Received:2019-04-23 Online:2019-12-11 Published:2019-12-11

Abstract: Traditional unbalanced data sets processing methods are characterized by complicated artificial settings and poor universality, which are difficult to be applied to naval ordnance unbalanced text data sets processing. Aiming at this problem, this paper proposes a method of processing unbalanced text data sets of naval ordnance based on biRNN model. The biRNN model is used to automatically learn the features of text sequences and expand a few types of texts by two-way text sequence prediction to achieve the goal of text data balancing. The whole text data set is expanded on the basis of balanced data set. Text classification experiments are carried out on the original data set, the balanced data set and the extended data set. The experimental results show that the unbalanced data set expansion method based on biRNN can effectively improve the performance of text classification by balancing and extending the original data set.

Key words: deep learning, naval ordnance, unbalanced data set, bidirectional recurrent neural network, text data mining

CLC Number: