Computer and Modernization, 2024, Vol. 0, Issue (09): 56-60. doi: 10.3969/j.issn.1006-2475.2024.09.010


Text Clustering Method for Fragmented Reply Based on Dissimilarity Matrix

  

1. State Grid Fujian Electric Power Company, Fuzhou 350000, China; 2. Fujian Yirong Information Technology Co., Ltd., Fuzhou 350003, China; 3. State Grid Corporation of China, Beijing 100000, China
Online: 2024-09-27   Published: 2024-09-29

Abstract: To address the problem of extracting the required information from fragmented reply texts in Q&A communities, this paper proposes a clustering method for fragmented reply texts based on a dissimilarity matrix. First, cluster centers are designed according to the dissimilarity between texts, and the fragmented reply texts in the community are grouped by clustering. Then, text features of user questions are extracted with an RNN+CNN model. Finally, the extracted question features are used with the TF-IDF algorithm to extract the relevant fragmented reply texts automatically. Experimental results show that the proposed method extracts the required text information automatically with high accuracy and stability, and that it can be applied to the extraction of fragmented reply texts in Q&A communities.

Key words: question-answer community, fragmented reply text, automatic extraction, clustering, dissimilarity
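
The pipeline summarized in the abstract (cluster replies on a dissimilarity matrix, then pull out the replies that match a question) can be illustrated with a minimal sketch. The abstract does not specify how the cluster centers are designed or how the RNN+CNN features are built, so this sketch makes two loudly stated assumptions: a k-medoids-style procedure stands in for the paper's center design, and a plain TF-IDF vector of the question stands in for the RNN+CNN question features. All function names (`tfidf_vectors`, `cluster_by_medoids`, `extract_for_question`) and the sample replies are hypothetical.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict: term -> weight) per tokenised doc."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        # Smoothed IDF; the exact weighting used in the paper is not known.
        vectors.append({t: (c / len(doc)) * (math.log((1 + n) / (1 + df[t])) + 1)
                        for t, c in tf.items()})
    return vectors

def dissimilarity(u, v):
    """Cosine dissimilarity: near 0 for same direction, 1 for no shared terms."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm = (math.sqrt(sum(w * w for w in u.values()))
            * math.sqrt(sum(w * w for w in v.values())))
    return 1.0 - dot / norm if norm else 1.0

def dissimilarity_matrix(vectors):
    n = len(vectors)
    return [[dissimilarity(vectors[i], vectors[j]) for j in range(n)]
            for i in range(n)]

def cluster_by_medoids(dist, k, max_iter=20):
    """k-medoids-style clustering on a precomputed dissimilarity matrix.

    Assumption: this replaces the paper's (unspecified) center design.
    """
    n = len(dist)
    # Farthest-first initialisation: start at text 0, then repeatedly add
    # the text that is farthest from all centers chosen so far.
    centers = [0]
    while len(centers) < k:
        centers.append(max(range(n),
                           key=lambda i: min(dist[i][c] for c in centers)))
    labels = []
    for _ in range(max_iter):
        # Assign each text to its least dissimilar center.
        labels = [min(range(k), key=lambda c: dist[i][centers[c]])
                  for i in range(n)]
        # Move each center to the member with the smallest total dissimilarity.
        new_centers = []
        for c in range(k):
            members = [i for i in range(n) if labels[i] == c]
            if members:
                new_centers.append(
                    min(members, key=lambda m: sum(dist[m][j] for j in members)))
            else:
                new_centers.append(centers[c])
        if new_centers == centers:
            break
        centers = new_centers
    return labels, centers

def extract_for_question(question, replies, k):
    """Return the replies in the cluster whose center best matches the question."""
    docs = [r.lower().split() for r in replies] + [question.lower().split()]
    vectors = tfidf_vectors(docs)
    reply_vecs, q_vec = vectors[:-1], vectors[-1]
    labels, centers = cluster_by_medoids(dissimilarity_matrix(reply_vecs), k)
    best = min(range(k),
               key=lambda c: dissimilarity(q_vec, reply_vecs[centers[c]]))
    return [r for r, lab in zip(replies, labels) if lab == best]

# Hypothetical sample data, not taken from the paper.
replies = [
    "reset the router and restart the modem",
    "restart the modem then reset the router",
    "add flour and sugar to the bowl",
    "mix the flour and sugar in the bowl",
]
picked = extract_for_question("how do I reset my router", replies, k=2)
```

Working from a precomputed dissimilarity matrix means the clustering step never needs the raw vectors, so any pairwise text dissimilarity could be swapped in without changing `cluster_by_medoids`.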
