Computer and Modernization ›› 2023, Vol. 0 ›› Issue (01): 30-36.

Previous Articles     Next Articles

Chinese Short Text Entity Disambiguation Based on Multi-feature Factor Fusion

  

  1. (School of Software,  Jiangxi Normal University, Nanchang 330022, China)
  • Online:2023-03-02 Published:2023-03-02

Abstract: Most of the existing Chinese short text entity disambiguation models only consider the semantic matching features between the mention context and the description of the candidate entity in the disambiguation process, and do not consider the effective disambiguation features such as the co-occurrence features between the candidate entities in the same query text and the similarity features between the mention type of the candidate entities and entities. To solve these problems, this paper first uses the pre-training language model to obtain the semantic matching features of mention context and candidate entity description. Then, co-occurrence feature and type feature are proposed for entity embedding and mention type embedding. Finally, by fusing the above features, the entity disambiguation model based on multi feature factors is realized. The experimental results show that the co-occurrence features and type features proposed in this paper are feasible and effective in entity disambiguation, and the entity disambiguation method based on multi-feature factor fusion proposed in this paper can achieve better disambiguation effect.

Key words: co-occurrence feature, type feature, multi-feature factor, multi-head attention, Ernie