计算机与现代化 ›› 2022, Vol. 0 ›› Issue (03): 53-58.

• 数据库与数据挖掘 • 上一篇    下一篇

基于LDA的隐式标签协同过滤推荐算法

  

  1. (1.长沙理工大学物理与电子科学学院,湖南长沙410114;
    2.长沙理工大学近地空间电磁环境监测与建模湖南省普通高校重点实验室,湖南长沙410114)
  • 出版日期:2022-04-29 发布日期:2022-04-29
  • 作者简介:文勇军(1975—),男,湖南东安人,讲师,硕师生导师,博士,研究方向:大数据挖掘与分析,E-mail: micowen@csust.edu.cn; 何环晶(1996—),女,江西萍乡人,硕士研究生,研究方向:数据挖掘及应用,E-mail: 1724851946@qq.com; 通信作者:唐立军(1963—),男,湖南邵阳人,教授,博士生导师,研究方向:信号检测与处理,E-mail: tanglj2009@163.com。
  • 基金资助:
    湖南省重点研发计划项目(2018GK2054); 近地空间电磁环境监测与建模湖南省高校重点实验室开放基金资助项目(N201907)

Implicit Tag Collaborative Filtering Recommendation Algorithm Based on LDA

  1. (1. School of Physical & Electric Science, Changsha University of Science & Technology, Changsha 410114, China; 
    2. Hunan Province Higher Education Key Laboratory of Modeling and Monitoring 
    on the Near-earth Electromagnetic Environments, Changsha 410114, China)
  • Online:2022-04-29 Published:2022-04-29

摘要: 固定标签协同过滤推荐算法,未充分考虑标签因子的多样化,主要依靠人工标记,扩展性不强,主观因素多。本文从用户的喜好特征因素角度出发,在固定标签协同过滤推荐算法的基础上,提出一种隐式标签协同过滤推荐算法。该算法利用LDA主题模型生成项目文本的隐式标签,得到项目-标签特征权重,根据算法性能优化的要求选择标签数量,将项目-标签矩阵与用户评分矩阵结合得到用户对标签的偏好矩阵,最后通过协同过滤算法产生推荐。实验结果表明,本文提出的基于LDA的隐式标签协同过滤推荐算法缓解了数据稀疏性问题,项目推荐的召回率、准确度和F1值有较大提升。

关键词: 固定标签, 协同过滤, LDA主题模型, 隐式标签, 算法改进

Abstract: The fixed tag collaborative filtering recommendation algorithm does not fully consider the diversity of tag factors, and mainly relies on manual tagging, which is not scalable and has many subjective factors. In this paper, based on the fixed tag collaborative filtering recommendation algorithm, an implicit tag collaborative filtering recommendation algorithm is proposed from the perspective of user preferences. This algorithm uses LDA topic model to generate implicit tags of item text, and obtains item-tag feature weights. The number of tags is selected according to the requirements of algorithm performance optimization, and the user’s preference matrix for tags is obtained by combining the item-tag matrix with the user scoring matrix. Finally, the recommendation is generated by collaborative filtering algorithm. The experimental results show that the user-based LDA tag collaborative filtering algorithm proposed in this paper alleviates the problem of data sparsity, and greatly improves the recall rate, accuracy and F1 value of item recommendation.

Key words: fixed label, collaborative filtering, LDA theme model, implicit label, algorithm improvement