计算机与现代化

• 数据库与数据挖掘 • 上一篇    

基于多特征融合的电影推荐系统

  

  1. (中山大学新华学院信息科学学院,广东广州510520)
  • 收稿日期:2019-06-21 出版日期:2019-08-15 发布日期:2019-08-16
  • 作者简介:黎丹雨(1990-),女,湖北枣阳人,助教,硕士,研究方向:数据挖掘,机器学习,E-mail: 593171487@qq.com。
  • 基金资助:
    广州市科技计划项目(201804010265); 广东省新工科研究与实践项目(2017CXQX001); 中山大学新华学院专业综合改革试点“软件工程”(2018Z001)

Movie Recommendation System Based on Multi-feature Fusion

  1. (School of Information Science, Xinhua College of Sun Yat-sen University, Guangzhou 510520, China)
  • Received:2019-06-21 Online:2019-08-15 Published:2019-08-16

摘要: 协同过滤算法(CF)根据用户-物品的评分矩阵做推荐,未考虑物品自身属性。本文将MovieLens数据集上的电影属性,作为影响推荐结果的因素,融合电影的简介、评论、评分、导演和演员等多种因素,进行推荐。使用CNN(卷积神经网络)和Word2Vec(Word to Vector,词向量模型)处理电影简介;使用AFINN(Finn rup Nielsen情感词典)处理评论,并对结果进行映射;对导演和演员数据进行建模,得到该因素下的预测评分,最后将各因素下的结果进行加权融合,通过调整权重,得到最佳效果。经验证,该方法的推荐性能优于传统的CF算法。

关键词: 多因素, 融合, 电影, 推荐系统

Abstract: Collaborative Filtering Algorithm(CF) makes recommendations based on the user-item scoring matrix, without considering the item’s own attributes. In this paper, the movie attributes on the MovieLens dataset are used as factors influencing the recommendation results, and are combined with various factors such as the introduction, comments, ratings, directors, and actors of the movie. CNN (Convolutional Neural Network) and Word2Vec (Word to Vector) word vector model are used to process the movie introduction; AFINN (Finn rup Nielsen Emotion Dictionary) is used to process the comments and the results are mapped; the director and actor data are modeled to get the prediction score under the factors, and finally the results under the various factors are weighted and combined, and the weight is adjusted to obtain the best effect. It is verified that the recommended performance of this method is better than the traditional CF algorithm.

Key words:  multi-factor, fusion, film, recommendation system

中图分类号: