计算机与现代化

• 人工智能 • 上一篇    下一篇

基于Period-Near算法的用户移动位置预测

  

  1. (山东大学计算机科学与技术学院,山东 济南 250101)
  • 收稿日期:2016-09-09 出版日期:2017-06-23 发布日期:2017-06-23
  • 作者简介:高夏(1988-),男,江苏镇江人,山东大学计算机科学与技术学院硕士研究生,研究方向:数据挖掘,大数据处理。

Prediction of User’s Moving Location Based on Period-Near Algorithm

  1. (School of Computer Science & Technology, Shandong University, Jinan 250101, China)
  • Received:2016-09-09 Online:2017-06-23 Published:2017-06-23

摘要: 新浪微博是一种允许大量用户彼此分享包括位置在内的个人信息的电子媒介,它使得掌握用户的运动轨迹成为可能。尽管用户的运动和移动模式有着高度的自由性和多样性,但是周期性的运动是非常频繁的现象,因此寻找用户的周期行为对于了解用户的动作至关重要。在本文中将这个问题定义为“预测用户将要去哪里”,该问题涉及2个子问题:如何发现用户的历史行为以及如何应用用户的历史行为来预测其将来的行为。假设用户的行为是周期性的,并且如果用户在一个位置的时间足够长,那么他/她将会一直待在这个位置。基于这2个假设,提出一个4阶段算法Period-Near来解决这个问题。在算法的第1阶段挖掘用户的周期性行为,第2阶段发现其较为频繁的移动,第3阶段了解用户在最近一段时间所处的位置,第4阶段是根据前3个阶段来预测用户接下来将要去哪里。无论是在综合数据上还是实际数据上的实验研究均表明本文方法具有一定的有效性。

关键词: 行为预测, 频繁移动, 实际生活需求, 交叉定位

Abstract: Sina Weibo is an electronic medium that allows a large number of users to share personal information with each other including location information, which makes it possible to know users’ movement. Even though users’ movement and mobility patterns have a high degree of freedom and variation, periodicity is a frequently happening phenomenon for users. Finding periodic behaviors is essential for understanding user movements. In this paper, we address the problem as to predict where a user will go. It involves two sub-problems: how to detect users’ historical behavior, and how to use historical behaviors to predict the behavior in the future. Our main assumptions are that users’ behaviors are periodic and a user will stay in one location if he or she stays in this location for a long time. Based on these assumptions, we propose a 4-stage algorithm, Period-Near, to solve the problem. At the first stage, we mine the periodic behaviors of a user, then, find frequent transfers. At the third stage, we aim to know where the user is in the nearest time. At last, we consider the three stages together to predict where the user will go in the next time. Empirical studies on both synthetic and real data sets demonstrate the effectiveness of our method.

Key words: behavior prediction, frequent movement, real-life needs, cross location

中图分类号: