计算机与现代化

• 人工智能 • 上一篇    下一篇

一种改进的基于反k近邻的流数据离群点检测算法

  

  1. (西北工业大学理学院,陕西 西安 710114)
  • 收稿日期:2015-12-29 出版日期:2016-08-18 发布日期:2016-08-11
  • 作者简介:呼妮(1990-),女,陕西延安人,西北工业大学理学院硕士研究生,研究方向:离群点检测; 通信作者:王勇(1973-),男,副教授,硕士生导师,博士,研究方向:运筹学,数据挖掘,人工智能。
  • 基金资助:
    西北工业大学基础研究基金资助项目(JC201273)

An Improved Stream Data Outlier Mining Algorithm Based on Reverse k Nearest Neighbors

  1. (School of Natural and Applied Sciences, Northwestern Polytechnical University, Xi’an 710114, China)
  • Received:2015-12-29 Online:2016-08-18 Published:2016-08-11

摘要: 现有反k邻域的流数据离群点挖掘算法存在一些不足之处,即需要遍历每个数据对象,计算复杂度较高,稳定性较差。为了解决这些问题,本文提出一种改进的基于反k近邻的离群点检测算法OL-ORND。该算法采用细胞邻域思想,加入伪反k邻域点概念(反k邻域为空集的点对象),增加了算法的严密性,从而大大提高了算法的效率和准确率。实验表明,算法具有较好的性能。

关键词: 流数据, 反k近邻, 细胞邻域, 离群点

Abstract: The existing stream data outliers mining algorithms based on the reverse k neighbors need to traverse each data object, so the computational complexity is higher and the stability is lower. In order to solve these problems, this paper puts forward an improved outliers detection algorithm based on reverse k nearest neighbors named OL-ORND. Using the idea of cell neighbors, adding the false k reverse neighbors object concept that does not belong to the reverse k neighborhood. So that it can improve the efficiency and accuracy of the algorithm. Through the experiment, we can see that the algorithm has good performance.

Key words: stream data;reverse k nearest neighbors, cell neighbors, outliers

中图分类号: