计算机与现代化

• 数据库与数据挖掘 • 上一篇    下一篇

基于离均差的时间序列相似性度量

  

  1. 江南大学数字媒体学院,江苏无锡214122
  • 收稿日期:2016-07-13 出版日期:2017-05-26 发布日期:2017-05-31
  • 作者简介:曹洋洋(1991-),女,河北邯郸人,江南大学数字媒体学院硕士研究生,研究方向:数据挖掘; 林意(1960-),男,江苏无锡人,副教授,硕士生导师,博士,研究方向:数据挖掘,计算机图形学。

 Similarity Measure of Time Series Based on Mean Deviations

  1. College of Digital and Media, Jiangnan University, Wuxi 214122, China
  • Received:2016-07-13 Online:2017-05-26 Published:2017-05-31

摘要:  时间序列的相似性度量是数据挖掘领域研究的一个热点,高维多元时间序列数据一般含有大量的噪声不利于相似性的比较。针对现有的时间序列度量方法存在的问题,在改进的时间序列自底向上融合算法的基础上,提出一种新的基于离均差的时间序列相似性度量的夹角余弦算法(Angle Cosine Metric Similarity,ACMS)。ACMS算法将时间序列等价为一个多维度的向量,充分考虑2个向量的方向和大小特征,增强振幅变化的鲁棒性,减少人为干扰,对数据挖掘中的聚类和预测具有帮助作用。

关键词:  , 时间序列, 向量, 夹角余弦, 相似性度量, 离均差

Abstract:  The similarity measure of time series is a hot spot in the field of data mining. High dimensional multivariate time series data generally contain a large amount of noise which is not conducive to the comparison of similarity. In view of the problems in the existing time series measure method, the paper presents a new angle cosine algorithm based on the piecewise linear representation of time series shortly called ACMS. ACMS algorithm will be equivalent to a multi-dimensional vector of time series, the direction and size characteristics of the two vectors are fully considered, which enhances the robustness of time and amplitude variation, reduces human disturbance, and is helpful to the clustering and prediction in data mining.

Key words: time series, vector quantity, cosine algorithm, similarity measure, mean deviation