基于自步数据重构正则化的模糊C均值聚类算法改进

计算机与现代化 ›› 2020, Vol. 0 ›› Issue (06): 120-.

• • 上一篇

基于自步数据重构正则化的模糊C均值聚类算法改进

(1.西安航空学院，陕西西安710077；2.中国特种设备检测研究院，北京100029；
3.西安交通大学数学与统计学院，陕西西安710049)

收稿日期:2019-10-22 出版日期:2020-06-24 发布日期:2020-06-28
作者简介:陈怡君（1984-），女，宁夏固原人，硕士研究生，研究方向：数字图书馆，数据挖掘技术及应用，E-mail： xiaoyifighting@stu.xjtu.edu.cn；曹逻炜（1985-），男，高级工程师，博士，研究方向：承压设备风险控制，结构完整性评价，损伤预测，E-mail： lwcao_1794@126.com；杜玉倩（1994-），女，硕士研究生，研究方向:大数据处理与分析方法，E-mail: 809009017@qq.com。

Improvement of Fuzzy C-Means Clustering Algorithm Based on Self-paced Data Reconstruction Regularization

(1. Xi’an Aeronautical University, Xi’an 710077, China；
2. China Special Equipment Inspection and Research Institute, Beijing 100029, China；
3. School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, China)

Received:2019-10-22 Online:2020-06-24 Published:2020-06-28

摘要/Abstract

摘要： 为了有效降低模糊C均值算法对奇异值和噪声点的敏感性，本文提出一种自步数据重构正则化模糊C均值聚类算法。传统算法是在C均值算法的目标函数中引入加权参数来实现对数据的模糊性划分，而本文提出的方法则是通过对C均值的目标函数进行数据重构正则化来实现，并以自步学习的方式逐步对数据点进行聚类。实验结果表明，本文算法在模拟数据、实际数据以及在图像分割中都能显著降低算法对奇异值和噪声数据的敏感性，聚类更为准确高效。

关键词: 模糊C均值, 聚类划分, 自步学习, 数据重构正则化

Abstract: In order to reduce the sensitivity of fuzzy C-means clustering algorithm for outliers and noise data points, a self-paced data reconstruction is proposed. Traditional fuzzy C-means algorithm realizes fuzzification of memberships by introducing a weighting parameter into the objective function of the C-means clustering. This paper achieves fuzzification of memberships through regularization of hard C-means clustering by data reconstruction. In addition, the proposed algorithm gradually carries out the clustering of data points in a self-paced manner. Experimental results show that the algorithm can significantly reduce the sensitivity to singular value and noise data in simulation data, actual data and image segmentation, and clustering is more accurate and efficient.

Key words: fuzzy C-means, clustering partition, self-paced learning, data reconstruction regularization

中图分类号:

TP391

陈怡君, 曹逻炜, 杜玉倩. 基于自步数据重构正则化的模糊C均值聚类算法改进[J]. 计算机与现代化, 2020, 0(06): 120-.

CHEN Yi-jun, CAO Luo-wei, DU Yu-qian. Improvement of Fuzzy C-Means Clustering Algorithm Based on Self-paced Data Reconstruction Regularization[J]. Computer and Modernization, 2020, 0(06): 120-.

参考文献

［1］孙吉贵，刘杰，赵连宇. 聚类算法研究［J］. 软件学报, 2008,19(1):48-61.
［2］吴非,毛宇光. 一种基于k维树的模糊C均值聚类算法［J］. 计算机与现代化， 2015(11):1-5.
［3］张萍,王剑钢. 结合空间信息的FCM聚类噪声图像分割方法［J］. 计算机与现代化， 2012(3):52-54.
［4］杨慧婕,刘微,黄先莉,等. 基于模糊C均值改进算法和ANFIS的蓄电池SOC预测［J］. 计算机与现代化， 2017(12):111-116.
［5］李向荣，范福海，孟向海. 改进K-means聚类算法在停车用户价值分群中的应用［J］. 计算机与数字工程， 2019，47(7):1596-1600.
［6］朱州，吴漾. 基于改进K-means聚类算法在电力客户价值分群的应用［J］. 计算机与数字工程， 2017,45(6):1049-1054.
［7］洪向共,周世芬. 基于改进模糊C均值聚类的光伏面板红外图像分割［J］. 计算机系统应用, 2019,28(5):35-41.
［8］郑平. 融合纹理特征的模糊聚类图像分割算法研究［D］. 沈阳：东北大学, 2015.
［9］ BEZDEK J C. Pattern Recognition With Fuzzy Objective Function Algorithms ［M］. New York: Plenum Press, 1981：238-241.
［10］MIYAMOTO S, MUKAIDONO M. Fuzzy C-means as a regularization and maximum entropy approach［C］// Proceedings of the 7th International Fuzzy Systems Association World Congress. 1997：86-92.
［11］MIYAMOTO S, UMAYAHARA K. Fuzzy clustering by quadratic regularization［C］// Proceedings of 1998 IEEE International Conference on Fuzzy Systems. 1998:1394-1399.
［12］HONDA K, ICHIHASHI H. A new approach to fuzzification of memberships in cluster analysis［C］// International Conference on Modeling Decisions for Artificial Intelligence. 2005:172-182.
［13］刘晏明,易鑫,李超. 基于模糊C均值聚类的比色传感器阵列图像分割算法［J］. 计算机系统应用, 2019,28(6):110-117.
［14］魏光杏,周献中,卜锡滨. 基于模糊C均值与人工蜂群优化的灰度图像分割［J］. 兰州大学学报(自然科学版), 2019,55(2):250-254.
［15］薛涛,李婷. 基于Spark的模糊C均值算法改进［J］. 西安工程大学学报, 2019,33(1):100-105.
［16］ICHIHASHI H, MIYAGISHI K, HONDA K. Fuzzy C-means clustering with regularization by K-L information［C］// Proceedings of the 10th IEEE International Conference on Fuzzy Systems. 2001:924-927.
［17］HONDA K, ICHIHASHI H. Regularized linear fuzzy clustering and probabilistic PCA mixture models［J］. IEEE Transactions on Fuzzy Systems, 2005,13(4):508-516.
［18］BENGIO Y, LOURADOUR J, COLLOBERT R, et al. Curriculum learning［C］// Proceedings of IEEE International Conference on Machine Learning. 2009:120-125.
［19］KUMAR M P, PACKER B, KOLLER D. Self-paced learning for latent variable models［C］// Proceedings of the 23th Annual Conference on Neural Information Processing Systems. 2010:1189-1197.
［20］JIANG L, MENG D Y, ZHAO Q, et al. Self-paced curriculum learning［C］// Proceedings of AAAI Conference on Artificial Intelligence. 2015:219-223.
［21］JIANG L, MENG D Y, YU S I, et al. Self-paced learning with diversity［C］// Proceedings of the 27th Annual Conference on Neural Information Processing Systems. 2014:56-60.
［22］KRISHNAPURAM A, KELLER J. A possibilistic approach to clustering［J］. IEEE Transactions on Fuzzy Systems, 1993,1(2):987-110.
［23］PAL N, PAL K, BEZDEK J C. A mixed C-means clustering model［C］// Proceedings of the 6th IEEE International Conference on Fuzzy Systems. 1997,1:11-21.

[1]	冯俊淇, 张正军, 章曼, 严涛. 基于熵与邻域约束的模糊C均值改进算法[J]. 计算机与现代化, 2021, 0(11): 89-94.
[2]	梁慈1,陈世平2. 基于包簇映射架构的包漂移策略研究[J]. 计算机与现代化, 2019, 0(11): 112-.
[3]	张驰，韩立新，徐国夏. 自步上下文感知的相关滤波跟踪算法[J]. 计算机与现代化, 2018, 0(11): 35-.
[4]	刘微，杨慧婕，刘守印. 基于ACCA-FCM和SVM-RFE的蓄电池SOH特征选择算法[J]. 计算机与现代化, 2018, 0(01): 11-18.
[5]	杨慧婕，刘微，黄先莉，刘守印. 基于模糊C均值改进算法和ANFIS的蓄电池SOC预测[J]. 计算机与现代化, 2017, 0(12): 111-116.
[6]	霍旭1，吴涛1,2. 对于不平衡数据的模糊时间序列预测[J]. 计算机与现代化, 2017, 0(12): 108-110.
[7]	田沁怡 1，田小林 2. 基于隐马尔科夫随机场邻域选择的细节保护图像分割[J]. 计算机与现代化, 2017, 0(10): 15-19.
[8]	张勇亮1，李国林1，林珍玉1，李高扬2. 一种基于粒度相关向量机的故障预测方法[J]. 计算机与现代化, 2016, 0(9): 91-95+99.
[9]	吴非，毛宇光. 一种基于k维树的模糊C均值聚类算法[J]. 计算机与现代化, 2015, 0(11): 1-5+11.
[10]	曲晓燕;王梓旭. 基于FCM和AHP的多Agent智能决策支持系统[J]. 计算机与现代化, 2013, 1(8): 209-213.
[11]	廖璠;李瑞昌;刘雅琳. 改进FCM算法在肺结节自动检测中的应用研究[J]. 计算机与现代化, 2012, 1(10): 41-45.
[12]	张萍;王剑钢. 结合空间信息的FCM聚类噪声图像分割方法[J]. 计算机与现代化, 2012, 1(03): 52-54.

基于自步数据重构正则化的模糊C均值聚类算法改进

Improvement of Fuzzy C-Means Clustering Algorithm Based on Self-paced Data Reconstruction Regularization

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 12

编辑推荐

Metrics

本文评价