Computer and Modernization ›› 2021, Vol. 0 ›› Issue (01): 94-99.

Previous Articles     Next Articles

Semi-supervised Clustering Ensemble with Pairwise Constraints Based on PCA Dimension Reduction 

  

  1. (1. School of Information Engineering, Yancheng Institute of Technology, Yancheng 224002, China;
    2. School of Computer Science, Jiangsu University of Science and Technology, Zhenjiang 212003, China)
  • Online:2021-01-28 Published:2021-01-29

Abstract: Aiming at the problem that the existing clustering integration algorithms are unsupervised and cannot deal with high-dimensional data well, this paper proposes a semi-supervised clustering ensemble with pairwise constraints based on PCA dimension reduction (SSCEDR), the SSCEDR method uses PCA principal component analysis to reduce the dimension of the original data. Combined with semi-supervised clustering integration technology, the prior knowledge such as pairwise constraints is substituted into the clustering integration process in the reduced dimension space. The effectiveness of the algorithm is verified by experiments on multiple data sets.

Key words: clustering ensemble, dimension reduction, pairwise constraints, semi-supervised, principal component analysis