Journal of University of Science and Technology of China ›› 2016, Vol. 46 ›› Issue (9): 757-763.DOI: 10.3969/j.issn.0253-2778.2016.09.007

• Original Paper • Previous Articles    

Core-points based spectral clustering for big data analysis

YANG Yi, MA Runing   

  1. College of Science, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
  • Received:2016-03-01 Revised:2016-09-17 Accepted:2016-09-17 Online:2016-09-17 Published:2016-09-17

Abstract: With regard to failures in applying spectral clustering to big data due to its computation complexity, a new spectral clustering algorithm for big data was proposed. Firstly, core-points based on random sampling and data similarity were selected, with which, the big data were grouped. Secondly, spectral clustering was applied to the core-points. Finally, the clustering of whole data was completed by combining the clustering result of the core-points and the grouped big data information. The algorithm both promotes the spectral clustering to big data and reduces the influence of noise or abnormal data by the core-points. A large number of experiments fully verify the effectiveness of the method proposed in this paper.

Key words: big data, spectral clustering, core-points, data group

CLC Number: