Journal of University of Science and Technology of China ›› 2018, Vol. 48 ›› Issue (4): 284-289.DOI: 10.3969/j.issn.0253-2778.2018.04.003

• Original Paper • Previous Articles     Next Articles

An accelerator for kernel ridge regression algorithms based on data partition

LIU Enjiang, SONG Yunsheng, LIANG Jiye,   

  1. 1. School of Computer and Information Technology,Shanxi University,Taiyuan 030006, China;
    2. Key Laboratory of Computational Intelligence and Chinese Information Processing, Taiyuan 030006, China)
  • Received:2017-05-23 Revised:2017-06-24 Online:2018-04-30 Published:2018-04-30

Abstract: Kernel ridge regression (KRR) is an important regression algorithm widely used in pattern recognition and data mining for its interpretability and strong generalization capability. However, it has the defect of low training efficiency when faced with large-scale data. To address this problem, an accelerating algorithm is proposed which uses the concept of divide-and-conquer for kernel ridge regression based on data partition (PP-KRR). Firstly, the current training data space is divided into m mutually disjoint regions by a bunch of parallel hyperplanes. Secondly, each KRR model is trained on each region respectively. Finally, each unlabeled instance is predicted by the KRR model within the same region. Comparisons with three traditional algorithms on real datasets show that the proposed algorithm obtains similar prediction accuracy with less training time.

Key words: kernel ridge regression, divide-and-conquer, parallel partition, principal component analysis

CLC Number: