Journal of University of Science and Technology of China ›› 2017, Vol. 47 ›› Issue (8): 686-694.DOI: 10.3969/j.issn.0253-2778.2017.08.008

• Original Paper • Previous Articles     Next Articles

A two-stage feature selection method based on Fisher’s ratio and prediction risk for telecom customer churn prediction

XU Ziwei, WANG Peng, CHEN Zonghai   

  1. Department of Automation, University of Science and Technology of China, Hefei, 230027, China
  • Received:2016-03-18 Revised:2016-11-07 Online:2017-08-31 Published:2017-08-31
  • Contact: CHEN Zonghai
  • About author:XU Ziwei, male, born in 1986, PhD candidate. Research field: Prediction control. E-mail: xziwei@mail.ustc.edu.cn
  • Supported by:
    Supported by the National Natural Science Foundation of China ( 61375079).

Abstract: Telecom customer churn prediction is crucial to the customer relationship management systems of telecom operators. It aims to predict a particular customer who is at a high risk of churning. The predicting process includes the steps of data pre-processing, imbalance processing, feature selection, classifier training and evaluation. A two-stage feature selection method based on fisher’s ratio and prediction risk was proposed, which took advantage of the filter feature selection method and wrapper feature selection method to solve the high dimensionality problem of telecom customer churn prediction. The method was evaluated on a real-world dataset, and the experimental results verify that it is able to reduce feature dimensionality and improve the performance of classifiers.

Key words: big data, churn prediction, two stage feature selection, Spark

CLC Number: