Journal of University of Science and Technology of China ›› 2015, Vol. 45 ›› Issue (1): 61-68.DOI: 10.3969/j.issn.0253-2778.2015.01.010

• Original Paper • Previous Articles    

A hierarchical classification model for class-imbalanced data

SHI Peibei, LIU Guiquan, WANG Zhong, WEI Bing   

  1. 1.Department of Public Computer Teaching, Hefei Normal University, Hefei 230601, China; 2.School of Computer Science and Technology, University of Science and Technology of China,Hefei 230027, China; 3.Department of Digital Technology, No.38 Research Institute of CETC, Hefei 230088, China
  • Received:2014-06-09 Revised:2014-07-29 Accepted:2014-07-29 Online:2014-07-29 Published:2014-07-29

Abstract: Traditional machine learning methods have lower classification performance when dealing with class imbalanced data. A hierarchical classification model for class imbalanced data was thus proposed. With an AdaBoost classifier as its basis classifier, the model builds mathematical models by the features and false positive rates of the classifier, and demonstrates that parameters of the hierarchical classification model could be calculated. First, the hierarchical classification tree was as the structure, and then the classification cost of the hierarchical classification tree mode was obtained as well as a quantitative and mathematical description of the features of each layer. Finally, the classification cost could be converted to a optimization problem, and the solving process of the optimization problem was given. Meanwhile, results of the hierarchical classification are presented. Experiments have been conducted on UCI dataset, and the results show that the proposed method has higher AUC and F-measure compared to many existing class-imbalanced learning methods.

Key words: machine learning, class-imbalanced, hierarchical classification, feature, evaluation criteria

CLC Number: