中国科学技术大学学报 ›› 2020, Vol. 50 ›› Issue (5): 596-604.DOI: 10.3969/j.issn.0253-2778.2020.05.007

• 论著 • 上一篇    下一篇

非均衡数据情形的一种协同正则化多视图半监督学习分类器

崔文泉,陈伟,程浩洋   

  1. 中国科学技术大学管理学院统计与金融系,安徽合肥 230026
  • 收稿日期:2019-04-14 修回日期:2019-05-17 接受日期:2019-05-17 出版日期:2020-05-31 发布日期:2019-05-17
  • 通讯作者: 崔文泉
  • 作者简介:崔文泉(通讯作者),男,1964年生,博士/副教授.研究方向:数理统计.E-mail: wqcui@ustc.edu.cn
  • 基金资助:
    国家自然科学基金(71873128)资助.

A multi-view based semi-supervised classifier with co-regularization for imbalanced data

CUI Wenquan, CHEN Wei, CHENG Haoyang   

  1. Department of Statistics and Finance, School of Management, University of Science and of Technology of China,Hefei 230026, China
  • Received:2019-04-14 Revised:2019-05-17 Accepted:2019-05-17 Online:2020-05-31 Published:2019-05-17

摘要: 利用多视图学习、流形学习以及协同正则化的多重惩罚处理,对含有大量无标签的类别数据提出一种多视图半监督学习的分类器构造方法.该方法由递归提升的方式对数据进行逐次多视图半监督学习,通过适当的标签化、均衡化处理改进每次集成的学习效率直到稳定.通过最小二乘和多分类SVM研究了新方法的性质,给出泛化误差的一个有意义上界,体现了新方法良好的泛化能力.模拟研究和实证分析显示,在有限样本情形下新方法具有良好的表现.

关键词: 半监督学习, 多视图学习, 协同正则化, 非均衡数据, 集成学习

Abstract: A method of constructing a multi-view semi-supervised learning classifier was presented for manifold learning and multi-puncture processing. The multi-view and semi-supervised learning of the data is achieved through recursive optimization, and appropriate labeling and equalization processing, until the efficiency of learning becomes stable. The properties of this multi-classifier were given, for instance, an upper bound of the generalization error, which showed a good capacity for generalization. Simulation and empirical analysis showed that the new method performs well with small samples.

Key words: semi-supervised learning, multi-view learning, co-regularization, imbalanced data, ensemble learning

中图分类号: