通信学报 ›› 2013, Vol. 34 ›› Issue (5): 42-51.doi: 10.3969/j.issn.1000-436x.2013.05.005

• 学术论文 • 上一篇    下一篇

类不均衡的半监督高斯过程分类算法

夏战国,夏士雄,蔡世玉,万玲   

  1. 中国矿业大学 计算机科学与技术学院,江苏 徐州 221116
  • 出版日期:2013-05-25 发布日期:2017-06-27
  • 基金资助:
    国家自然科学基金资助项目;国家教育部博士点基金资助项目

Semi-supervised Gaussian process classification algorithm addressing the class imbalance

Zhan-guo XIA,Shi-xiong XIA,Shi-yu CAI,Ling WAN   

  1. School of Computer Science and Technology,China University of Mining and Technology,Xuzhou 221116,China
  • Online:2013-05-25 Published:2017-06-27
  • Supported by:
    The National Natural Science Foundation of China;The Ph.D.Programs Foundation of the Ministry of Education of China

摘要:

摘要:针对传统的监督学习方法难以解决真实数据集标记信息少、训练样本集中存在类不均衡的问题,提出了类不均衡的半监督高斯过程分类算法。算法引入自训练的半监督学习思想,结合高斯过程分类算法计算后验概率,向未标记数据中注入类标记以获得更多准确可信的标记数据,使得训练样本的类分布相对平衡,分类器自适应优化以获得较好的分类效果。实验结果表明,在类不均衡的训练样本及标记信息过少的情况下,该算法通过自训练分类器获得了有效标记,使分类精度得到了有效提高,为解决类不均衡数据分类提供了一个新的思路。

关键词: 类不均衡, 半监督, 高斯过程分类, 自训练

Abstract:

The traditional supervised learning is difficult to deal with real-world datasets with less labeled information when the training sets class is imbalanced.Therefore,a new semi-supervised Gaussian process classification of address-ing was proposed.The semi-supervised Gaussian process was realized by calculating the posterior probability to obtain more accurate and credible labeled data,and embarking from self-training semi-supervised methods to add class label into the unlabeled data.The algorithm makes the distribution of training samples relatively balance,so the classifier can adaptively optimized to obtain better effect of classification.According to the experimental results,when the circum-stances of training set are class imbalance and much lack of label information,The algorithm improves the accuracy by obtaining effective labeled in comparison with other related works and provides a new idea for addressing the class im-balance is demonstrated.

Key words: class imbalance, semi-supervised, Gaussian process classification, self-training

No Suggested Reading articles found!