通信学报

• 学术论文 • 上一篇    下一篇

类不均衡的半监督高斯过程分类算法

夏战国,夏士雄,蔡世玉,万玲   

  1. 中国矿业大学 计算机科学与技术学院,江苏 徐州 221116
  • 出版日期:2013-05-25 发布日期:2013-05-15
  • 基金资助:
    国家自然科学基金资助项目(50674086);国家教育部博士点基金资助项目(20110095110010)

Semi-supervised Gaussian process classification algorithm addressing the class imbalance

  • Online:2013-05-25 Published:2013-05-15

摘要: 针对传统的监督学习方法难以解决真实数据集标记信息少、训练样本集中存在类不均衡的问题,提出了类不均衡的半监督高斯过程分类算法。算法引入自训练的半监督学习思想,结合高斯过程分类算法计算后验概率,向未标记数据中注入类标记以获得更多准确可信的标记数据,使得训练样本的类分布相对平衡,分类器自适应优化以获得较好的分类效果。实验结果表明,在类不均衡的训练样本及标记信息过少的情况下,该算法通过自训练分类器获得了有效标记,使分类精度得到了有效提高,为解决类不均衡数据分类提供了一个新的思路。

Abstract: The traditional supervised learning is difficult to deal with real-world datasets with less labeled information when the training sets class is imbalanced. Therefore, a new semi-supervised Gaussian process classification of addressing was proposed. The semi-supervised Gaussian process was realized by calculating the posterior probability to obtain more accurate and credible labeled data, and embarking from self-training semi-supervised methods to add class label into the unlabeled data. The algorithm makes the distribution of training samples relatively balance, so the classifier can adaptively optimized to obtain better effect of classification. According to the experimental results, when the circumstances of training set are class imbalance and much lack of label information, The algorithm improves the accuracy by obtaining effective labeled in comparison with other related works and provides a new idea for addressing the class imbalance is demonstrated.

No Suggested Reading articles found!