Journal on Communications, 2016, Vol. 37, Issue (10): 81-91. doi: 10.11959/j.issn.1000-436x.2016199

• Academic paper •

Improved incremental algorithm of Naive Bayes

Shui-fei ZENG1, Xiao-yan ZHANG1, Xiao-feng DU2, Tian-bo LU1

  1 School of Software Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China
  2 School of Computer, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Online: 2016-10-25 Published: 2016-10-25

Abstract:

A novel Naive Bayes incremental algorithm that can incorporate newly appearing features is proposed. To select incremental samples from the unlabeled corpus, a minimum posterior probability is constructed alongside the traditional class confidence threshold, so that the two together act as a double threshold for sample selection. When a new feature is detected in the incremental corpus, it is added to the feature space and the classifier is updated accordingly. The minimum posterior probability is found to complement the class confidence threshold well. Finally, the proposed algorithm is validated on both unlabeled and labeled corpora. The experimental results show that the improved Naive Bayes incremental algorithm achieves better incremental learning performance than the traditional incremental algorithm.

Key words: Naive Bayes, incremental algorithm, feature space, evaluation index
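
The abstract does not give the formulas behind the two thresholds, so the following Python sketch is only an illustration of the general idea and not the authors' implementation. It assumes a multinomial Naive Bayes model with Laplace smoothing, and it assumes that "class confidence" means the gap between the best and second-best posterior and that the "minimum posterior probability" is a lower bound on the best posterior. The class name `IncrementalNB` and its methods are hypothetical.

```python
import math
from collections import Counter, defaultdict


class IncrementalNB:
    """Multinomial Naive Bayes stored as raw counts so it can be updated in place."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha                       # Laplace smoothing constant
        self.class_docs = Counter()              # documents seen per class
        self.word_counts = defaultdict(Counter)  # class -> token -> count
        self.class_tokens = Counter()            # total tokens per class
        self.vocab = set()                       # current feature space

    def update(self, tokens, label):
        """Fold one labeled sample into the counts; unseen tokens extend the feature space."""
        self.class_docs[label] += 1
        for tok in tokens:
            self.vocab.add(tok)
            self.word_counts[label][tok] += 1
            self.class_tokens[label] += 1

    def posteriors(self, tokens):
        """Return {class: P(class | tokens)}, ignoring tokens outside the feature space."""
        total_docs = sum(self.class_docs.values())
        vocab_size = len(self.vocab)
        log_scores = {}
        for c in self.class_docs:
            score = math.log(self.class_docs[c] / total_docs)
            denom = self.class_tokens[c] + self.alpha * vocab_size
            for tok in tokens:
                if tok in self.vocab:
                    score += math.log((self.word_counts[c][tok] + self.alpha) / denom)
            log_scores[c] = score
        # Normalise in a numerically stable way (subtract the largest log score).
        peak = max(log_scores.values())
        exp_scores = {c: math.exp(s - peak) for c, s in log_scores.items()}
        norm = sum(exp_scores.values())
        return {c: e / norm for c, e in exp_scores.items()}

    def learn_unlabeled(self, tokens, confidence_threshold, min_posterior):
        """Self-label one unlabeled sample only if it passes both thresholds.

        Assumed definitions (not taken from the paper):
          - class confidence = best posterior minus second-best posterior
          - minimum posterior = lower bound on the best posterior
        Returns the assigned label, or None if the sample is rejected.
        """
        ranked = sorted(self.posteriors(tokens).items(),
                        key=lambda kv: kv[1], reverse=True)
        best_label, best_p = ranked[0]
        runner_up_p = ranked[1][1] if len(ranked) > 1 else 0.0
        confident = (best_p - runner_up_p) >= confidence_threshold
        probable = best_p >= min_posterior
        if confident and probable:
            # New tokens in the accepted sample are added to the feature space here.
            self.update(tokens, best_label)
            return best_label
        return None


# Hypothetical toy corpus, used only to exercise the sketch.
nb = IncrementalNB()
nb.update(["goal", "match", "team"], "sport")
nb.update(["stock", "market", "bank"], "finance")
label = nb.learn_unlabeled(["goal", "team", "striker"],
                           confidence_threshold=0.5, min_posterior=0.7)
print(label, "striker" in nb.vocab)
```

Keeping the model as counts rather than precomputed probabilities is a design choice of this sketch: adding a feature then only means inserting a key, and the smoothed likelihoods are recomputed from the enlarged vocabulary the next time a sample is scored.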
