电信科学 ›› 2015, Vol. 31 ›› Issue (2): 86-96.doi: 10.11959/j.issn.1000-0801.2015014

• 研究与开发 • 上一篇    下一篇

基于信息熵差异性度量的数据流增量集成分类算法

琚春华1,2,邹江波1,2   

  1. 1 浙江工商大学信息学院 杭州 310018
    2 浙江工商大学现代商贸研究中心 杭州 310000
  • 出版日期:2015-02-20 发布日期:2017-03-18
  • 基金资助:
    国家科技支撑计划基金资助项目;浙江省自然科学基金资助项目;教育部人文社会科学重点研究基地基金资助项目

An Incremental Classification Algorithm for Data Stream Based on Information Entropy Diversity Measure

Chunhua Ju1,2,Jiangbo Zou1,2   

  1. 1 School of Computer Science &Information Engineering, Hangzhou 310018, China
    2 Center for Studies of Modern Business, Zhejiang Gongshang University, Hangzhou 310018, China
  • Online:2015-02-20 Published:2017-03-18
  • Supported by:
    The National Key Technology R&D Program;The Natural Science Foundation of Zhejiang Province of China;The Key Ministry of Education,Humanities and Social Sciences Project

摘要:

摘要:对分类器之间的差异性进行了研究,提出了一种基于信息熵差异性度量的增量集成分类算法,将信息熵差异性度量方法融入到基分类器选择过程中,通过对训练数据集的基分类结果的信息熵差异度计算,采用循环迭代优化的选择方法,以熵差异性最优化为约束目标,动态调整基分类器个数,实现了分类准确稳定,减少了系统开销。通过实验比对,证明了算法在数据流处理时比其他算法具有更小的开销和较强的适应性。

关键词: 集成分类器, 差异性度量, 信息熵, 增量集成, 数据流

Abstract:

The diversity between classifiers was studied and an incremental classification algorithm for data stream based on information entropy diversity measure was proposed, the method of information entropy diversity measure was integrated into the selection process of base classifiers, the information entropy diversity of base classifier which trained from training data was calculated, by means of cyclic iterative as optimization method and entropy diversity as optimization constrained goal, the numbers of base classifiers was dynamic adjusted that improved the classification accuracy and stability to reduce system costs. The experiments prove that the algorithm has less cost and strong adaptability compare with other data stream algorithm when processing data stream.

Key words: ensemble classifier, diversity measure, entropy of information, incremental ensemble, data stream

No Suggested Reading articles found!