Chinese Journal of Network and Information Security ›› 2017, Vol. 3 ›› Issue (3): 64-70. doi: 10.11959/j.issn.2096-109x.2017.00119

• Academic paper •

Research on a Bayesian filtering algorithm for spam email

CAO Cui-ling¹, WANG Yuan-yuan², YUAN Ye¹, ZHAO Guo-dong¹

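  1 College of Computer Science and Technology, Harbin Engineering University, Harbin, Heilongjiang 150001
  2 College of Mechanical and Electrical Engineering, Northeast Forestry University, Harbin, Heilongjiang 150040
  • Revised: 2016-11-25 Online: 2017-03-01 Published: 2017-03-25
  • About the authors: CAO Cui-ling (1990-), female, born in Handan, Hebei, is a master's student at Harbin Engineering University; her main research interests are network information security and embedded systems. | WANG Yuan-yuan (1995-), female, born in Harbin, Heilongjiang, is an undergraduate student at Northeast Forestry University; her main research interest is information security. | YUAN Ye (1995-), male, born in Bei'an, Heilongjiang, is an undergraduate student at Harbin Engineering University; his main research interest is embedded systems. | ZHAO Guo-dong (1978-), born in Daqing, Heilongjiang, holds a PhD and is a lecturer at Harbin Engineering University; his main research interests are robotics and information security.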

Research of a spam filter based on improved naive Bayes algorithm

  1 College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China
  2 College of Mechanical and Electrical Engineering, Northeast Forestry University, Harbin 150040, China
  • Revised: 2016-11-25 Online: 2017-03-01 Published: 2017-03-25

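Abstract (translated from the Chinese):

This paper studies the application of an improved support vector machine (SVM) algorithm combined with the naive Bayes algorithm to spam filtering. First, the SVM constructs an optimal separating hyperplane for the set of samples lying at the boundary between the two classes in the training-set sample space. Then, each sample is kept or discarded according to whether its nearest neighbor belongs to the same class, which both shrinks the sample space and increases the class independence of each sample. Finally, the naive Bayes algorithm classifies the emails. Simulation results show that the algorithm reduces the complexity of the sample space, quickly obtains the optimal classification feature subset, and effectively improves the classification speed, accuracy, and recall of spam filtering.

Key words (translated from the Chinese): naive Bayes, support vector machine, trimming, spam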

Abstract:

In the field of spam filtering, naive Bayes is one of the most popular algorithms. A modification of the naive Bayes algorithm that uses a support vector machine (SVM), called SVM-NB, is proposed. First, the SVM constructs an optimal separating hyperplane for the training samples that lie at the boundary between the two classes in the sample space. Second, each sample is kept or discarded according to whether its class label matches that of its nearest neighbor, which reduces the sample space and increases the class independence of each sample. Finally, the naive Bayes algorithm classifies the emails. The simulation results show that the algorithm reduces the complexity of the sample space, quickly obtains the optimal classification feature subset, and effectively improves the classification speed and accuracy of spam filtering.

Key words: naive Bayes, SVM, trim, spam mail
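
The second and third steps of the abstract (nearest-neighbor trimming followed by naive Bayes classification) can be illustrated with a minimal sketch. It is not the authors' implementation: the SVM stage is omitted, the rule "discard a sample whose nearest neighbor has a different label" is an assumption (the abstract does not say which samples are dropped), the classifier is a standard multinomial naive Bayes with Laplace smoothing, and all function names and the toy messages are hypothetical.

```python
import math
from collections import Counter

def tokenize(text):
    return text.lower().split()

def distance(a, b):
    # Squared Euclidean distance between two bag-of-words Counters.
    return sum((a[k] - b[k]) ** 2 for k in set(a) | set(b))

def trim(samples):
    # Assumed rule: keep a sample only if its nearest neighbor has the same label.
    kept = []
    for i, (vec, label) in enumerate(samples):
        others = [s for j, s in enumerate(samples) if j != i]
        _, nn_label = min(others, key=lambda s: distance(vec, s[0]))
        if nn_label == label:
            kept.append((vec, label))
    return kept

def train_nb(samples):
    # Multinomial naive Bayes with Laplace (add-one) smoothing.
    vocab = {w for vec, _ in samples for w in vec}
    labels = Counter(label for _, label in samples)
    word_counts = {c: Counter() for c in labels}
    for vec, label in samples:
        word_counts[label].update(vec)
    model = {}
    for c in labels:
        total = sum(word_counts[c].values()) + len(vocab)
        log_prior = math.log(labels[c] / len(samples))
        log_like = {w: math.log((word_counts[c][w] + 1) / total) for w in vocab}
        model[c] = (log_prior, log_like)
    return model

def classify(model, text):
    vec = Counter(tokenize(text))
    def score(c):
        log_prior, log_like = model[c]
        # Words never seen in training are ignored.
        return log_prior + sum(n * log_like[w] for w, n in vec.items() if w in log_like)
    return max(model, key=score)

# Toy training set, invented for illustration only.
raw = [
    ("win free money now", "spam"),
    ("free prize claim now", "spam"),
    ("cheap money win prize", "spam"),
    ("meeting agenda monday project", "ham"),
    ("meeting agenda monday notes", "ham"),
    ("meeting agenda project notes", "ham"),
    ("free now meeting agenda monday", "spam"),  # sample sitting inside the ham cluster
]
samples = [(Counter(tokenize(t)), y) for t, y in raw]
trimmed = trim(samples)      # the last sample is dropped by the assumed rule
model = train_nb(trimmed)

print(len(samples), len(trimmed))
print(classify(model, "free money prize"))
print(classify(model, "project meeting monday"))
```

In this toy run the trimming step removes only the sample whose nearest neighbor carries the other label, so the naive Bayes model is trained on a smaller set with cleaner class boundaries, which is the effect the abstract attributes to the trimming stage.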

