通信学报 ›› 2018, Vol. 39 ›› Issue (10): 155-165.doi: 10.11959/j.issn.1000-436x.2018224

• 学术通信 • 上一篇    下一篇

基于集成分类器的恶意网络流量检测

汪洁,杨力立,杨珉   

  1. 中南大学信息科学与工程学院,湖南 长沙 410083
  • 修回日期:2018-07-19 出版日期:2018-10-01 发布日期:2018-11-23
  • 作者简介:汪洁(1980-),女,湖南桃江人,博士,中南大学副教授,主要研究方向为网络与信息安全等。|杨力立(1992-),女,布依族,贵州安顺人,中南大学硕士生,主要研究方向为网络与信息安全等。|杨珉(1993-),男,江西南昌人,中南大学硕士生,主要研究方向为强化学习等。
  • 基金资助:
    国家自然科学基金资助项目(61202495)

Multitier ensemble classifiers for malicious network traffic detection

Jie WANG,Lili YANG,Min YANG   

  1. School of Information Science and Engineering,Central South University,Changsha 410083,China
  • Revised:2018-07-19 Online:2018-10-01 Published:2018-11-23
  • Supported by:
    The National Natural Science Foundation of China(61202495)

摘要:

针对目前网络大数据环境攻击检测中因某些攻击步骤样本的缺失而导致攻击模型训练不够准确的问题,以及现有集成分类器在构建多级分类器时存在的不足,提出基于多层集成分类器的恶意网络流量检测方法。该方法首先采用无监督学习框架对数据进行预处理并将其聚成不同的簇,并对每一个簇进行噪音处理,然后构建一个多层集成分类器 MLDE 检测网络恶意流量。MLDE 集成框架在底层使用基分类器,非底层使用不同的集成元分类器。该框架构建简单,能并发处理大数据集,并能根据数据集的大小来调整集成分类器的规模。实验结果显示,当MLDE的基层使用随机森林、第2层使用bagging集成分类器、第3层使用AdaBoost集成分类器时,AUC的值能达到0.999。

关键词: 恶意网络流量, 攻击检测, 攻击阶段, 网络流量聚类, 集成分类器

Abstract:

A malicious network traffic detection method based on multi-level distributed ensemble classifier was proposed for the problem that the attack model was not trained accurately due to the lack of some samples of attack steps for detecting attack in the current network big data environment,as well as the deficiency of the existing ensemble classifier in the construction of multilevel classifier.The dataset was first preprocessed and aggregated into different clusters,then noise processing on each cluster was performed,and then a multi-level distributed ensemble classifier,MLDE,was built to detect network malicious traffic.In the MLDE ensemble framework the base classifier was used at the bottom,while the non-bottom different ensemble classifiers were used.The framework was simple to be built.In the framework,big data sets were concurrently processed,and the size of ensemble classifier was adjusted according to the size of data sets.The experimental results show that the AUC value can reach 0.999 when MLDE base users random forest was used in the first layer,bagging was used in the second layer and AdaBoost classifier was used in the third layer.

Key words: malicious network traffic, attack detection, attack phase, network flow clustering, ensemble classifier

中图分类号: 

No Suggested Reading articles found!