通信学报 ›› 2020, Vol. 41 ›› Issue (6): 88-97.doi: 10.11959/j.issn.1000-436x.2020123

• 学术论文 • 上一篇    下一篇

基于自编码器的未知协议分类方法

顾纯祥1,2,吴伟森1,石雅男1,李光松1   

  1. 1 信息工程大学网络空间安全学院,河南 郑州 450001
    2 网络密码技术河南省重点实验室,河南 郑州 450001
  • 修回日期:2020-04-03 出版日期:2020-06-25 发布日期:2020-07-04
  • 作者简介:顾纯祥(1976- ),男,安徽霍山人,博士,信息工程大学教授、博士生导师,网络密码技术河南省重点实验室主任,主要研究方向为密码学与网络安全|吴伟森(1996- ),男,浙江天台人,信息工程大学硕士生,主要研究方向为网络安全、机器学习|石雅男(1982- ),女,河南安阳人,信息工程大学讲师,主要研究方向为安全协议分析|李光松(1977- ),男,山东德州人,博士,信息工程大学副教授,主要研究方向为网络协议分析、区块链、无线网络安全
  • 基金资助:
    国家自然科学基金资助项目(61772548);国家自然科学基金创新研究群体资助项目(61521003);信息保障技术重点实验室开放基金资助项目(KJ-17-001)

Method of unknown protocol classification based on autoencoder

Chunxiang GU1,2,Weisen WU1,Ya’nan SHI1,Guangsong LI1   

  1. 1 School of Cyberspace Security,Information Engineering University,Zhengzhou 450001,China
    2 Henan Key Laboratory of Network Cryptography Technology,Zhengzhou 450001,China
  • Revised:2020-04-03 Online:2020-06-25 Published:2020-07-04
  • Supported by:
    The National Natural Science Foundation China(61772548);Innovative Research Groups of the National Natural Science Foundation of China(61521003);Foundation of Science and Technology on Information Assurance Laboratory(KJ-17-001)

摘要:

针对互联网中存在的大量未知协议导致网络管理和维护网络安全十分困难的问题,提出了一种未知协议的分类识别方法。结合自编码器技术和改进的K-means聚类技术针对网络流量实现了未知协议的分类识别。利用自编码器对网络流量进行降维和特征提取,使用聚类技术对降维后数据进行无监督的分类,最终实现对网络流量的无监督识别分类。实验结果表明,所提方法分类效果优于传统的 K-means、DBSCAN、GMM 算法,且具有更高的效率。

关键词: 未知协议分类, 自编码器, 无监督分类, 特征提取

Abstract:

Aiming at the problem that a large number of unknown protocols exist in the Internet,which makes it very difficult to manage and maintain the network security,a classification and identification method of unknown protocols was proposed.Combined with the autoencoder technology and the improved K-means clustering technology,the unknown protocol was classified and identified for the network traffic.The autoencoder was used to reduce dimensionality and select features of network traffic,clustering technology was used to classify the dimensionality reduction data unsupervised,and finally unsupervised recognition and classification of network traffic were realized.Experimental results show that the classification effect is better than the traditional K-means,DBSCAN,GMM algorithm,and has higher efficiency.

Key words: unknown protocol classification, autoencoder, unsupervised classification, feature extraction

中图分类号: 

No Suggested Reading articles found!