网络与信息安全学报 ›› 2024, Vol. 10 ›› Issue (1): 136-155.doi: 10.11959/j.issn.2096-109x.2024004

• 学术论文 • 上一篇    

基于改进的残差U-Net的不平衡协议识别方法

吴吉胜, 洪征, 马甜甜   

  1. 陆军工程大学指挥控制工程学院,江苏 南京 210000
  • 修回日期:2023-12-18 出版日期:2024-02-01 发布日期:2024-02-01
  • 作者简介:吴吉胜(1997− ),男,湖南凤凰人,陆军工程大学硕士生,主要研究方向为协议识别
    洪征(1979− ),男,江西南昌人,陆军工程大学副教授,主要研究方向为漏洞挖掘和协议逆向分析
    马甜甜(1998− ),女,江苏邳州人,陆军工程大学硕士生,主要研究方向为信息安全
  • 基金资助:
    国家重点研发计划(2017YFB0802900)

Unbalanced protocol recognition method based on improved residual U-Net

Jisheng WU, Zheng HONG, Tiantian MA   

  1. College of Command and Control Engineering, Army Engineering University, Nanjing 210000, China
  • Revised:2023-12-18 Online:2024-02-01 Published:2024-02-01
  • Supported by:
    The National Key R&D Program of China(2017YFB0802900)

摘要:

随着互联网的不断发展,网络攻击事件不断增多,成为网络安全的巨大挑战。在所捕获网络流量中,恶意流量往往占比较少,即攻击者使用的通信协议往往为少数类协议。当协议数据的类别分布不平衡时,现有协议识别方法能够识别出多数类协议,但是难以准确识别少数类协议。针对这一问题,提出一种基于改进的残差 U-Net 的不平衡协议识别方法,利用新的激活函数和 SE-Net(squeeze-and-excitation networks)改进残差U-Net,提升残差U-Net的特征提取能力。同时采用带权重的Dice损失函数作为协议识别模型的损失函数,少数类协议的识别准确率偏低会导致损失函数的值偏高,进而促使少数类协议主导模型的优化方向。采用所提方法进行协议识别时,首先从网络流量中抽取网络流,经过预处理转化为一维矩阵,利用协议识别模型提取协议数据的特征,进而由Softmax分类器计算输出协议类型。实验结果表明,与对比模型相比,所提协议识别模型能够更为准确地识别少数类协议,同时多数类协议的识别准确率得到了提升。

关键词: 协议识别, 类别不平衡, 卷积神经网络, 激活函数, 损失函数

Abstract:

An unbalanced protocol recognition method based on the improved Residual U-Net was proposed to solve the challenge of network security posed by the increasing network attacks with the continuous development of the Internet.In the captured network traffic, a small proportion is constituted by malicious traffic, typically utilizing minority protocols.However, existing protocol recognition methods struggle to accurately identify these minority protocols when the class distribution of the protocol data is imbalanced.To address this issue, an unbalanced protocol recognition method was proposed, which utilized the improved Residual U-Net, incorporating a novel activation function and the Squeeze-and-Excitation Networks (SE-Net) to enhance the feature extraction capability.The loss function employed in the proposed model was the weighted Dice loss function.In cases where the recognition accuracies of the minority protocols were low, the loss function value would be high.Consequently, the optimization direction of the model would be dominated by the minority protocols, resulting in improved recognition accuracies for them.During the protocol recognition process, the network flow was extracted from the network traffic and preprocessed to convert it into a one-dimensional matrix.Subsequently, the protocol recognition model extracted the features of the protocol data, and the Softmax classifier predicted the protocol types.Experimental results demonstrate that the proposed protocol recognition model achieves more accurate recognition of the minority protocols compared to the comparison model, while also improving the recognition accuracies of the majority protocols.

Key words: protocol recognition, class unbalance, convolutional neural network, activation function, loss function

中图分类号: 

No Suggested Reading articles found!