通信学报 ›› 2023, Vol. 44 ›› Issue (5): 181-192.doi: 10.11959/j.issn.1000-436x.2023090

• 学术论文 • 上一篇    下一篇

基于深度强化学习的软件定义安全中台QoS实时优化算法

李元诚, 秦永泰   

  1. 华北电力大学控制与计算机工程学院,北京 102206
  • 修回日期:2023-02-04 出版日期:2023-05-25 发布日期:2023-05-01
  • 作者简介:李元诚(1970- ),男,山东烟台人,博士,华北电力大学教授、博士生导师,主要研究方向为密码学、信息安全等
    秦永泰(1998- ),男,甘肃定西人,华北电力大学硕士生,主要研究方向为深度强化学习、软件定义安全等
  • 基金资助:
    国网江西信息通信公司基金资助项目(52183520007V)

Deep reinforcement learning based algorithm for real-time QoS optimization of software-defined security middle platform

Yuancheng LI, Yongtai QIN   

  1. School of Control and Computer Engineering, North China Electric Power University, Beijing 102206, China
  • Revised:2023-02-04 Online:2023-05-25 Published:2023-05-01
  • Supported by:
    The State Grid Jiangxi Information & Telecommunication Company Project(52183520007V)

摘要:

针对软件定义安全场景中的服务质量(QoS)实时优化方案因安全防护手段与业务场景不匹配而导致的适用困难和性能下降的问题,提出了基于深度强化学习的软件定义安全中台QoS实时优化算法。首先,将碎片化的安全需求与安全基础设施统一到软件定义安全中台云模型中;然后,通过深度强化学习结合云计算技术提高安全中台的实时匹配和动态适应能力;最后,生成满足QoS目标的安全中台资源实时调度策略。实验结果表明,与现有实时算法相比,所提算法不但保证负载均衡,还提高了 18.7%的作业调度成功率以提高服务质量,降低了34.2%的平均响应时间,具有很好的稳健性,更适用于实时环境。

关键词: 软件定义安全, 深度强化学习, 安全中台, 服务质量

Abstract:

To overcome the problem that the real-time optimization of the quality of service (QoS) in software-defined security scenarios was hindered by the mismatch between security protection measures and business scenarios, which led to difficulties in application and performance degradation., a novel algorithm based on deep reinforcement learning for optimizing QoS in software defined security middle platforms (SDSmp) in real-time was proposed.Firstly, the fragmented security requirements and infrastructure were integrated into the SDSmp cloud model.Then by leveraging the power of deep reinforcement learning and cloud computing technology, the real-time matching and dynamic adaptation capabilities of the security middle platform were enhanced.Finally, a real-time scheduling strategy for security middle platform resources that meet QoS goals was generated.Experimental results demonstrate that compared to existing real-time methods, the proposed algorithm not only ensures load balancing but also improves job success rate by 18.7% for high QoS and reduces the average response time by 34.2%, and it is highly robust and better suited for real-time environments than existing methods.

Key words: software defined security, deep reinforcement learning, security middle platform, quality of service

中图分类号: 

No Suggested Reading articles found!