通信学报 ›› 2020, Vol. 41 ›› Issue (1): 42-52.doi: 10.11959/j.issn.1000-436x.2020003

• 学术论文 • 上一篇    下一篇

基于Markov时间博弈的移动目标防御最优策略选取方法

谭晶磊1,2,张恒巍1,张红旗1,2,金辉1,2,雷程1,2   

  1. 1 信息工程大学三院,河南 郑州 450001
    2 河南省信息安全重点实验室,河南 郑州 450001
  • 修回日期:2019-09-21 出版日期:2020-01-25 发布日期:2020-02-11
  • 作者简介:谭晶磊(1994- ),男,山东章丘人,信息工程大学博士生,主要研究方向为网络信息安全、移动目标防御、攻防博弈对抗等|张恒巍(1978- ),男,河南洛阳人,博士,信息工程大学副教授,主要研究方向为网络安全与攻防对抗、信息安全风险评估|张红旗(1962- ),男,河北遵化人,博士,信息工程大学教授、博士生导师,主要研究方向为网络安全、移动目标防御、等级保护和信息安全管理等|金辉(1988- ),男,北京人,信息工程大学硕士生,主要研究方向为网络信息安全等|雷程(1989- ),男,北京人,信息工程大学博士生,主要研究方向为网络信息安全、移动目标防御、数据安全交换和网络流指纹等
  • 基金资助:
    国家重点研发计划基金资助项目(2016YFF0204002);国家重点研发计划基金资助项目(2016YFF0204003);国家自然科学基金资助项目(61902427)

Optimal strategy selection approach of moving target defense based on Markov time game

Jinglei TAN1,2,Hengwei ZHANG1,Hongqi ZHANG1,2,Hui JIN1,2,Cheng LEI1,2   

  1. 1 Department of Three,Information Engineering University,Zhengzhou 450001,China
    2 Henan Key Laboratory of Information Security,Zhengzhou 450001,China
  • Revised:2019-09-21 Online:2020-01-25 Published:2020-02-11
  • Supported by:
    The National Key Research and Development Program of China(2016YFF0204002);The National Key Research and Development Program of China(2016YFF0204003);The National Natural Science Foundation of China(61902427)

摘要:

针对现有博弈模型难以有效建模网络攻防对抗动态连续特性的问题,提出了一种基于 Markov 时间博弈的移动目标防御最优策略选取方法。在分析移动目标攻防对抗过程的基础上,构建了移动目标攻防策略集合,利用时间博弈刻画了单阶段移动目标防御过程的动态性,利用 Markov 决策过程描述了多阶段移动目标防御状态转化的随机性。同时,将攻防双方对资源脆弱性抽象为对攻击面控制权的交替,从而有效保证了博弈模型的通用性。在此基础上,分析并证明了均衡的存在性,设计了最优策略选取算法。最后,通过应用实例验证了所提模型的实用性和算法的有效性。

关键词: 时间博弈, 移动目标攻击, 移动目标防御, 最优策略选取, Markov决策

Abstract:

For the problem that the existed game model was challenging to model the dynamic continuous characteristics of network attack and defense confrontation effectively,a method based on Markov time game was proposed to select the optimal strategy for moving target defense.Based on the analysis of the attack and defense confrontation process of moving targets,the set of moving target attack and defense strategies was constructed.The dynamics of the single-stage moving target defense process was described by time game.The randomness of multi-stage moving target defense state transformation was described by Markov decision process.At the same time,by abstracting the use of resource vulnerability by attack-defense participants as the alternation of the control of the attack surface,the versatility of the game model was effectively guaranteed.On this basis,the existence of equilibrium was analyzed and proved,and the optimal strategy selection algorithm was designed.Finally,the practicality of the constructed model and the effectiveness of the algorithm are verified by an application example.

Key words: time game, moving target attack, moving target defense, optimal strategy selection, Markov decision

中图分类号: 

No Suggested Reading articles found!