Journal on Communications ›› 2018, Vol. 39 ›› Issue (8): 56-68.doi: 10.11959/j.issn.1000-436x.2018145

• Artificial Intelligence and Network Security • Previous Articles     Next Articles

Defense decision-making method based on incomplete information stochastic game and Q-learning

Hongqi ZHANG1,2,Junnan YANG1,2,Chuanfu ZHANG1,2   

  1. 1 The Third Institute,Information Engineering University,Zhengzhou 450001,China
    2 Henan Province Key Laboratory of Information Security,Zhengzhou 450001,China
  • Revised:2018-07-10 Online:2018-08-01 Published:2018-09-13
  • Supported by:
    The National High Technology Researchand Development Program of China(863Program)(2014AA7116082);The National High Technology Researchand Development Program of China(863Program)(2015AA7116040)

Abstract:

Most of the existing stochastic games are based on the assumption of complete information,which are not consistent with the fact of network attack and defense.Aiming at this problem,the uncertainty of the attacker’s revenue was transformed to the uncertainty of the attacker type,and then a stochastic game model with incomplete information was constructed.The probability of network state transition is difficult to determine,which makes it impossible to determine the parameter needed to solve the equilibrium.Aiming at this problem,the Q-learning was introduced into stochastic game,which allowed defender to get the relevant parameter by learning in network attack and defense and to solve Bayesian Nash equilibrium.Based on the above,a defense decision algorithm that could learn online was designed.The simulation experiment proves the effectiveness of the proposed method.

Key words: network attack and defense, stochastic game, Q-learning, Bayesian Nash equilibrium, defense strategy

CLC Number: 

No Suggested Reading articles found!