Journal on Communications ›› 2022, Vol. 43 ›› Issue (10): 106-120.doi: 10.11959/j.issn.1000-436x.2022202

• Papers • Previous Articles     Next Articles

Deception defense method against intelligent penetration attack

Jinyin CHEN1,2, Shulong HU1,2, Changyou XING3, Guomin ZHANG3   

  1. 1 College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
    2 Institute of Cyber Space Security, Zhejiang University of Technology, Hangzhou 310023, China
    3 College of Command &Control Engineering, Army Engineering University, Nanjing 210007, China
  • Revised:2022-09-29 Online:2022-10-25 Published:2022-10-01
  • Supported by:
    The National Natural Science Foundation of China(62072406);The Key Research and Development Program of Zhejiang Province(2021C01117);The 2020 Industrial Internet Innovation Development Project(TC200H01V);The Ten Thousand Talents Program of Zhejiang Province(2020R52011)

Abstract:

The intelligent penetration attack based on reinforcement learning aims to model the penetration process as a Markov decision process, and train the attacker to optimize the penetration path in a trial-and-error manner, so as to achieve strong attack performance.In order to prevent intelligent penetration attacks from being maliciously exploited, a deception defense method for intelligent penetration attack based on reinforcement learning was proposed.Firstly, obtaining the necessary information for the attacker to construct the penetration model, which included state, action and reward.Secondly, conducting deception defense against the attacker through inverting the state dimension, disrupting the action generation, and flipping the reward value sign, respectively, which corresponded to the early, middle and final stages of the penetration attack.At last, the three-stage defense comparison experiments were carried out in the same network environment.The results show that the proposed method can effectively reduce the success rate of intelligent penetration attacks based on reinforcement learning.Besides, the deception method that disrupts the action generation of the attacker can reduce the penetration attack success rate to 0 when the interference ratio is 20%.

Key words: reinforcement learning, intelligent penetration attack, attack path, deception defense

CLC Number: 

No Suggested Reading articles found!