Telecommunications Science ›› 2022, Vol. 38 ›› Issue (11): 86-95.doi: 10.11959/j.issn.1000-0801.2022264

• Research and Development • Previous Articles     Next Articles

Intelligent anti-jamming decision algorithm of bivariate frequency hopping pattern based on ET-PPO

Yibo CHEN, Zhijin ZHAO   

  1. School of Communication Engineering, Hangzhou Dianzi University, Hangzhou 310018, China
  • Revised:2022-09-29 Online:2022-11-20 Published:2022-11-01
  • Supported by:
    The National Natural Science Foundation of China(U19B2016)

Abstract:

In order to further improve its anti-interference ability in complex electromagnetic environment, a PPO algorithm based on weighted importance sampling and eligibility traces (ET-PPO) was proposed.On the basis of the traditional frequency hopping pattern, time-varying parameters were introduced, and the bivariate frequency hopping pattern decision problem was modeled as a Markov decision problem through the construction of the state-action-reward triple.Aiming at the high variance problem of the sample update method of an actor network of the PPO algorithm, weighted importance sampling was introduced to reduce the variance, and the action selection strategy of Beta distribution was used to enhance the stability of the learning stage.Aiming at the problem of slow convergence speed of the evaluator network, the eligibility trace method was introduced, which better balanced the convergence speed and the global optimal solution.The algorithm comparison simulation results in different electromagnetic interference environments show that ET-PPO has better adaptability and stability, and has better performance against obstruction interference and sweep frequency interference.

Key words: complex electromagnetic environment, bivariate frequency hopping pattern, proximal policy optimization, eligibility trace

CLC Number: 

No Suggested Reading articles found!