Journal on Communications ›› 2022, Vol. 43 ›› Issue (8): 30-40.doi: 10.11959/j.issn.1000-436x.2022148

• Papers • Previous Articles     Next Articles

Forwarding efficiency aware traffic scheduling algorithm based on deep reinforcement learning

Zongxuan SHA1, Ru HUO1,2, Chuang SUN3, Shuo WANG2,4, Tao HUANG2,4   

  1. 1 Information Department, Beijing University of Technology, Beijing 100124, China
    2 Purple Mountain Laboratories, Nanjing 211111, China
    3 Department of Automation, Tsinghua University, Beijing 100084, China
    4 State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Revised:2022-07-19 Online:2022-08-25 Published:2022-08-01
  • Supported by:
    The MIIT of China 2020 (Identification Resources Search System for Industrial Internet of Things);Young Elite Scientist Sponsorship Program by CAST and China-CIC(YESS20200287)

Abstract:

The software defined network separates the control plane from the data plane to achieve flexible traffic scheduling, which can use network resources more efficiently.However, with the increase of the number of flow entries, load rate, the number of connected hosts, and other factors, the forwarding efficiency of the SDN switch will be reduced, which will affect the end-to-end transmission delay.To solve the above problems, the forwarding efficiency aware traffic scheduling algorithm based on deep reinforcement learning was proposed.First, the switch state was integrated into the perception model, and the mapping relationship between switch state information and forwarding efficiency was established based on neural network.Then, combined with network state and traffic information, traffic scheduling policy was generated through deep reinforcement learning.Finally, the expert samples generated by the shortest path and load balance algorithms could guide the model training, which enabled the model to learn knowledge from expert samples to improve performance and accelerated the training process.The experimental results show that the proposed algorithm not only reduces the average end-to-end transmission delay by 15.31%, but also ensures the overall load balance of the network, compared with other algorithms.

Key words: software defined network, deep reinforcement learning, traffic scheduling, forwarding efficiency aware, load balance

CLC Number: 

No Suggested Reading articles found!