Journal on Communications ›› 2021, Vol. 42 ›› Issue (6): 62-71.doi: 10.11959/j.issn.1000-436x.2021111

Special Issue: 联邦学习

• Papers • Previous Articles     Next Articles

Node selection method in federated learning based on deep reinforcement learning

Wenchen HE1, Shaoyong GUO1, Xuesong QIU1, Liandong CHEN2, Suxiang ZHANG3   

  1. 1 State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China
    2 Hebei State Grid Information &Telecommunication Branch, Shijiazhuang 050011, China
    3 State Grid Information & Telecommunication Branch, Beijing 100761, China
  • Revised:2021-04-16 Online:2021-06-25 Published:2021-06-01
  • Supported by:
    The National Natural Science Foundation of China(62071070);Key Project Plan of Blockchain in Ministry of Education of the People’s Republic of China(2020KJ010802);The Key Research and Development Program of Hebei Province(20310103D)

Abstract:

To cope with the impact of different device computing capabilities and non-independent uniformly distributed data on federated learning performance, and to efficiently schedule terminal devices to complete model aggregation, a method of node selection based on deep reinforcement learning was proposed.It considered training quality and efficiency of heterogeneous terminal devices, and filtrate malicious nodes to guarantee higher model accuracy and shorter training delay of federated learning.Firstly, according to characteristics of model distributed training in federated learning, a node selection system model based on deep reinforcement learning was constructed.Secondly, considering such factors as device training delay, model transmission delay and accuracy, an optimization model of accuracy for node selection was proposed.Finally, the problem model was constructed as a Markov decision process and a node selection algorithm based on distributed proximal strategy optimization was designed to obtain a reasonable set of devices before each training iteration to complete model aggregation.Simulation results demonstrate that the proposed method significantly improves the accuracy and training speed of federated learning, and its convergence and robustness are also well.

Key words: federated learning, model aggregation, node selection, deep reinforcement learning, accuracy

CLC Number: 

No Suggested Reading articles found!