Journal on Communications ›› 2023, Vol. 44 ›› Issue (1): 164-176.doi: 10.11959/j.issn.1000-436x.2023007

• Papers • Previous Articles     Next Articles

Data augmentation scheme for federated learning with non-IID data

Lingtao TANG1, Di WANG1, Shengyun LIU2   

  1. 1 State Key Laboratory of Mathematical Engineering and Advanced Computing, Wuxi 214125, China
    2 School of Cyber Science and Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
  • Revised:2022-11-16 Online:2023-01-25 Published:2023-01-01
  • Supported by:
    The National Key Research and Development Program of China(2016YFB1000500);The National Science and Technology Major Project(2018ZX01028102)

Abstract:

To solve the problem that the model accuracy remains low when the data are not independent and identically distributed (non-IID) across different clients in federated learning, a privacy-preserving data augmentation scheme was proposed.Firstly, a data augmentation framework for federated learning scenarios was designed.All clients generated synthetic samples locally and shared them with each other, which eased the problem of client drift caused by the difference of clients’ data distributions.Secondly, based on generative adversarial network and differential privacy, a private sample generation algorithm was proposed.It helped clients to generate informative samples while preserving the privacy of clients’ local data.Finally, a differentially private label selection algorithm was proposed to ensure the labels of synthetic samples will not leak information.Simulation results demonstrate that under multiple non-IID data partition strategies, the proposed scheme can consistently improve the model accuracy and make the model converge faster.Compared with the benchmark approaches, the proposed scheme can achieve at least 25% accuracy improvement when each client has only one class of samples.

Key words: federated learning, non-IID, generative adversarial network, differential privacy, data augmentation

CLC Number: 

No Suggested Reading articles found!