Journal on Communications ›› 2023, Vol. 44 ›› Issue (5): 123-136.doi: 10.11959/j.issn.1000-436x.2023102

• Papers • Previous Articles     Next Articles

High-performance federated continual learning algorithm for heterogeneous streaming data

Hui JIANG1,2, Tianliu HE1,2, Min LIU1,2,3, Sheng SUN1, Yuwei WANG1,2   

  1. 1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
    2 School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 100190, China
    3 Zhongguancun Laboratory, Beijing 100084, China
  • Revised:2023-04-28 Online:2023-05-25 Published:2023-05-01
  • Supported by:
    The National Key Research and Development Program of China(2021YFB2900102);The National Natural Science Foundation of China(62072436)

Abstract:

Aiming at the problems of poor model performance and low training efficiency in training streaming data of AI models that provide intelligent services, a high-performance federated continual learning algorithm for heterogeneous streaming data (FCL-HSD) was proposed in the distributed terminal system with privacy data.In order to solve the problem of the current model forgetting old data, a model with dynamically extensible structure was introduced in the local training stage, and an extension audit mechanism was designed to ensure the capability of the AI model to recognize old data at the cost of small storage overhead.Considering the heterogeneity of terminal data, a customized global model strategy based on data distribution similarity was designed at the central server side, and an aggregation-by-block manner was implemented for different modules of the model.The feasibility and effectiveness of the proposed algorithm were verified under various data increment scenarios with different data sets.Experimental results show that, compared with existing works, the proposed algorithm can effectively improve the model performance to classify old data on the premise of ensuring the capability to classify new data.

Key words: heterogeneous data, streaming data, federated learning, federated continual learning, catastrophic forgetting

CLC Number: 

No Suggested Reading articles found!