A Review of Solving Non-IID Data in Federated Learning: Current Status and Future Directions

Publication Name

Communications in Computer and Information Science

Abstract

Federated learning (FL), as a machine learning framework, has garnered substantial attention from researchers in recent years. FL makes it possible to train a global model through coordination by a central server while ensuring the privacy of data on individual edge devices. However, the data on edge devices that participate in FL training are not independently and identically distributed (IID), resulting in challenges related to heterogeneity data. In this paper, we introduce the challenges generated by non-IID data to FL and provide a detailed classification of non-IID data. Then, we summarize the existing solutions to non-IID data in FL from the perspectives of data and process. To the best of our knowledge, despite the considerable efforts achieved by many researchers in solving the non-IID problem, some issues remain unsolved. This paper provides researchers with the latest findings and analyzes the potential future directions for solving non-IID in FL.

Open Access Status

This publication is not available as open access

Volume

2058 CCIS

First Page

58

Last Page

72

Funding Number

620MS021

Funding Sponsor

Natural Science Foundation of Hainan Province

Share

COinS
 

Link to publisher version (DOI)

http://dx.doi.org/10.1007/978-981-97-1277-9_5