Optimizing Multi-Objective Federated Learning on Non-IID Data with Improved NSGA-III and Hierarchical Clustering

Federated learning (FL) can tackle the problem of data silos of asymmetric information and privacy leakage; however, it still has shortcomings, such as data heterogeneity, high communication cost and uneven distribution of performance. To overcome these issues and achieve parameter optimization of F...

Full description

Bibliographic Details
Main Authors: Jialin Zhong, Yahui Wu, Wubin Ma, Su Deng, Haohao Zhou
Format: Article
Language:English
Published: MDPI AG 2022-05-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/14/5/1070
_version_ 1797495000282431488
author Jialin Zhong
Yahui Wu
Wubin Ma
Su Deng
Haohao Zhou
author_facet Jialin Zhong
Yahui Wu
Wubin Ma
Su Deng
Haohao Zhou
author_sort Jialin Zhong
collection DOAJ
description Federated learning (FL) can tackle the problem of data silos of asymmetric information and privacy leakage; however, it still has shortcomings, such as data heterogeneity, high communication cost and uneven distribution of performance. To overcome these issues and achieve parameter optimization of FL on non-Independent Identically Distributed (non-IID) data, a multi-objective FL parameter optimization method based on hierarchical clustering and the third-generation non-dominated sorted genetic algorithm III (NSGA-III) algorithm is proposed, which aims to simultaneously minimize the global model error rate, global model accuracy distribution variance and communication cost. The introduction of a hierarchical clustering algorithm on non-IID data can accelerate convergence so that FL can employ an evolutionary algorithm with a low FL client participation ratio, reducing the overall communication cost of the NSGA-III algorithm. Meanwhile, the NSGA-III algorithm, with fast greedy initialization and a strategy of discarding low-quality individuals (named NSGA-III-FD), is proposed to improve the convergence efficiency and the quality of Pareto-optimal solutions. Under two non-IID data settings, the CNN experiments on both MNIST and CIFAR-10 datasets show that our approach can obtain better Pareto-optimal solutions than classical evolutionary algorithms, and the selected solutions with an optimized model can achieve better multi-objective equilibrium than the standard federated averaging (FedAvg) algorithm and the Clustering-based FedAvg algorithm.
first_indexed 2024-03-10T01:43:15Z
format Article
id doaj.art-5975549949c441b29505d29c95f16845
institution Directory Open Access Journal
issn 2073-8994
language English
last_indexed 2024-03-10T01:43:15Z
publishDate 2022-05-01
publisher MDPI AG
record_format Article
series Symmetry
spelling doaj.art-5975549949c441b29505d29c95f168452023-11-23T13:20:56ZengMDPI AGSymmetry2073-89942022-05-01145107010.3390/sym14051070Optimizing Multi-Objective Federated Learning on Non-IID Data with Improved NSGA-III and Hierarchical ClusteringJialin Zhong0Yahui Wu1Wubin Ma2Su Deng3Haohao Zhou4Science and Technology on Information System Engineering Laboratory, National University of Defense Technology, Changsha 410073, ChinaScience and Technology on Information System Engineering Laboratory, National University of Defense Technology, Changsha 410073, ChinaScience and Technology on Information System Engineering Laboratory, National University of Defense Technology, Changsha 410073, ChinaScience and Technology on Information System Engineering Laboratory, National University of Defense Technology, Changsha 410073, ChinaScience and Technology on Information System Engineering Laboratory, National University of Defense Technology, Changsha 410073, ChinaFederated learning (FL) can tackle the problem of data silos of asymmetric information and privacy leakage; however, it still has shortcomings, such as data heterogeneity, high communication cost and uneven distribution of performance. To overcome these issues and achieve parameter optimization of FL on non-Independent Identically Distributed (non-IID) data, a multi-objective FL parameter optimization method based on hierarchical clustering and the third-generation non-dominated sorted genetic algorithm III (NSGA-III) algorithm is proposed, which aims to simultaneously minimize the global model error rate, global model accuracy distribution variance and communication cost. The introduction of a hierarchical clustering algorithm on non-IID data can accelerate convergence so that FL can employ an evolutionary algorithm with a low FL client participation ratio, reducing the overall communication cost of the NSGA-III algorithm. Meanwhile, the NSGA-III algorithm, with fast greedy initialization and a strategy of discarding low-quality individuals (named NSGA-III-FD), is proposed to improve the convergence efficiency and the quality of Pareto-optimal solutions. Under two non-IID data settings, the CNN experiments on both MNIST and CIFAR-10 datasets show that our approach can obtain better Pareto-optimal solutions than classical evolutionary algorithms, and the selected solutions with an optimized model can achieve better multi-objective equilibrium than the standard federated averaging (FedAvg) algorithm and the Clustering-based FedAvg algorithm.https://www.mdpi.com/2073-8994/14/5/1070federated learningmulti-objective optimizationNSGA-IIIparameter optimization
spellingShingle Jialin Zhong
Yahui Wu
Wubin Ma
Su Deng
Haohao Zhou
Optimizing Multi-Objective Federated Learning on Non-IID Data with Improved NSGA-III and Hierarchical Clustering
Symmetry
federated learning
multi-objective optimization
NSGA-III
parameter optimization
title Optimizing Multi-Objective Federated Learning on Non-IID Data with Improved NSGA-III and Hierarchical Clustering
title_full Optimizing Multi-Objective Federated Learning on Non-IID Data with Improved NSGA-III and Hierarchical Clustering
title_fullStr Optimizing Multi-Objective Federated Learning on Non-IID Data with Improved NSGA-III and Hierarchical Clustering
title_full_unstemmed Optimizing Multi-Objective Federated Learning on Non-IID Data with Improved NSGA-III and Hierarchical Clustering
title_short Optimizing Multi-Objective Federated Learning on Non-IID Data with Improved NSGA-III and Hierarchical Clustering
title_sort optimizing multi objective federated learning on non iid data with improved nsga iii and hierarchical clustering
topic federated learning
multi-objective optimization
NSGA-III
parameter optimization
url https://www.mdpi.com/2073-8994/14/5/1070
work_keys_str_mv AT jialinzhong optimizingmultiobjectivefederatedlearningonnoniiddatawithimprovednsgaiiiandhierarchicalclustering
AT yahuiwu optimizingmultiobjectivefederatedlearningonnoniiddatawithimprovednsgaiiiandhierarchicalclustering
AT wubinma optimizingmultiobjectivefederatedlearningonnoniiddatawithimprovednsgaiiiandhierarchicalclustering
AT sudeng optimizingmultiobjectivefederatedlearningonnoniiddatawithimprovednsgaiiiandhierarchicalclustering
AT haohaozhou optimizingmultiobjectivefederatedlearningonnoniiddatawithimprovednsgaiiiandhierarchicalclustering