Data Skew in Federated Learning: An Experimental Evaluation on Aggregation Algorithms
2024 (English)In: 2024 2nd International Conference on Federated Learning Technologies and Applications, FLTA 2024 / [ed] Awaysheh F.M., Alawadi S., Alawadi S., Carnevale L., Lloret Mauri J., Alsmirat M., Institute of Electrical and Electronics Engineers (IEEE), 2024, p. 162-170Conference paper, Published paper (Refereed)
Abstract [en]
Federated Learning (FL), a revolutionized privacy-preserving distributed Machine Learning (ML), enables models to learn from data distributed across multiple devices at the edge, empowering Edge Intelligence (EI) applications. However, a significant challenge within FL is the issue of data skew, where data distribution across devices varies significantly, potentially impairing model performance. This paper investigates this challenge by exploring the application of FL in a complex facial ethnicity classification, including blurry label boundaries and non-IID data distribution. The paper systematically examines the effects of data skews on FL aggregation algorithms over five algorithms and three different datasets using multiple scenarios. In particular, in scenarios involving sensitive non-IID data such as facial attributes. Our approach involves a novel methodology that adapts aggregation techniques to handle better the heterogeneous data distributions typical of real-world FL environments, demonstrating the potential for more robust and equitable model performance across diverse edge devices. Key findings highlight the importance of FL in preserving data privacy while facilitating model improvement, exemplifying its potential in diverse fields beyond biometrics.
Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024. p. 162-170
Keywords [en]
Aggregation Algorithms, Data Skew, Federated Learning, Non-IID, Race Classification, Soft Biometrics, Adversarial machine learning, Contrastive Learning, Differential privacy, Data distribution, Experimental evaluation, IID data, Modeling performance, Privacy preserving
National Category
Artificial Intelligence Security, Privacy and Cryptography
Identifiers
URN: urn:nbn:se:bth-27497DOI: 10.1109/FLTA63145.2024.10840118Scopus ID: 2-s2.0-85217834069ISBN: 9798350354812 (print)OAI: oai:DiVA.org:bth-27497DiVA, id: diva2:1941340
Conference
2nd IEEE International Conference on Federated Learning Technologies and Applications, FLTA 2024, Hybrid, Valencia, Sept 7-20, 2024
2025-02-282025-02-282025-02-28Bibliographically approved