
arXiv: 2208.09378
Federated Learning (FL) is a distributed machine learning paradigm that enables learning models from decentralized private datasets, where the labeling effort is entrusted to the clients. While most existing FL approaches assume high-quality labels are readily available on users' devices, in reality label noise can naturally occur in FL and is closely related to clients' characteristics. Due to the scarcity of available data and significant label noise variations among clients in FL, existing state-of-the-art centralized approaches exhibit unsatisfactory performance, whereas prior FL studies rely on excessive on-device computational schemes or additional clean data available on the server. We propose FedLN, a framework to deal with label noise across different FL training stages, namely FL initialization, on-device model training, and server model aggregation, which can accommodate the diverse computational capabilities of devices in an FL system. Specifically, FedLN computes a per-client noise-level estimate in a single federated round and improves model performance by either correcting or mitigating the effect of noisy samples. Our evaluation on various publicly available vision and audio datasets demonstrates a 22% improvement on average over existing methods at a label noise level of 60%. We further validate the efficiency of FedLN on human-annotated real-world noisy datasets and report a 4.8% average increase in models' recognition performance, highlighting that FedLN can be useful for improving FL services provided to everyday users.
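
To make the idea of per-client noise-level estimation more concrete, below is a minimal illustrative sketch, not the authors' exact FedLN algorithm. It assumes each client holds feature embeddings from some shared pre-trained encoder together with its (possibly noisy) labels, and estimates the noise level from disagreement between each sample's label and a k-NN consensus over its neighbours; likely-noisy samples are then down-weighted. All function and variable names (`estimate_noise_level`, `reweight_samples`, the choice of k, etc.) are hypothetical.

```python
# Illustrative sketch only: simplified per-client label-noise estimation via
# k-NN consensus over pre-trained embeddings, followed by sample re-weighting.
# This is an assumption-laden toy version, not the paper's exact method.
import numpy as np

def knn_predict(embeddings: np.ndarray, labels: np.ndarray, k: int = 10) -> np.ndarray:
    """Predict each sample's label by majority vote over its k nearest neighbours."""
    # Pairwise squared Euclidean distances between all local samples.
    d2 = ((embeddings[:, None, :] - embeddings[None, :, :]) ** 2).sum(-1)
    np.fill_diagonal(d2, np.inf)            # never count a sample as its own neighbour
    nn_idx = np.argsort(d2, axis=1)[:, :k]  # indices of the k closest samples
    nn_labels = labels[nn_idx]              # (n, k) array of neighbour labels
    return np.array([np.bincount(row).argmax() for row in nn_labels])

def estimate_noise_level(embeddings: np.ndarray, labels: np.ndarray, k: int = 10) -> float:
    """Estimated fraction of samples whose label disagrees with its neighbourhood."""
    consensus = knn_predict(embeddings, labels, k)
    return float((consensus != labels).mean())

def reweight_samples(embeddings: np.ndarray, labels: np.ndarray, k: int = 10) -> np.ndarray:
    """Down-weight samples whose label disagrees with the k-NN consensus."""
    consensus = knn_predict(embeddings, labels, k)
    return np.where(consensus == labels, 1.0, 0.0)  # 1 = likely clean, 0 = likely noisy

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy client dataset: two Gaussian clusters with 40% of labels flipped.
    x = np.vstack([rng.normal(0, 1, (100, 8)), rng.normal(4, 1, (100, 8))])
    y = np.array([0] * 100 + [1] * 100)
    flip = rng.choice(200, size=80, replace=False)
    y[flip] = 1 - y[flip]
    print("estimated noise level:", estimate_noise_level(x, y))
    print("fraction of samples kept:", reweight_samples(x, y).mean())
```

In an FL setting, each client would run such an estimate locally in a single federated round and report (or act on) its noise level, which is consistent with the abstract's description of per-client estimation followed by correction or mitigation of noisy samples.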
FOS: Computer and information sciences, Computer Science - Machine Learning, knowledge distillation, label correction, Federated learning, deep learning, noisy labels, Machine Learning (cs.LG)
| Indicator | Description | Value |
| --- | --- | --- |
| Selected citations | Citations derived from selected sources; an alternative to the "Influence" indicator, which reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 15 |
| Popularity | Reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% |
| Influence | Reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% |
| Impulse | Reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
