A Hierarchical Gradient Tracking Algorithm for Mitigating Subnet-Drift in Fog Learning Networks

Name: A Hierarchical Gradient Tracking Algorithm for Mitigating Subnet-Drift in Fog Learning Networks
Keywords: Networking and Internet Architecture (cs.NI), FOS: Computer and information sciences, Networking and Internet Architecture

Chen, Evan; Wang, Shiqiang; Brinton, Christopher G.

Found an issue? Give us feedback

arXiv.org e-Print Ar...arrow_drop_down

arXiv.org e-Print Archive

Preprint . 2024

Data sources: arXiv.org e-Print Archive

https://dx.doi.org/10.48550/ar...

Article . 2024

License: arXiv Non-Exclusive Distribution

Data sources: Datacite

A Hierarchical Gradient Tracking Algorithm for Mitigating Subnet-Drift in Fog Learning Networks

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Jan 2024Embargo end date: 01 Jan 2024Publisher:arXivFunded by:NSF | Collaborative Research: C..., NSF | GOALI: CNS: Medium: Commu...

Authors: Chen, Evan; Wang, Shiqiang; Brinton, Christopher G.;

doi: 10.48550/arxiv.2409.17430

arXiv: 2409.17430

A Hierarchical Gradient Tracking Algorithm for Mitigating Subnet-Drift in Fog Learning Networks

- Summary
- Subjects
- Metrics

Abstract

Federated learning (FL) encounters scalability challenges when implemented over fog networks that do not follow FL's conventional star topology architecture. Semi-decentralized FL (SD-FL) has proposed a solution for device-to-device (D2D) enabled networks that divides model cooperation into two stages: at the lower stage, D2D communications is employed for local model aggregations within subnetworks (subnets), while the upper stage handles device-server (DS) communications for global model aggregations. However, existing SD-FL schemes are based on gradient diversity assumptions that become performance bottlenecks as data distributions become more heterogeneous. In this work, we develop semi-decentralized gradient tracking (SD-GT), the first SD-FL methodology that removes the need for such assumptions by incorporating tracking terms into device updates for each communication layer. Our analytical characterization of SD-GT reveals upper bounds on convergence for non-convex, convex, and strongly-convex problems. We show how the bounds enable the development of an optimization algorithm that navigates the performance-efficiency trade-off by tuning subnet sampling rate and D2D rounds for each global training interval. Our subsequent numerical evaluations demonstrate that SD-GT obtains substantial improvements in trained model quality and communication cost relative to baselines in SD-FL and gradient tracking on several datasets.

This paper is under revision in IEEE/ACM Transactions on Networking

Related Organizations

Purdue University Northwest
United States
TRUSTEES OF PURDUE UNIVERSITY
United States
Purdue University System
United States
University of Kansas Center for Research Inc
United States

Keywords

Networking and Internet Architecture (cs.NI), FOS: Computer and information sciences, Networking and Internet Architecture

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green

Funded by

NSF| Collaborative Research: CPS Medium: Learning through the Air: Cross-Layer UAV Orchestration for Online Federated Optimization, NSF| GOALI: CNS: Medium: Communication-Computation Co-Design for Rural Connectivtiy and Intelligence under Nonuniformity: Modeling, Analysis, and Implementation