Abstract

Does Federated Learning (FL) work when <i>both</i> uplink and downlink communications have errors? How much communication noise can FL handle and what is its impact on the learning performance? This work is devoted to answering these practically important questions by explicitly incorporating both uplink and downlink noisy channels in the FL pipeline. We present several novel convergence analyses of FL over simultaneous uplink and downlink noisy communication channels, which encompass full and partial clients participation, direct model and model differential transmissions, and non-independent and identically distributed (IID) local datasets. These analyses characterize the sufficient conditions for FL over noisy channels to have the same convergence behavior as the ideal case of no communication error. More specifically, in order to maintain the <inline-formula> <tex-math notation="LaTeX">$\mathcal {O}({1}/{T})$ </tex-math></inline-formula> convergence rate of FED AVG with perfect communications, the uplink and downlink signal-to-noise ratio (SNR) for direct model transmissions should be controlled such that they scale as <inline-formula> <tex-math notation="LaTeX">$\mathcal {O}(t^{2})$ </tex-math></inline-formula> where <inline-formula> <tex-math notation="LaTeX">${t}$ </tex-math></inline-formula> is the index of communication rounds, but can stay <inline-formula> <tex-math notation="LaTeX">$\mathcal {O}(1)$ </tex-math></inline-formula> (i.e., constant) for model differential transmissions. The key insight of these theoretical results is a &#x201C;flying under the radar&#x201D; principle &#x2013; stochastic gradient descent (SGD) is an inherent noisy process and uplink/downlink communication noises can be tolerated as long as they do not dominate the time-varying SGD noise. We exemplify these theoretical findings with two widely adopted communication techniques &#x2013; transmit power control and receive diversity combining &#x2013; and further validate their performance advantages over the standard methods via numerical experiments using several real-world FL tasks.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.