2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP) 2019
DOI: 10.1109/globalsip45357.2019.8969185
|View full text |Cite
|
Sign up to set email alerts
|

Collaborative Machine Learning at the Wireless Edge with Blind Transmitters

Abstract: We study wireless collaborative machine learning (ML), where mobile edge devices, each with its own dataset, carry out distributed stochastic gradient descent (DSGD) over-the-air with the help of a wireless access point acting as the parameter server (PS). At each iteration of the DSGD algorithm wireless devices compute gradient estimates with their local datasets, and send them to the PS over a wireless fading multiple access channel (MAC). Motivated by the additive nature of the wireless MAC, we propose an a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
53
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2
1

Relationship

2
6

Authors

Journals

citations
Cited by 56 publications
(53 citation statements)
references
References 20 publications
0
53
0
Order By: Relevance
“…In the case of fading channels [11], each node uses power control to cancel the channel effect at the receiver, where nodes that experience deep fading do not transmit. In [12], the authors extended the method for transmitting without knowing the channel state at the transmitter. The channel fading is mitigated at the receiver by using multiple antennas, where the fading diminishes as the number of antennas approaches infinity.…”
Section: B Distributed Learning Over Macmentioning
confidence: 99%
“…In the case of fading channels [11], each node uses power control to cancel the channel effect at the receiver, where nodes that experience deep fading do not transmit. In [12], the authors extended the method for transmitting without knowing the channel state at the transmitter. The channel fading is mitigated at the receiver by using multiple antennas, where the fading diminishes as the number of antennas approaches infinity.…”
Section: B Distributed Learning Over Macmentioning
confidence: 99%
“…In [11], [12], the bandwidth-efficient analog communication technique in [10] is combined with power control over a bandwidth-limited fading MAC, significantly reducing the communication load. Beamforming techniques at a multi-antenna PS for increasing the number of participating devices and overcoming the lack of CSI at the devices are introduced in [13] and [14], respectively. In [15], resource allocation across devices for FL over wireless channels is formulated as an optimization problem aiming to minimize the learning empirical loss function.…”
Section: Introductionmentioning
confidence: 99%
“…Accordingly, for fixed gradient values, each of the M interference terms in (45b) has zero mean and their variances scale with M−1 K . Thus, similar to the ideal case (where the receive chains are equipped with infinite resolution ADCs as considered in [12]), the interference term approaches zero as K → ∞. In other words, using a sufficiently large number of antennas at the PS eliminates the destructive effects of the interference on the learning process, and the estimate for the gradient vector is obtained as…”
Section: Dsgd With Low-resolution Adcs At the Psmentioning
confidence: 97%
“…where n ∈ [N +N cp ], L is the number of channel taps, τ mkl is the time delay and h t mkl ∈ C is the gain of the l-th channel tap from the m-th worker to the k-th antenna of the PS. Note that this is nothing but the machine learning over-the-air framework of [12]. We assume that h t mkl are zero-mean (circularly symmetric) complex Gaussian with…”
Section: System Modelmentioning
confidence: 99%
See 1 more Smart Citation