Flexible Clustered Federated Learning for Client-Level Data Distribution Shift

Duan, Moming; Liu, Duo; Ji, Xinyuan; Wu, Yu; Liang, Liang; Chen, Xianzhang; Tan, Yujuan

doi:10.48550/arxiv.2108.09749

Cited by 1 publication

(2 citation statements)

References 26 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Previous works that have considered collaborative training and server failure include [35], [36] in the context of FL and [12], [32], [37] that consider gossip-based training schemes. While gossip-based methods rely on random walks of data, the former, FL modification methods opt for a clustering method from which Tol-FL draws its inspiration.…”

Section: B Prior Federated Learning Researchmentioning

confidence: 99%

“…While gossip-based methods rely on random walks of data, the former, FL modification methods opt for a clustering method from which Tol-FL draws its inspiration. Ignoring the constraints of local communication, the FL-based methods [35], [36], [11] determine a natural grouping scheme over all of the available devices based on the similarity between their datasets. This modified scheme discards the speed benefit that arises through single-hop communications and practical considerations of link availability and instead forming virtual clusters that may include devices with large communications delays.…”

Section: B Prior Federated Learning Researchmentioning

confidence: 99%

See 1 more Smart Citation

Failure-tolerant Distributed Learning for Anomaly Detection in Wireless Networks

Katzef¹,

Cullen²,

Alpcan³

et al. 2023

Preprint

View full text Add to dashboard Cite

The analysis of distributed techniques is often focused upon their efficiency, without considering their robustness (or lack thereof). Such a consideration is particularly important when devices or central servers can fail, which can potentially cripple distributed systems. When such failures arise in wireless communications networks, important services that they use/provide (like anomaly detection) can be left inoperable and can result in a cascade of security problems. In this paper, we present a novel method to address these risks by combining both flat-and star-topologies, combining the performance and reliability benefits of both. We refer to this method as "Tol-FL", due to its increased failure-tolerance as compared to the technique of Federated Learning. Our approach both limits device failure risks while outperforming prior methods by up to 8% in terms of anomaly detection AUROC in a range of realistic settings that consider client as well as server failure, all while reducing communication costs. This performance demonstrates that Tol-FL is a highly suitable method for distributed model training for anomaly detection, especially in the domain of wireless networks.

show abstract