Papers
Ultrahyperbolic Neural Networks
Law, Marc
Riemannian space forms, such as the Euclidean space, sphere and hyperbolic space, are popular and powerful representation spaces in machine learning. For instance, hyperbolic geometry is appropriate to represent graphs without cycles and has been used to extend Graph Neural Networks. Recently, some pseudo-Riemannian space forms that generalize both hyperbolic and spherical geometries have been exploited to learn a specific type of nonparametric embedding called ultrahyperbolic. The lack of a geodesic between every pair of ultrahyperbolic points makes the task of learning parametric models (e.g., neural networks) difficult. This paper introduces a method to learn parametric models in ultrahyperbolic space. We experimentally show the relevance of our approach in the tasks of graph and node classification.
A Communication-efficient Algorithm with Linear Convergence for Federated Minimax Learning
Sun, Zhenyu, Wei, Ermin
In this paper, we study a large-scale multi-agent minimax optimization problem, which models many interesting applications in statistical learning and game theory, including Generative Adversarial Networks (GANs). The overall objective is a sum of agents' private local objective functions. We focus on the federated setting, where agents can perform local computation and communicate with a central server. Most existing federated minimax algorithms either require communication per iteration or lack performance guarantees, with the exception of Local Stochastic Gradient Descent Ascent (SGDA), a multiple-local-update descent ascent algorithm which guarantees convergence under a diminishing stepsize. By analyzing Local SGDA under the ideal condition of no gradient noise, we show that generally it cannot guarantee exact convergence with constant stepsizes and thus suffers from slow rates of convergence. To tackle this issue, we propose FedGDA-GT, an improved Federated (Fed) Gradient Descent Ascent (GDA) method based on Gradient Tracking (GT). When local objectives are Lipschitz smooth and strongly-convex-strongly-concave, we prove that FedGDA-GT converges linearly with a constant stepsize to a global ε-approximation solution with O(log(1/ε)) rounds of communication, which matches the time complexity of the centralized GDA method. Then, we analyze the general distributed minimax problem from a statistical aspect, where the overall objective approximates a true population minimax risk by empirical samples. We provide generalization bounds for learning with this objective through Rademacher complexity analysis. Finally, we numerically show that FedGDA-GT outperforms Local SGDA.
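To make the gradient-tracking idea concrete, below is a minimal sketch of tracking-corrected local descent ascent on toy quadratic saddle problems. This is a reading of the abstract, not the paper's pseudocode: the quadratic objectives, stepsize, and the exact form of the correction (local drift subtracted, round-start global gradient added back) are illustrative assumptions.

import numpy as np

# Toy local objectives f_i(x, y) = 0.5*a_i*x^2 + b_i*x*y - 0.5*c_i*y^2,
# each Lipschitz smooth and strongly-convex-strongly-concave.
rng = np.random.default_rng(0)
m = 5
a, b, c = rng.uniform(1, 2, m), rng.uniform(-1, 1, m), rng.uniform(1, 2, m)

def grad(i, x, y):
    return a[i] * x + b[i] * y, b[i] * x - c[i] * y  # (df_i/dx, df_i/dy)

x, y = 1.0, 1.0
eta, K = 0.05, 10
for _ in range(200):  # communication rounds
    # Server aggregates the global gradient once per round (tracking reference).
    gx = np.mean([grad(i, x, y)[0] for i in range(m)])
    gy = np.mean([grad(i, x, y)[1] for i in range(m)])
    xs, ys = [], []
    for i in range(m):
        xi, yi = x, y
        hx0, hy0 = grad(i, x, y)  # local gradient at the round start
        for _ in range(K):        # local updates with a constant stepsize
            hx, hy = grad(i, xi, yi)
            # Tracking-corrected descent step on x and ascent step on y.
            xi -= eta * (hx - hx0 + gx)
            yi += eta * (hy - hy0 + gy)
        xs.append(xi); ys.append(yi)
    x, y = np.mean(xs), np.mean(ys)

print(round(x, 6), round(y, 6))  # converges toward the global saddle point (0, 0)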
Model Rubik’s Cube: Twisting Resolution, Depth and Width for TinyNets
Han, Kai, Wang, Yunhe, Zhang, Qiulin, Zhang, Wei, XU, Chunjing, Zhang, Tong
To obtain excellent deep neural architectures, a series of techniques are carefully designed in EfficientNets. The giant formula for simultaneously enlarging the resolution, depth and width provides us with a Rubik's cube for neural networks, so that we can find networks with high efficiency and excellent performance by twisting the three dimensions. This paper aims to explore the twisting rules for obtaining deep neural networks with minimum model sizes and computational costs. Different from network enlarging, we observe that resolution and depth are more important than width for tiny networks. Therefore, the original method, i.e., the compound scaling in EfficientNet, is no longer suitable. To this end, we summarize a tiny formula for downsizing neural architectures through a series of smaller models derived from EfficientNet-B0 under a FLOPs constraint. Experimental results on the ImageNet benchmark illustrate that our TinyNet performs much better than the smaller versions of EfficientNets obtained with the inversed giant formula. For instance, our TinyNet-E achieves a 59.9% Top-1 accuracy with only 24M FLOPs, which is about 1.9% higher than that of the previous best MobileNetV3 with similar computational cost. Code will be available at https://github.com/huawei-noah/CV-Backbones/tree/master/tinynet and https://gitee.com/mindspore/mindspore/tree/master/model_zoo/research/cv/tinynet.
GraphTrail: Translating GNN Predictions into Human-Interpretable Logical Rules
Armgaan, Burouj, Dalmia, Manthan, Medya, Sourav, Ranu, Sayan
Instance-level explanation of graph neural networks (GNNs) is a well-studied area. These explainers, however, only explain an instance (e.g., a graph) and fail to uncover the combinatorial reasoning learned by a GNN from the training data towards making its predictions. In this work, we introduce GraphTrail, the first end-to-end, global, post-hoc GNN explainer that translates the functioning of a black-box GNN model to a boolean formula over the (sub)graph level concepts without relying on local explainers. GraphTrail is unique in automatically mining the discriminative subgraph-level concepts using Shapley values. Subsequently, the GNN predictions are mapped to a human-interpretable boolean formula over these concepts through symbolic regression. Extensive experiments across diverse datasets and GNN architectures demonstrate significant improvement over existing global explainers in mapping GNN predictions to faithful logical formulae. The robust and accurate performance of GraphTrail makes it invaluable for improving GNNs and facilitates adoption in domains with strict transparency requirements.
Differentiable Top-k with Optimal Transport
Xie, Yujia, Dai, Hanjun, Chen, Minshuo, Dai, Bo, Zhao, Tuo, Zha, Hongyuan, Wei, Wei, Pfister, Tomas
Finding the k largest or smallest elements from a collection of scores, i.e., the top-k operation, is an important model component widely used in information retrieval, machine learning, and data mining. However, if the top-k operation is implemented in an algorithmic way, e.g., using the bubble sort algorithm, the resulting model cannot be trained in an end-to-end way using prevalent gradient descent algorithms. This is because these implementations typically involve swapping indices, whose gradient cannot be computed. Moreover, the corresponding mapping from the input scores to the indicator vector of whether this element belongs to the top-k set is essentially discontinuous. To address the issue, we propose a smoothed approximation, namely the SOFT (Scalable Optimal transport-based diFferenTiable) top-k operator. Specifically, our SOFT top-k operator approximates the output of the top-k operation as the solution of an Entropic Optimal Transport (EOT) problem. The gradient of the SOFT operator can then be efficiently approximated based on the optimality conditions of the EOT problem. We then apply the proposed operator to the k-nearest neighbors algorithm and the beam search algorithm. Numerical experiments demonstrate that both achieve improved performance.
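A minimal sketch of the entropic-OT view of top-k: mass on the n scores is transported to two anchors, and the rescaled plan column for the "top" anchor acts as a differentiable membership vector. The anchors {0, 1}, the squared-distance cost, and the regularization strength below are illustrative assumptions, not the paper's exact construction.

import numpy as np

def soft_topk(scores, k, eps=1e-2, n_iters=200):
    """Smoothed top-k membership via entropic optimal transport (Sinkhorn)."""
    s = np.asarray(scores, dtype=float)
    n = len(s)
    s = (s - s.min()) / (s.max() - s.min() + 1e-12)   # normalize scores to [0, 1]
    C = np.stack([s ** 2, (s - 1.0) ** 2], axis=1)    # n x 2 cost to anchors {0, 1}
    mu = np.full(n, 1.0 / n)                          # uniform source marginal
    nu = np.array([(n - k) / n, k / n])               # anchor capacities
    K = np.exp(-C / eps)
    u, v = np.ones(n), np.ones(2)
    for _ in range(n_iters):                          # Sinkhorn fixed-point updates
        u = mu / (K @ v)
        v = nu / (K.T @ u)
    P = u[:, None] * K * v[None, :]                   # entropic transport plan
    return n * P[:, 1]                                # soft top-k indicator

print(np.round(soft_topk([0.1, 3.0, 2.5, -1.0, 0.2], k=2), 3))  # approx [0, 1, 1, 0, 0]

Because every step is differentiable, gradients can flow through the Sinkhorn iterations back to the input scores.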
Narrowing the Gap: Random Forests In Theory and In Practice
Misha Denil, David Matheson, Nando De Freitas
Despite widespread interest and practical use, the theoretical properties of random forests are still not well understood. In this paper we contribute to this understanding in two ways. We present a new theoretically tractable variant of random regression forests and prove that our algorithm is consistent. We also provide an empirical evaluation, comparing our algorithm and other theoretically tractable random forest models to the random forest algorithm used in practice. Our experiments provide insight into the relative importance of different simplifications that theoreticians have made to obtain tractable models for analysis.
Efficient Learning for AlphaZero via Path Consistency
Dengwei Zhao, Shikui Tu, Lei Xu
In recent years, deep reinforcement learning has made great breakthroughs on board games. Still, most of the works require huge computational resources for a large scale of environmental interactions or self-play for the games. This paper aims at building powerful models under a limited amount of self-play, comparable to what a human could study throughout a lifetime. We propose a learning algorithm built on AlphaZero, with its path searching regularised by a path consistency (PC) optimality, i.e., values on one optimal search path should be identical. Thus, the algorithm is shortly named PCZero. In implementation, the historical trajectory and the search paths scouted by MCTS make a good balance between exploration and exploitation, which enhances the generalization ability effectively. On Hex, PCZero obtains a winning rate against the champion of the 2015 Hex Computer Olympiad that is much higher than AlphaZero's, while consuming only a number of self-play games on the order of what a human can study in a lifetime. The improvements by PCZero also generalize to Othello and Gomoku. Experiments further demonstrate the efficiency of PCZero in the offline learning setting.
Infinite latent feature models and the Indian buffet process
Ghahramani, Zoubin, Griffiths, Thomas
We define a probability distribution over equivalence classes of binary matrices with a finite number of rows and an unbounded number of columns. This distribution is suitable for use as a prior in probabilistic models that represent objects using a potentially infinite array of features. We identify a simple generative process that results in the same distribution over equivalence classes, which we call the Indian buffet process. We illustrate the use of this distribution as a prior in an infinite latent feature model, deriving a Markov chain Monte Carlo algorithm for inference in this model and applying the algorithm to an image dataset.
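The generative process is concrete enough to sample directly. A minimal sketch of the "restaurant" construction (customer i takes each previously sampled dish k with probability m_k/i, then tries Poisson(α/i) new dishes); the function and variable names are our own:

import numpy as np

def sample_ibp(alpha, n_customers, seed=0):
    """Sample a binary feature matrix Z from the Indian buffet process."""
    rng = np.random.default_rng(seed)
    counts = []   # counts[k] = number of previous customers who took dish k
    rows = []
    for i in range(1, n_customers + 1):
        row = [int(rng.random() < m / i) for m in counts]  # revisit existing dishes
        for k, z in enumerate(row):
            counts[k] += z
        n_new = rng.poisson(alpha / i)                     # try new dishes
        row += [1] * n_new
        counts += [1] * n_new
        rows.append(row)
    Z = np.zeros((n_customers, len(counts)), dtype=int)
    for i, row in enumerate(rows):
        Z[i, :len(row)] = row
    return Z

print(sample_ibp(alpha=2.0, n_customers=6))  # rows: customers; columns: latent features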
Untangling tradeoffs between recurrence and self-attention in artificial neural networks
Kerg, Giancarlo, Kanuparthi, Bhargav, Goyal, Anirudh, Goyette, Kyle, Bengio, Yoshua, Lajoie, Guillaume
Attention and self-attention mechanisms are now central to state-of-the-art deep learning on sequential tasks. However, most recent progress hinges on heuristic approaches with limited understanding of attention's role in model optimization and computation, and relies on considerable memory and computational resources that scale poorly. In this work, we present a formal analysis of how self-attention affects gradient propagation in recurrent networks, and prove that it mitigates the problem of vanishing gradients when trying to capture long-term dependencies by establishing concrete bounds for gradient norms. Building on these results, we propose a relevancy screening mechanism, inspired by the cognitive process of memory consolidation, that allows for a scalable use of sparse self-attention with recurrence. While providing guarantees to avoid vanishing gradients, we use simple numerical experiments to demonstrate the tradeoffs in performance and computational resources by efficiently balancing attention and recurrence. Based on our results, we propose a concrete direction of research to improve scalability of attentive networks.
De-Anonymizing Text by Fingerprinting Language Generation
Sun, Zhen, Schuster, Roei, Shmatikov, Vitaly
Components of machine learning systems are not (yet) perceived as security hotspots. Secure coding practices, such as ensuring that no execution paths depend on confidential inputs, have not yet been adopted by ML developers. We initiate the study of code security of ML systems by investigating how nucleus sampling, a popular approach for generating text used in applications such as auto-completion, unwittingly leaks texts typed by users. Our main result is that the series of nucleus sizes for many natural English word sequences is a unique fingerprint. We then show how an attacker can infer typed text by measuring these fingerprints via a suitable side channel (e.g., cache access times), explain how this attack could help de-anonymize anonymous texts, and discuss defenses.
Rethinking Optimal Transport in Offline Reinforcement Learning
Asadulaev, Arip, Korst, Rostislav, Korotin, Aleksandr, Egiazarian, Vage, Filchenkov, Andrey, Burnaev, Evgeny
We propose a novel algorithm for offline reinforcement learning using optimal transport. Typically, in offline reinforcement learning, the data is provided by various experts and some of them can be sub-optimal. To extract an efficient policy, it is necessary to stitch the best behaviors from the dataset. To address this problem, we rethink offline reinforcement learning as an optimal transportation problem. Based on this, we present an algorithm that aims to find a policy that maps states to a partial distribution of the best expert actions for each given state. We evaluate the performance of our algorithm on continuous control problems from the D4RL suite and demonstrate improvements over existing methods.
Learning to Generate with Memory
Chongxuan Li, Jun Zhu, Bo Zhang
Memory units have been widely used to enrich the capabilities of deep networks on capturing long-term dependencies in reasoning and prediction tasks, but little investigation exists on deep generative models (DGMs) which are good at inferring high-level invariant representations from unlabeled data. This paper presents a deep generative model with a possibly large external memory and an attention mechanism to capture the local detail information that is often lost in the bottom-up abstraction process in representation learning. By adopting a smooth attention model, the whole network is trained end-to-end by optimizing a variational bound of data likelihood via auto-encoding variational Bayesian methods, where an asymmetric recognition network is learnt jointly to infer high-level invariant representations. The asymmetric architecture can reduce the competition between bottom-up invariant feature extraction and top-down generation of instance details. Our experiments on several datasets demonstrate that memory can significantly boost the performance of DGMs on various tasks, including density estimation, image generation, and missing value imputation, and DGMs with memory can achieve state-of-the-art quantitative results.
Adaptive Annealed Importance Sampling with Constant Rate Progress
Shirin Goshtasbpour, Victor Cohen, Perez Fernando
Annealed Importance Sampling (AIS) synthesizes weighted samples from an intractable distribution given its unnormalized density function. This algorithm relies on a sequence of interpolating distributions bridging the target to an initial tractable distribution, such as the well-known geometric mean path of unnormalized distributions, which is assumed to be suboptimal in general. In this paper, we prove that the geometric annealing corresponds to the distribution path that minimizes the KL divergence between the current particle distribution and the desired target when the feasible change in the particle distribution is constrained. Following this observation, we derive the constant rate discretization schedule for this annealing sequence, which adjusts the schedule to the difficulty of moving samples between the initial and the target distributions. We further extend our results to α-divergences and present the respective dynamics of annealing sequences, based on which we propose the Constant Rate AIS (CR-AIS) algorithm and its efficient implementation for α-divergences. We empirically show that CR-AIS performs well on multiple benchmark distributions while avoiding the computationally expensive tuning loop in existing Adaptive AIS.
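For concreteness, here is a toy AIS run along the geometric path pi_beta ∝ pi0^(1-beta) * pi1^beta with a uniform beta grid, i.e., the baseline schedule that the paper's constant-rate construction replaces with steps of equal divergence movement. The Gaussian endpoints, grid size, and Metropolis kernel are illustrative assumptions.

import numpy as np

def ais_geometric(n=2000, T=60, seed=0):
    """AIS with a uniform annealing grid. Toy setup: pi0 = N(0, 1), pi1 = N(3, 0.5^2)."""
    rng = np.random.default_rng(seed)
    log_p0 = lambda z: -0.5 * z ** 2
    log_p1 = lambda z: -0.5 * ((z - 3.0) / 0.5) ** 2
    betas = np.linspace(0.0, 1.0, T + 1)
    x = rng.standard_normal(n)          # exact samples from pi0
    logw = np.zeros(n)
    for b_prev, b in zip(betas[:-1], betas[1:]):
        # Importance-weight increment from advancing the annealing parameter.
        logw += (b - b_prev) * (log_p1(x) - log_p0(x))
        # One random-walk Metropolis step targeting pi_b to move the particles.
        log_pb = lambda z: (1 - b) * log_p0(z) + b * log_p1(z)
        prop = x + 0.5 * rng.standard_normal(n)
        accept = np.log(rng.random(n)) < log_pb(prop) - log_pb(x)
        x = np.where(accept, prop, x)
    return x, logw

x, logw = ais_geometric()
w = np.exp(logw - logw.max()); w /= w.sum()
print(round(float(np.sum(w * x)), 3))   # weighted mean, approx 3.0 (the target mean)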
Quantification and Analysis of Layer-wise and Pixel-wise Information Discarding
Haotian Ma, Hao Zhang, Fan Zhou, Yinqing Zhang, Quanshi Zhang
This paper presents a method to explain how the information of each input variable is gradually discarded during the forward propagation in a deep neural network (DNN), which provides new perspectives to explain DNNs. We define two types of entropy-based metrics, i.e. (1) the discarding of pixel-wise information used in the forward propagation, and (2) the uncertainty of the input reconstruction, to measure the input information contained by a specific layer from two perspectives. Unlike previous attribution metrics, the proposed metrics ensure the fairness of comparisons between different layers of different DNNs. We can use these metrics to analyze the efficiency of information processing in DNNs, which exhibits strong connections to the performance of DNNs. We analyze information discarding in a pixel-wise manner, which is different from the information bottleneck theory measuring feature information w.r.t. the sample distribution. Experiments have shown the effectiveness of our metrics in analyzing classic DNNs and explaining existing deep-learning techniques. The code is available at https://github.com/haotianSustc/deepinfo.
Does Knowledge Distillation Really Work?
Stanton, Samuel, Izmailov, Pavel, Kirichenko, Polina, Alemi, Alexander A., Wilson, Andrew G.
Knowledge distillation is a popular technique for training a small student network to emulate a larger teacher model, such as an ensemble of networks. We show that while knowledge distillation can improve student generalization, it does not typically work as it is commonly understood: there often remains a surprisingly large discrepancy between the predictive distributions of the teacher and the student, even in cases when the student has the capacity to perfectly match the teacher. We identify difficulties in optimization as a key reason for why the student is unable to match the teacher. We also show how the details of the dataset used for distillation play a role in how closely the student matches the teacher, and that more closely matching the teacher paradoxically does not always lead to better student generalization.
Hierarchical Clustering for Euclidean Data
Moses Charikar, Vaggos Chatziafratis, Rad Niazadeh, Grigory Yaroslavtsev
Recent works on Hierarchical Clustering (HC), a well-studied problem in exploratory data analysis, have focused on optimizing various objective functions for this problem under arbitrary similarity measures. In this paper we take the first step and give novel scalable algorithms for this problem tailored to Euclidean data in R^d and under vector-based similarity measures, a prevalent model in several typical machine learning applications. We focus primarily on the popular Gaussian kernel and present our results through the lens of the objective introduced recently by [MW’17]. We show the approximation factor in [MW’17] can be improved for Euclidean data. We further demonstrate both theoretically and experimentally that our algorithms scale to very high dimension d, while outperforming average-linkage and showing competitive results against other less scalable approaches.
Two steps to risk sensitivity
Gagne, Christopher, Dayan, Peter
Distributional reinforcement learning (RL) – in which agents learn about all the possible long-term consequences of their actions, and not just the expected value – is of great recent interest. One of the most important affordances of a distributional view is facilitating a modern, measured, approach to risk when outcomes are not completely certain. By contrast, psychological and neuroscientific investigations into decision making under risk have utilized a variety of more venerable theoretical models such as prospect theory that lack axiomatically desirable properties such as coherence. Here, we consider a particularly relevant risk measure for modeling human and animal planning, called conditional value-at-risk (CVaR), which quantifies worst-case outcomes (e.g., vehicle accidents or predation). We first adopt a conventional distributional approach to CVaR in a sequential setting and reanalyze the choices of human decision-makers in the well-known two-step task, revealing substantial risk aversion that had been lurking under stickiness and perseveration. We then consider a further critical property of risk sensitivity, namely time consistency, showing alternatives to this form of CVaR that enjoy this desirable characteristic. We use simulations to examine settings in which the various forms differ in ways that have implications for human and animal planning and behavior.
Deep Supervised and Convolutional Generative Stochastic Network for Protein Secondary Structure Prediction
Jian Zhou, Olga Troyanskaya
Predicting protein secondary structure is a fundamental problem in protein structure prediction. Here we present a new supervised generative stochastic network (GSN) based method to predict local secondary structure with deep hierarchical representations. GSN is a recently proposed deep learning technique (Bengio & Thibodeau-Laufer, 2013) to globally train deep generative models. We present the supervised extension of GSN, which learns a Markov chain to sample from a conditional distribution, and apply it to protein structure prediction. To scale the model to full-sized, high-dimensional data, like protein sequences with hundreds of amino acids, we introduce a convolutional architecture, which allows efficient learning across multiple layers of hierarchical representations. Our architecture uniquely focuses on predicting structured low-level labels informed with both low- and high-level representations learned by the model. In our application this corresponds to labeling the secondary structure state of each amino-acid residue. We trained and tested the model on separate sets of non-homologous proteins sharing less than 30% sequence identity. Our model achieves 66.4% Q8 accuracy on the CB513 dataset, better than the previously reported best performance of 64.9% (Wang et al., 2011) for this challenging secondary structure prediction problem.
Towards Understanding Extrapolation: a Causal Lens
Kong, Lingjing, Chen, Guangyi, Stojanov, Petar, Li, Haoxuan, Xing, Eric, Zhang, Kun
Canonical work handling distribution shifts typically necessitates an entire target distribution that lands inside the training distribution. However, practical scenarios often involve only a handful of target samples, potentially lying outside the training support, which requires the capability of extrapolation. In this work, we aim to provide a theoretical understanding of when extrapolation is possible and offer principled methods to achieve it without requiring an on-support target distribution. To this end, we formulate the extrapolation problem with a latent-variable model that embodies the minimal change principle in causal mechanisms. Under this formulation, we cast the extrapolation problem into a latent-variable identification problem. We provide realistic conditions on shift properties and the estimation objectives that lead to identification even when only one off-support target sample is available, tackling the most challenging scenarios. Our theory reveals the intricate interplay between the underlying manifold's smoothness and the shift properties. We showcase how our theoretical results inform the design of practical adaptation algorithms. Through experiments on both synthetic and real-world data, we validate our theoretical findings and their practical implications.
Simultaneously Learning Stochastic and Adversarial Bandits with General Graph Feedback
Fang Kong, Yichi Zhou, Shuai Li
The problem of online learning with graph feedback has been extensively studied in the literature due to its generality and potential to model various learning tasks. Existing works mainly study the adversarial and stochastic feedback separately. If the prior knowledge of the feedback mechanism is unavailable or wrong, such specially designed algorithms could suffer great loss. To avoid this problem, Erez and Koren (2021) try to optimize for both environments. However, they assume the feedback graphs are undirected and that each vertex has a self-loop, which compromises the generality of the framework and may not be satisfied in applications. With a general feedback graph, the observation of an arm may not be available when this arm is pulled, which makes the exploration more expensive and the algorithms more challenging to perform optimally in both environments. In this work, we overcome this difficulty with a new trade-off mechanism using a carefully designed proportion for exploration and exploitation. We prove the proposed algorithm simultaneously achieves poly(log T) regret in the stochastic setting and the minimax-optimal regret of Õ(T^{2/3}) in the adversarial setting, where T is the horizon and Õ hides parameters independent of T as well as logarithmic terms. To our knowledge, this is the first best-of-both-worlds result for general feedback graphs.
Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control
Gupta, Gunshi, Yadav, Karmesh, Gal, Yarin, Batra, Dhruv, Kira, Zsolt, Lu, Cong, Rudner, Tim G. J.
Embodied AI agents require a fine-grained understanding of the physical world mediated through visual and language inputs. Such capabilities are difficult to learn solely from task-specific data. This has led to the emergence of pre-trained vision-language models as a tool for transferring representations learned from internet-scale data to downstream tasks and new domains. However, commonly used contrastively trained representations such as in CLIP have been shown to fail at enabling embodied agents to gain a sufficiently fine-grained scene understanding—a capability vital for control. To address this shortcoming, we consider representations from pre-trained text-to-image diffusion models, which are explicitly optimized to generate images from text prompts and as such, contain text-conditioned representations that reflect highly fine-grained visuo-spatial information. Using pre-trained text-to-image diffusion models, we construct Stable Control Representations which allow learning downstream control policies that generalize to complex, open-ended environments. We show that policies learned using Stable Control Representations are competitive with state-of-the-art representation learning approaches across a broad range of simulated control settings, encompassing challenging manipulation and navigation tasks. Most notably, we show that Stable Control Representations enable learning policies that exhibit state-of-the-art performance on OVMM, a difficult open-vocabulary navigation benchmark.
Unity by Diversity: Improved Representation Learning for Multimodal VAEs
Sutter, Thomas, Meng, Yang, Agostini, Andrea, Chopard, Daphné, Fortin, Norbert, Vogt, Julia, Shahbaba, Babak, Mandt, Stephan
Variational Autoencoders for multimodal data hold promise for many tasks in data analysis, such as representation learning, conditional generation, and imputation. Current architectures either share the encoder output, decoder input, or both across modalities to learn a shared representation. Such architectures impose hard constraints on the model. In this work, we show that a better latent representation can be obtained by replacing these hard constraints with a soft constraint. We propose a new mixture-of-experts prior, softly guiding each modality's latent representation towards a shared aggregate posterior. This approach results in a superior latent representation and allows each encoding to better preserve information from its uncompressed original features. In extensive experiments on multiple benchmark datasets and two challenging real-world datasets, we show improved learned latent representations and imputation of missing data modalities compared to existing methods.
Accurate Point Cloud Registration with Robust Optimal Transport
Shen, Zhengyang, Feydy, Jean, Liu, Peirong, Curiale, Ariel H, San Jose Estepar, Ruben, San Jose Estepar, Raul, Niethammer, Marc
This work investigates the use of robust optimal transport (OT) for shape matching. Specifically, we show that recent OT solvers improve both optimization-based and deep learning methods for point cloud registration, boosting accuracy at an affordable computational cost. This manuscript starts with a practical overview of modern OT theory. We then provide solutions to the main difficulties in using this framework for shape matching. Finally, we showcase the performance of transport-enhanced registration models on a wide range of challenging tasks: rigid registration for partial shapes; scene flow estimation on the Kitti dataset; and nonparametric registration of lung vascular trees between inspiration and expiration. Our OT-based methods achieve state-of-the-art results on Kitti and for the challenging lung registration task, both in terms of accuracy and scalability. We also release PVT1010, a new public dataset of 1,010 pairs of lung vascular trees with densely sampled points. This dataset provides a challenging use case for point cloud registration algorithms with highly complex shapes and deformations. Our work demonstrates that robust OT enables fast pre-alignment and fine-tuning for a wide range of registration models, thereby providing a new key method for the computer vision toolbox. Our code and dataset are available online at: https://github.com/uncbiag/robot.
On the Learnability of Multilabel Ranking
Raman, Vinod, SUBEDI, UNIQUE, Tewari, Ambuj
Multilabel ranking is a central task in machine learning. However, the most fundamental question of learnability in a multilabel ranking setting with relevance-score feedback remains unanswered. In this work, we characterize the learnability of multilabel ranking problems in both batch and online settings for a large family of ranking losses. Along the way, we give two equivalence classes of ranking losses based on learnability that capture most losses used in practice.
A Limitation of the PAC-Bayes Framework
Livni, Roi, Moran, Shay
PAC-Bayes is a useful framework for deriving generalization bounds which was introduced by McAllester ('98). This framework has the flexibility of deriving distribution- and algorithm-dependent bounds, which are often tighter than VC-related uniform convergence bounds. In this manuscript we present a limitation of the PAC-Bayes framework. We demonstrate an easy learning task which is not amenable to a PAC-Bayes analysis. Specifically, we consider the task of linear classification in 1D; it is well-known that this task is learnable using just O(log(1/δ)/ε) examples. On the other hand, we show that this fact cannot be proved using a PAC-Bayes analysis: for any algorithm that learns 1-dimensional linear classifiers there exists a (realizable) distribution for which the PAC-Bayes bound is arbitrarily large.
An Efficient Approach for Assessing Hyperparameter Importance
Frank Hutter, Holger Hoos, Kevin Leyton-Brown
The performance of many machine learning methods depends critically on hyperparameter settings. Sophisticated Bayesian optimization methods have recently achieved considerable successes in optimizing these hyperparameters, in several cases surpassing the performance of human experts. However, blind reliance on such methods can leave end users without insight into the relative importance of different hyperparameters and their interactions. This paper describes efficient methods that can be used to gain such insight, leveraging random forest models fit on the data already gathered by Bayesian optimization. We first introduce a novel, linear-time algorithm for computing marginals of random forest predictions and then show how to leverage these predictions within a functional ANOVA framework, to quantify the importance of both single hyperparameters and of interactions between hyperparameters. We conducted experiments with prominent machine learning frameworks and state-of-the-art solvers for combinatorial problems. We show that our methods provide insight into the relationship between hyperparameter settings and performance, and demonstrate that—even in very high-dimensional cases—most performance variation is attributable to just a few hyperparameters.
GDeR: Safeguarding Efficiency, Balancing, and Robustness via Prototypical Graph Pruning
Zhang, Guibin, Dong, Haonan, Zhang, Yuchen, Li, Zhixun, Chen, Dingshuo, Wang, Kai, Chen, Tianlong, Liang, Yuxuan, Cheng, Dawei, Wang, Kun
Training high-quality deep models necessitates vast amounts of data, resulting in overwhelming computational and memory demands. Recently, data pruning, distillation, and coreset selection have been developed to streamline data volume by retaining, synthesizing, or selecting a small yet informative subset from the full set. Among these methods, data pruning incurs the least additional training cost and offers the most practical acceleration benefits. However, it is the most vulnerable, often suffering significant performance degradation with imbalanced or biased data schema, thus raising concerns about its accuracy and reliability in on-device deployment. Therefore, there is a looming need for a new data pruning paradigm that maintains the efficiency of previous practices while ensuring balance and robustness. Unlike the fields of computer vision and natural language processing, where mature solutions have been developed to address these issues, graph neural networks (GNNs) continue to struggle with increasingly large-scale, imbalanced, and noisy datasets, lacking a unified dataset pruning solution. To achieve this, we introduce GDeR, a novel dynamic soft-pruning method designed to update the training "basket" during the process using trainable prototypes. GDeR first constructs a well-modeled graph embedding hypersphere and then samples representative, balanced, and unbiased subsets from this embedding space, which achieves the goal we call Graph Training Debugging. Extensive experiments on four datasets across three GNN backbones demonstrate that GDeR (I) achieves or surpasses the performance of the full dataset with fewer training samples, (II) attains a lossless training speedup, and (III) outperforms state-of-the-art pruning methods in both imbalanced and noisy training scenarios.
What is being transferred in transfer learning?
Neyshabur, Behnam, Sedghi, Hanie, Zhang, Chiyuan
One desired capability for machines is the ability to transfer their understanding of one domain to another domain where data is (usually) scarce. Despite the wide adoption of transfer learning in many deep learning applications, we still do not understand what enables a successful transfer and which part of the network is responsible for that. In this paper, we provide new tools and analysis to address these fundamental questions. Through a series of analyses on transferring to block-shuffled images, we separate the effect of feature reuse from that of learning high-level statistics of data and show that some benefit of transfer learning comes from the latter. We show that when training from pre-trained weights, the model stays in the same basin in the loss landscape, and different instances of such a model are similar in feature space and close in parameter space.
Recovery Analysis for Plug-and-Play Priors using the Restricted Eigenvalue Condition
Liu, Jiaming, Asif, Salman, Wohlberg, Brendt, Kamilov, Ulugbek
The plug-and-play priors (PnP) and regularization by denoising (RED) methods have become widely used for solving inverse problems by leveraging pre-trained deep denoisers as image priors. While the empirical imaging performance and the theoretical convergence properties of these algorithms have been widely investigated, their recovery properties have not previously been theoretically analyzed. We address this gap by showing how to establish theoretical recovery guarantees for PnP/RED by assuming that the solution of these methods lies near the fixed-points of a deep neural network. We also present numerical results comparing the recovery performance of PnP/RED in compressive sensing against that of recent compressive sensing algorithms based on generative models. Our numerical results suggest that PnP with a pre-trained artifact removal network provides significantly better results compared to the existing state-of-the-art methods.
Retiring Adult: New Datasets for Fair Machine Learning
Ding, Frances, Hardt, Moritz, Miller, John, Schmidt, Ludwig
Although the fairness community has recognized the importance of data, researchers in the area primarily rely on UCI Adult when it comes to tabular data. Derived from a 1994 US Census survey, this dataset has appeared in hundreds of research papers where it served as the basis for the development and comparison of many algorithmic fairness interventions. We reconstruct a superset of the UCI Adult data from available US Census sources and reveal idiosyncrasies of the UCI Adult dataset that limit its external validity. Our primary contribution is a suite of new datasets derived from US Census surveys that extend the existing data ecosystem for research on fair machine learning. We create prediction tasks relating to income, employment, health, transportation, and housing. The data span multiple years and all states of the United States, allowing researchers to study temporal shift and geographic variation. We highlight a broad initial sweep of new empirical insights relating to trade-offs between fairness criteria, performance of algorithmic interventions, and the role of distribution shift based on our new datasets. Our findings inform ongoing debates, challenge some existing narratives, and point to future research directions.
Inverse Reinforcement Learning in a Continuous State Space with Formal Guarantees
Dexter, Gregory, Bello, Kevin, Honorio, Jean
Inverse Reinforcement Learning (IRL) is the problem of finding a reward function which describes observed/known expert behavior. The IRL setting is remarkably useful for automated control, in situations where the reward function is difficult to specify manually or as a means to extract agent preference. In this work, we provide a new IRL algorithm for the continuous state space setting with unknown transition dynamics by modeling the system using a basis of orthonormal functions. Moreover, we provide a proof of correctness and formal guarantees on the sample and time complexity of our algorithm. Finally, we present synthetic experiments to corroborate our theoretical guarantees.
Generative Cooperative Networks for Natural Language Generation
Sylvain Lamprier, Thomas Scialom, Antoine Chaffin, Vincent Claveau, Ewa Kijak, Jacopo Staiano, Benjamin Piwowarski
Generative Adversarial Networks (GANs) have achieved tremendous success in many continuous generation tasks, especially in the field of image generation. However, for discrete outputs such as language, optimizing GANs remains an open problem with many instabilities, as no gradient can be properly back-propagated from the discriminator output to the generator parameters. An alternative is to learn the generator network via reinforcement learning, using the discriminator signal as a reward, but such a technique suffers from moving rewards and vanishing gradient problems, and it often falls short compared to direct maximum-likelihood approaches. In this paper, we introduce Generative Cooperative Networks, in which the discriminator architecture is cooperatively used along with the generation policy to output samples of realistic texts for the task at hand. We give theoretical guarantees of convergence for our approach, and study various efficient decoding schemes to empirically achieve state-of-the-art results in two main NLG tasks.
Conformalization of Sparse Generalized Linear Models
Etash Guha, Eugene Ndiaye, Xiaoming Huo
Given a sequence of observable variables (x_1, y_1), …, (x_n, y_n), the conformal prediction method estimates a confidence set for y_{n+1} given x_{n+1} that is valid for any finite sample size by merely assuming that the joint distribution of the data is permutation invariant. Although attractive, computing such a set is computationally infeasible in most regression problems. Indeed, in these cases, the unknown variable y_{n+1} can take an infinite number of possible candidate values, and generating conformal sets requires retraining a predictive model for each candidate. In this paper, we focus on a sparse linear model with only a subset of variables for prediction and use numerical continuation techniques to approximate the solution path efficiently. The critical property we exploit is that the set of selected variables is invariant under a small perturbation of the input data. Therefore, it is sufficient to enumerate and refit the model only at the change points of the set of active features and smoothly interpolate the rest of the solution via a Predictor-Corrector mechanism. We show how our path-following algorithm accurately approximates conformal prediction sets and illustrate its performance using synthetic and real data examples.
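As a reference point, the brute-force full conformal procedure that the paper accelerates can be written directly: refit the lasso for every candidate y and keep those whose residual rank is not extreme. The paper's contribution is to avoid this loop by refitting only at the change points of the active set and interpolating in between. The grid, regularization level, and toy data below are illustrative assumptions.

import numpy as np
from sklearn.linear_model import Lasso

def full_conformal_lasso(X, y, x_new, y_grid, miscoverage=0.1, lam=0.1):
    """Naive full conformal set for a lasso: one refit per candidate value."""
    keep = []
    for y_cand in y_grid:
        Xa = np.vstack([X, x_new])            # augment data with the candidate point
        ya = np.append(y, y_cand)
        model = Lasso(alpha=lam).fit(Xa, ya)
        res = np.abs(ya - model.predict(Xa))
        # Conformal p-value: rank of the candidate's residual among all residuals.
        pval = np.mean(res >= res[-1])
        if pval > miscoverage:
            keep.append(y_cand)
    return (min(keep), max(keep)) if keep else None

rng = np.random.default_rng(0)
X = rng.standard_normal((50, 5))
beta = np.array([1.0, 0.0, 0.0, 2.0, 0.0])
y = X @ beta + 0.3 * rng.standard_normal(50)
x_new = rng.standard_normal(5)
print(full_conformal_lasso(X, y, x_new, np.linspace(-6, 6, 121)))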
Multi-Task Temporal Shift Attention Networks for On-Device Contactless Vitals Measurement
Liu, Xin, Fromm, Josh, Patel, Shwetak, McDuff, Daniel
Telehealth and remote health monitoring have become increasingly important during the SARS-CoV-2 pandemic and it is widely expected that this will have a lasting impact on healthcare practices. These tools can help reduce the risk of exposing patients and medical staff to infection, make healthcare services more accessible, and allow providers to see more patients. However, objective measurement of vital signs is challenging without direct contact with a patient. We present a video-based and on-device optical cardiopulmonary vital sign measurement approach. It leverages a novel multi-task temporal shift convolutional attention network (MTTS-CAN) and enables real-time cardiovascular and respiratory measurements on mobile platforms. We evaluate our system on an Advanced RISC Machine (ARM) CPU and achieve state-of-the-art accuracy while running at over 150 frames per second which enables real-time applications. Systematic experimentation on large benchmark datasets reveals that our approach leads to substantial (20%-50%) reductions in error and generalizes well across datasets.
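The temporal shift at the core of the architecture is a zero-parameter, zero-FLOP operation. A minimal sketch follows; the fraction of shifted channels and the tensor layout are illustrative, and MTTS-CAN's attention and multi-task heads are omitted.

import numpy as np

def temporal_shift(x, fold_div=8):
    """Shift a fraction of channels one frame forward/backward so that a plain
    2-D convolution applied per frame can mix information across time.
    x: (T, C, H, W) video tensor."""
    T, C, H, W = x.shape
    fold = C // fold_div
    out = np.zeros_like(x)
    out[:-1, :fold] = x[1:, :fold]                 # shift future frames backward
    out[1:, fold:2 * fold] = x[:-1, fold:2 * fold]  # shift past frames forward
    out[:, 2 * fold:] = x[:, 2 * fold:]             # leave remaining channels untouched
    return out

x = np.arange(2 * 4, dtype=float).reshape(2, 4, 1, 1)  # T=2 frames, C=4 channels
print(temporal_shift(x, fold_div=4).squeeze())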
Neural Networks Structured for Control Application to Aircraft Landing
Schley, Charles, Chauvin, Yves, Henkle, Van, Golden, Richard
We present a generic neural network architecture capable of controlling non-linear plants. The network is composed of dynamic, parallel, linear maps gated by non-linear switches. Using a recurrent form of the back-propagation algorithm, control is achieved by optimizing the control gains and task-adapted switch parameters. A mean quadratic cost function computed across a nominal plant trajectory is minimized along with performance constraint penalties. The approach is demonstrated for a control task consisting of landing a commercial aircraft in difficult wind conditions. We show that the network yields excellent performance while remaining within acceptable damping response constraints.
FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs
Agarwal, Alekh, Kakade, Sham, Krishnamurthy, Akshay, Sun, Wen
In order to deal with the curse of dimensionality in reinforcement learning (RL), it is common practice to make parametric assumptions where values or policies are functions of some low dimensional feature space. This work focuses on the representation learning question: how can we learn such features? Under the assumption that the underlying (unknown) dynamics correspond to a low rank transition matrix, we show how the representation learning question is related to a particular non-linear matrix decomposition problem. Structurally, we make precise connections between these low rank MDPs and latent variable models, showing how they significantly generalize prior formulations, such as block MDPs, for representation learning in RL. Algorithmically, we develop FLAMBE, which engages in exploration and representation learning for provably efficient RL in low rank transition models. On a technical level, our analysis eliminates reachability assumptions that appear in prior results on the simpler block MDP model and may be of independent interest.
Modeling Complex Cells in an Awake Macaque during Natural Image Viewing
Vinje, William, Gallant, Jack
We model the responses of cells in visual area V1 during natural vision. Our model consists of a classical energy mechanism whose output is divided by nonclassical gain control and texture contrast mechanisms. We apply this model to review movies, a stimulus sequence that replicates the stimulation a cell receives during free viewing of natural images. Data were collected from three cells using five different review movies, and the model was fit separately to the data from each movie. For the energy mechanism alone we find modest but significant correlations (rE = 0.41, 0.43, 0.59, 0.35) between model and data. These correlations are improved somewhat when we allow for suppressive surround effects (rE+G = 0.42, 0.56, 0.60, 0.37). In one case the inclusion of a delayed suppressive surround dramatically improves the fit to the data by modifying the time course of the model's response.
Understanding Representation of Deep Equilibrium Models from Neural Collapse Perspective
Sun, Haixiang, Shi, Ye
The Deep Equilibrium Model (DEQ), a typical implicit neural network, is notable for its memory efficiency and competitive performance compared to explicit neural networks. However, there has been relatively limited theoretical analysis of the representation of DEQ. In this paper, we utilize Neural Collapse (NC) as a tool to systematically analyze the representation of DEQ under both balanced and imbalanced conditions. NC is an interesting phenomenon in the neural network training process that characterizes the geometry of class features and classifier weights. While extensively studied in traditional explicit neural networks, the NC phenomenon has not received substantial attention in the context of implicit neural networks. We theoretically show that NC exists in DEQ under balanced conditions. Moreover, in imbalanced settings, despite the presence of minority collapse, DEQ demonstrates advantages over explicit neural networks. These advantages include the convergence of extracted features to the vertices of a simplex equiangular tight frame and self-duality properties under mild conditions, highlighting DEQ's superiority in handling imbalanced datasets. Finally, we validate our theoretical analyses through experiments in both balanced and imbalanced scenarios.
Deep Variational Graph Convolutional Recurrent Network for Multivariate Time Series Anomaly Detection
Wenchao Chen, Long Tian, Bo Chen, Liang Dai, Zhibin Duan, Mingyuan Zhou
Anomaly detection within multivariate time series (MTS) is an essential task in both data mining and service quality management. Many recent works on anomaly detection focus on designing unsupervised probabilistic models to extract robust normal patterns of MTS. In this paper, we model sensor dependency and stochasticity within MTS by developing an embedding-guided probabilistic generative network. We combine it with an adaptive variational graph convolutional recurrent network (VGCRN) to model both spatial and temporal fine-grained correlations in MTS. To explore hierarchical latent representations, we further extend VGCRN into a deep variational network, which captures multilevel information at different layers and is robust to noisy time series. Moreover, we develop an upward-downward variational inference scheme that considers both forecasting-based and reconstruction-based losses, achieving an accurate posterior approximation of latent variables with better MTS representations. The experiments verify the superiority of the proposed method over current state-of-the-art methods.
Frank-Wolfe with Subsampling Oracle
Thomas Kerdreux, Fabian Pedregosa, Alexandre d’Aspremont
We analyze two novel randomized variants of the Frank-Wolfe (FW) or conditional gradient algorithm. While classical FW algorithms require solving a linear minimization problem over the domain at each iteration, the proposed method only requires solving a linear minimization problem over a small subset of the original domain.
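A minimal sketch of the idea on a polytope given by an explicit list of atoms: the per-iteration linear minimization scans only a random fraction of the vertices. The sampled fraction, step size, and toy objective are illustrative assumptions, not the paper's exact variants.

import numpy as np

def fw_subsampled(grad, x0, vertices, n_iters=200, p=0.1, seed=0):
    """Frank-Wolfe where each linear minimization oracle call is solved over a
    random subsample of the atoms instead of the full domain."""
    rng = np.random.default_rng(seed)
    x = x0.copy()
    m = len(vertices)
    for t in range(n_iters):
        g = grad(x)
        idx = rng.choice(m, size=max(1, int(p * m)), replace=False)
        # LMO over the subsample: the sampled vertex most aligned with -g.
        s = vertices[idx[np.argmin(vertices[idx] @ g)]]
        x += 2.0 / (t + 2) * (s - x)   # standard FW step size
    return x

# Minimize ||x - b||^2 over the probability simplex (vertices e_1, ..., e_d).
d = 20
b = np.abs(np.random.default_rng(1).standard_normal(d)); b /= b.sum()
grad = lambda x: 2 * (x - b)
x = fw_subsampled(grad, np.full(d, 1.0 / d), np.eye(d))
print(np.round(np.abs(x - b).sum(), 3))   # small residual: x approaches b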
Directed Acyclic Transformer for Non-Autoregressive Machine Translation
Fei Huang, Hao Zhou, Yang Liu, Hang Li, Minlie Huang
Non-autoregressive Transformers (NATs) significantly reduce the decoding latency by generating all tokens in parallel. However, such independent predictions prevent NATs from capturing the dependencies between the tokens for generating multiple possible translations. In this paper, we propose the Directed Acyclic Transformer (DA-Transformer), which represents the hidden states in a Directed Acyclic Graph (DAG), where each path of the DAG corresponds to a specific translation. The whole DAG simultaneously captures multiple translations and facilitates fast predictions in a non-autoregressive fashion. Experiments on the raw training data of the WMT benchmark show that DA-Transformer substantially outperforms previous NATs by about 3 BLEU on average, making it the first NAT model that achieves competitive results with autoregressive Transformers without relying on knowledge distillation.
Latent Template Induction with Gumbel-CRFs
Fu, Yao, Tan, Chuanqi, Bi, Bin, Chen, Mosha, Feng, Yansong, Rush, Alexander
Learning to control the structure of sentences is a challenging problem in text generation. Existing work either relies on simple deterministic approaches or RL-based hard structures. We explore the use of structured variational autoencoders to infer latent templates for sentence generation using a soft, continuous relaxation in order to utilize reparameterization for training. Specifically, we propose a Gumbel-CRF, a continuous relaxation of the CRF sampling algorithm using a relaxed Forward-Filtering Backward-Sampling (FFBS) approach. As a reparameterized gradient estimator, the Gumbel-CRF gives more stable gradients than score-function based estimators. As a structured inference network, we show that it learns interpretable templates during training, which allows us to control the decoder during testing. We demonstrate the effectiveness of our methods with experiments on data-to-text generation and unsupervised paraphrase generation.
Bangs, Clicks, Snaps, Thuds and Whacks: An Architecture for Acoustic Transient Processing
Pineda, Fernando, Cauwenberghs, Gert, Edwards, R.
We propose a neuromorphic architecture for real-time processing of acoustic transients in analog VLSI. We show how judicious normalization of a time-frequency signal allows an elegant and robust implementation of a correlation algorithm. The algorithm uses binary multiplexing instead of analog-analog multiplication. This removes the need for analog storage and analog multiplication. Simulations show that the resulting algorithm has the same out-of-sample classification performance (~93% correct) as a baseline template-matching algorithm.
Robust Principal Component Analysis with Complex Noise
Qian Zhao, Deyu Meng, Zongben Xu, Wangmeng Zuo, Lei Zhang
The research on robust principal component analysis (RPCA) has been attracting much attention recently. The original RPCA model assumes sparse noise and uses the L_1-norm to characterize the error term. In practice, however, the noise is much more complex and it is not appropriate to simply use a certain L_p-norm for noise modeling. We propose a generative RPCA model under the Bayesian framework by modeling data noise as a mixture of Gaussians (MoG). The MoG is a universal approximator to continuous distributions and thus our model is able to fit a wide range of noises such as Laplacian, Gaussian, sparse noises and any combinations of them. A variational Bayes algorithm is presented to infer the posterior of the proposed model. All involved parameters can be recursively updated in closed form. The advantage of our method is demonstrated by extensive experiments on synthetic data, face modeling and background subtraction.
Set Based Stochastic Subsampling
Bruno Andreis, Seanie Lee, A. Tuan Nguyen, Juho Lee, Eunho Yang, Sung Ju Hwang
Deep models are designed to operate on huge volumes of high-dimensional data such as images. In order to reduce the volume of data these models must process, we propose a set-based two-stage end-to-end neural subsampling model that is jointly optimized with an arbitrary downstream task network (e.g. a classifier). In the first stage, we efficiently subsample candidate elements using conditionally independent Bernoulli random variables, capturing coarse-grained global information with set encoding functions; in the second stage, we perform conditionally dependent autoregressive subsampling of the candidate elements using Categorical random variables, modeling pair-wise interactions with set attention networks. We apply our method to feature and instance selection and show that it outperforms the relevant baselines under low subsampling rates on a variety of tasks including image classification, image reconstruction, function reconstruction and few-shot classification. Additionally, for nonparametric models such as Neural Processes that need to leverage the whole training data at inference time, we show that our method enhances the scalability of these models.
An Equivalence Between Data Poisoning and Byzantine Gradient Attacks
Sadegh Farhadkhani, Rachid Guerraoui, Lê-Nguyên Hoang, Oscar Villemaud
To study the resilience of distributed learning, the "Byzantine" literature considers a strong threat model where workers can report arbitrary gradients to the parameter server. Whereas this model helped obtain several fundamental results, it has sometimes been considered unrealistic, when the workers are mostly trustworthy machines. In this paper, we show a surprising equivalence between this model and data poisoning, a threat considered much more realistic. More specifically, we prove that every gradient attack can be reduced to data poisoning, in any personalized federated learning system with PAC guarantees (which we show are both desirable and realistic). This equivalence makes it possible to obtain new impossibility results on the resilience of any "robust" learning algorithm to data poisoning in highly heterogeneous applications, as corollaries of existing impossibility theorems on Byzantine machine learning. Moreover, using our equivalence, we derive a practical attack that we show (theoretically and empirically) can be very effective against classical personalized federated learning models.
A Bayesian-Symbolic Approach to Reasoning and Learning in Intuitive Physics
Xu, Kai, Srivastava, Akash, Gutfreund, Dan, Sosa, Felix, Ullman, Tomer, Tenenbaum, Josh, Sutton, Charles
Humans can reason about intuitive physics in fully or partially observed environments even after being exposed to a very limited set of observations. This sample-efficient intuitive physical reasoning is considered a core domain of human common sense knowledge. One hypothesis to explain this remarkable capacity posits that humans quickly learn approximations to the laws of physics that govern the dynamics of the environment. In this paper, we propose a Bayesian-symbolic framework (BSP) for physical reasoning and learning that is close to human-level sample-efficiency and accuracy. In BSP, the environment is represented by a top-down generative model of entities, which are assumed to interact with each other under unknown force laws over their latent and observed properties. BSP models each of these entities as random variables, and uses Bayesian inference to estimate their unknown properties. For learning the unknown forces, BSP leverages symbolic regression on a novel grammar of Newtonian physics in a bilevel optimization setup. These inference and regression steps are performed in an iterative manner using expectation-maximization, allowing BSP to simultaneously learn force laws while maintaining uncertainty over entity properties. We show that BSP is more sample-efficient compared to neural alternatives on controlled synthetic datasets, demonstrate BSP's applicability to real-world common sense scenes and study BSP's performance on tasks previously used to study human physical reasoning.
Nesting Particle Filters for Experimental Design in Dynamical Systems
Sahel Iqbal, Adrien Corenflos, Simo Särkkä, Hany Abdulsamad
In this paper, we propose a novel approach to Bayesian experimental design for non-exchangeable data that formulates it as risk-sensitive policy optimization. We develop the Inside-Out SMC algorithm, a nested sequential Monte Carlo technique to infer optimal designs, and embed it into a particle Markov chain Monte Carlo framework to perform gradient-based policy amortization. Our approach is distinct from other amortized experimental design techniques, as it does not rely on contrastive estimators. Numerical validation on a set of dynamical systems showcases the efficacy of our method in comparison to other state-of-the-art strategies.
Modern Hopfield Networks and Attention for Immune Repertoire Classification
Widrich, Michael, Schäfl, Bernhard, Pavlović, Milena, Ramsauer, Hubert, Gruber, Lukas, Holzleitner, Markus, Brandstetter, Johannes, Sandve, Geir Kjetil, Greiff, Victor, Hochreiter, Sepp, Klambauer, Günter
A central mechanism in machine learning is to identify, store, and recognize patterns. How to learn, access, and retrieve such patterns is crucial in Hopfield networks and the more recent transformer architectures. We show that the attention mechanism of transformer architectures is actually the update rule of modern Hopfield networks that can store exponentially many patterns. We exploit this high storage capacity of modern Hopfield networks to solve a challenging multiple instance learning (MIL) problem in computational biology: immune repertoire classification. In immune repertoire classification, a vast number of immune receptors are used to predict the immune status of an individual. This constitutes a MIL problem with an unprecedentedly massive number of instances, two orders of magnitude larger than currently considered problems, and with an extremely low witness rate. Accurate and interpretable machine learning methods solving this problem could pave the way towards new vaccines and therapies, which is currently a very relevant research topic intensified by the COVID-19 crisis. In this work, we present our novel method DeepRC that integrates transformer-like attention, or equivalently modern Hopfield networks, into deep learning architectures for massive MIL such as immune repertoire classification. We demonstrate that DeepRC outperforms all other methods with respect to predictive performance on large-scale experiments including simulated and real-world virus infection data and enables the extraction of sequence motifs that are connected to a given disease class. Source code and datasets: https://github.com/ml-jku/DeepRC
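The stated correspondence is easy to check in code: one update of a modern (continuous) Hopfield network, xi <- X softmax(beta * X^T xi), is exactly attention with the stored patterns X as keys and values and the state xi as query. A minimal retrieval sketch, with illustrative dimensions and inverse temperature beta:

import numpy as np

def hopfield_retrieve(X, xi, beta=8.0, n_steps=3):
    """Iterate the modern Hopfield update; each step is one attention pass."""
    for _ in range(n_steps):
        a = beta * (X.T @ xi)                    # similarity to stored patterns
        a = np.exp(a - a.max()); a /= a.sum()    # softmax over stored patterns
        xi = X @ a                               # attention-weighted readout
    return xi

rng = np.random.default_rng(0)
X = rng.standard_normal((64, 10))                    # 10 stored patterns, dim 64
query = X[:, 3] + 0.4 * rng.standard_normal(64)      # noisy version of pattern 3
out = hopfield_retrieve(X, query)
print(np.argmax(X.T @ out))                          # 3: the stored pattern is retrieved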
Why Normalizing Flows Fail to Detect Out-of-Distribution Data
Kirichenko, Polina, Izmailov, Pavel, Wilson, Andrew G.
Detecting out-of-distribution (OOD) data is crucial for robust machine learning systems. Normalizing flows are flexible deep generative models that often surprisingly fail to distinguish between in- and out-of-distribution data: a flow trained on pictures of clothing assigns higher likelihood to handwritten digits. We investigate why normalizing flows perform poorly for OOD detection. We demonstrate that flows learn local pixel correlations and generic image-to-latent-space transformations which are not specific to the target image datasets, focusing on flows based on coupling layers. We show that by modifying the architecture of flow coupling layers we can bias the flow towards learning the semantic structure of the target data, improving OOD detection. Our investigation reveals that properties that enable flows to generate high-fidelity images can have a detrimental effect on OOD detection.