Inferensys

Glossary

Communication Rounds

Communication Rounds are the fundamental iterative cycles in federated learning where a central server coordinates model training across distributed clients without sharing raw data.
Data scientist building training data pipeline on laptop, data preprocessing visible, technical workspace.
FEDERATED LEARNING

What is Communication Rounds?

A core iterative cycle in federated learning where a global model is collaboratively improved across distributed devices without centralizing raw data.

In federated learning, a communication round is the fundamental iterative cycle where a central server coordinates the collaborative training of a global model across a distributed network of clients. Each round consists of three phases: the server broadcasts the current global model to a selected subset of clients; each client performs local training (e.g., via Stochastic Gradient Descent) on its private data; and the server aggregates the resulting model updates (e.g., using Federated Averaging) to produce an improved global model. This process repeats until convergence, minimizing the need to exchange raw, sensitive data.

The efficiency of federated learning is critically measured by the number of communication rounds required for the model to converge, as this directly impacts training time, bandwidth cost, and device energy consumption. Key challenges include managing statistical heterogeneity (non-IID data) across clients, which can cause client drift and slow convergence, and optimizing for partial client participation where only a fraction of devices are available each round. Advanced algorithms like FedProx and SCAFFOLD are designed to reduce the required rounds and improve stability under these conditions.

FEDERATED LEARNING

Key Components of a Communication Round

A communication round is the fundamental iterative cycle in federated learning where the global model is updated through decentralized collaboration. Each round consists of distinct, orchestrated phases between a central server and participating edge clients.

01

Server Model Broadcast

The round begins with the central server selecting a cohort of available clients and transmitting the current global model parameters to them. This broadcast is a one-to-many distribution, often optimized for bandwidth. The selection strategy can be random, based on device capability, or designed to maximize statistical diversity. In cross-device FL, this phase must handle intermittent connectivity and device churn.

02

Local Model Training (Epochs)

Each selected client performs local stochastic gradient descent (Local SGD) on its private, on-device dataset. This involves multiple training epochs over the local data. The key hyperparameter is the number of local steps or epochs before communication. Performing more local computation reduces communication frequency but can lead to client drift, where local models diverge due to non-IID data distributions. Algorithms like FedProx add a proximal term to the local loss to constrain this drift.

03

Client Update Computation & Privacy

After local training, the client computes an update. This is typically the difference between the initialized and final model weights (delta) or the computed gradients. To preserve privacy, this update may be protected before transmission:

  • Differential Privacy: Adding calibrated noise to the update.
  • Secure Aggregation Preparation: Using cryptographic masks so the server can only decrypt the sum of all updates, not individual contributions.
  • Compression: Applying techniques like quantization or sparsification to reduce upload payload size.
04

Secure Update Aggregation

The server collects updates from participating clients. The core algorithmic step is model aggregation, most commonly Federated Averaging (FedAvg), which computes a weighted average of client updates. For security:

  • Secure Aggregation protocols ensure the server learns only the aggregated sum, not individual updates.
  • Byzantine Robust aggregation rules (e.g., median, trimmed mean) are used to filter out malicious updates from model poisoning attacks. This phase transforms many local insights into a single, improved global model.
05

Global Model Update & Evaluation

The aggregated update is applied to the global model, creating a new model version for the next round. The server then evaluates the new global model's performance, typically on a held-out validation set. Key metrics tracked across rounds include global accuracy, loss convergence, and fairness across client distributions. This evaluation informs decisions like adjusting client selection or learning rates for subsequent rounds.

06

Round Completion & Synchronization

The round concludes with the server preparing for the next iteration. In synchronous FL, the server waits for all selected clients to respond or times out, creating a consistent update cycle. In asynchronous FL, clients update the global model as soon as they finish, improving efficiency at the cost of potential staleness. The total number of communication rounds is a primary determinant of the training time and the final model quality, directly impacting the communication-computation trade-off.

KEY DETERMINANTS

Factors Influencing Communication Efficiency

This table compares the primary technical and system factors that determine the efficiency of communication rounds in federated learning, focusing on their impact on latency, bandwidth, and overall training convergence.

FactorHigh Efficiency (Favorable)Low Efficiency (Unfavorable)Primary Impact

Client Compute Speed

Fast, dedicated hardware (e.g., NPU)

Slow, shared CPU on constrained device

Local Training Time

Network Bandwidth

High (>100 Mbps), stable connection

Low (<1 Mbps), intermittent connection

Update Upload/Download Time

Model Size

Small, compressed (<1 MB)

Large, uncompressed (>100 MB)

Data Transferred per Round

Client Participation Rate

High, consistent participation

Low, sporadic participation

Statistical Utility per Round

Data Heterogeneity (Non-IID)

Low, similar distributions across clients

High, divergent distributions

Rounds to Convergence

Aggregation Algorithm

Robust to stragglers & heterogeneity (e.g., FedProx)

Simple averaging (FedAvg) on highly heterogeneous data

Convergence Efficiency

Update Compression

Applied (e.g., quantization, sparsification)

Not applied (full precision updates sent)

Bandwidth Consumption

Security/Privacy Overhead

Lightweight or selectively applied

Heavy (e.g., full Homomorphic Encryption)

Round Completion Latency

COMMUNICATION ROUNDS

Frequently Asked Questions

Communication rounds are the fundamental iterative cycles in federated learning. This FAQ addresses the core mechanics, optimization strategies, and trade-offs involved in coordinating model training across decentralized devices.

A communication round is the core iterative cycle in federated learning where a central server coordinates the collaborative training of a global model across multiple decentralized clients without exchanging raw data. Each round consists of four key phases: 1) Server Selection & Broadcast, where the server selects a subset of available clients and sends them the current global model; 2) Local Training, where each selected client computes an update by training the model on its private, on-device data; 3) Update Transmission, where clients send their local model updates (e.g., gradients or weights) back to the server; and 4) Secure Aggregation, where the server combines these updates—often using an algorithm like Federated Averaging (FedAvg)—to produce a new, improved global model. The process then repeats, with the refined global model being broadcast in the next round.

Prasad Kumkar

About the author

Prasad Kumkar

CEO & MD, Inference Systems

Prasad Kumkar is the CEO & MD of Inference Systems and writes about AI systems architecture, LLM infrastructure, model serving, evaluation, and production deployment. Over 5+ years, he has worked across computer vision models, L5 autonomous vehicle systems, and LLM research, with a focus on taking complex AI ideas into real-world engineering systems.

His work and writing cover AI systems, large language models, AI agents, multimodal systems, autonomous systems, inference optimization, RAG, evaluation, and production AI engineering.