![]() |
VOOZH | about |
Federated Learning is a powerful technique that allow a single machine to learn from many different source and converting the data into small pieces sending them to different Federated Learning (FL) is a decentralized of the machine learning paradigm that can enables to model training across various devices while preserving your data the data privacy.
In this article, we are going to learn about federated learning and discuss it's types.
Table of Content
Federated learning is a machine learning setting where the goal is to train a model across multiple decentralized edge devices or servers holding local data samples, without exchanging them. This approach contrasts with traditional centralized machine learning techniques where all data is uploaded to one server. Instead, in federated learning, the model is trained iteratively at multiple points of data origin, which send model updates rather than raw data to a central server.
Centralized Federated Learning involves a central server that orchestrates the training process. Participants (clients) train local models on their data and send the updated model parameters to the central server. The server aggregates these parameters to update the global model, which is then distributed back to the clients.
Key Features:
Example Use Case: A healthcare consortium where hospitals train models locally on patient data and a central server aggregates these models to improve disease prediction models.
Decentralized Federated Learning eliminates the need for a central server. Instead, participants communicate directly with each other to share and aggregate model updates. This peer-to-peer communication ensures that there is no single point of failure and enhances privacy by distributing the aggregation process.
Key Features:
Example Use Case: A network of mobile devices collaboratively training a model for predicting app usage patterns without relying on a central server.
Horizontal Federated Learning (also known as Sample-based FL) occurs when datasets across different participants share the same feature space but differ in samples. Essentially, the datasets have the same structure (features) but contain data about different entities.
Key Features:
Example Use Case: Multiple banks training a fraud detection model on their transaction data without sharing the actual transaction records.
Vertical Federated Learning (also known as Feature-based FL) occurs when participants have datasets that share the same sample space but differ in features. This means that each participant holds different attributes about the same set of entities.
Key Features:
Example Use Case: A partnership between a retail company and a bank where the retail company has purchasing data and the bank has financial data about the same customers.
Cross-Silo Federated Learning involves a small number of reliable and stable participants (silos) such as organizations or institutions. These silos typically have significant computational resources and stable network connections.
Key Features:
Example Use Case: Universities collaborating on a research project where each university has its dataset but wants to build a common predictive model.
Cross-Device Federated Learning involves a large number of devices such as smartphones, IoT devices, or edge devices. These devices are typically less reliable and have varying computational capabilities and intermittent network connectivity.
Key Features:
Example Use Case: Training a predictive text input model on mobile devices where each device contributes to the model without sending user data to a central server.
Federated Learning offers a versatile framework for collaborative machine learning while preserving data privacy. By understanding the different types of Federated LearningāCentralized vs. Decentralized, Horizontal vs. Vertical, and Cross-Silo vs. Cross-Deviceāorganizations can choose the approach that best fits their needs and constraints. Each type has its unique advantages and challenges, but all share the common goal of enabling robust, privacy-preserving machine learning across distributed data sources.