Federated Learning

Click here to explore the dashboard on federated learning

Tech Champion: Xabier Lareo

Training, testing and validating machine-learning models require data. Data that sometimes is dispersed amongst many, even millions of, parties (devices). Federated learning is a relatively new way of developing machine-learning models where each federated device shares its local model parameters instead of sharing the whole dataset used to train it. The federated learning topology defines the way parameters are shared. In a centralised topology, the parties send their model parameters to a central server that uses them to train a central model which in turn sends back updated parameters to the parties. In other topologies, such as the peer-to-peer or hierarchical one, the parties share their parameters with a subset of their peers.

Federated learning is a potential solution for developing machine-learning models that require huge or very disperse datasets. However, it is not a one-size-fits-all machine learning scenarios.

Federated learning still has open issues that scientists and engineers work hard to solve, some of which are detailed below.

Communication efficiency: federated learning involves numerous data transfers. Consequently, the central server or parties receiving the parameters need to be resilient to communication failures and delays. Ensuring efficient communication and synchronisation amongst the federated devices remains a relevant issue.
Device heterogeneity: computing capacities of the federated parties are often heterogeneous and sometimes unknown to the other parties or central server. It is still difficult to ensure the training tasks will work within a heterogeneous set of devices.
Data heterogeneity: federated parties’ datasets can be very heterogeneous in terms of data quantity, quality and diversity. It is difficult to measure beforehand the statistical heterogeneity of the training datasets and to mitigate the potential negative impacts such heterogeneity might have.
Privacy: there is a need for efficient implementation of privacy enhancing technologies to avoid information leakages from shared model parameters.

Positive foreseen impacts on data protection:

Decentralisation: by leveraging on distributed datasets, federated learning avoids data centralisation and allows the parties to have better control over the processing of their personal data.
Data minimisation: federated learning reduces the amount of personal data transferred and processed by third parties for machine-learning model training.
International cooperation: when the shared parameters are anonymous, federated learning facilitates the training of models with data coming from different jurisdictions.

Negative foreseen impacts on data protection:

Interpretability: machine-learning developers often rely on the analysis of the training dataset to interpret the model behaviour. The developers using federated learning do not have access to the full training dataset, which can reduce the models’ interpretability.
Fairness: some federated learning settings may facilitate bias toward some parties, for example towards devices hosting the most common model types.
Security issues: the distributed nature of federated learning facilitates some types of attacks (e.g. model poisoning). Classic defence mechanisms do not currently provide sufficient protection in a federated learning setup. Ad hoc defence methods still have to be developed and tested.

Our three picks of suggested readings:

L. Tian, A. Kumar Sahu, A. S. Talwalkar and V. Smith, Federated Learning: Challenges, Methods, and Future Directions, IEEE Signal Processing Magazine 37, 2020.
Q. Li, W. Zeyi, H. Bingsheng, A Survey on Federated Learning Systems: Vision, Hype and Reality for Data Privacy and Protection, ArXiv abs/1907.09693, 2021.
P. Kairouz et al, Advances and Open Problems in Federated Learning, Foundations and Trends in Machine Learning Vol 4 Issue 1, 2021.

+ View more news

News

EDPS Annual Report 2025: protecting people in a changing digital world

7 May 2026

The EDPS Annual Report 2025 looks back over a year when our work was characterised by the operationalisation of our expanding mandate, guided by our strategic principles: Foresight, Action and Solidarity. Consult the full Annual Report 2025 and its executive summary to get the full picture of our actions, monitoring activities, and new roles.

Read the Annual Report 2025 and Executive Summaries
Read the speech by Supervisor
Read the press release
Follow the press conference
Follow the speech of the Supervisor before the LIBE Committee

The digital future is here. We are guiding the EU administration through this transformation with expertise and an unwavering focus on your rights.

New episode of the Newsletter Digest is out

1 May 2026

Curious about how the EU is tackling AI supervision, cybersecurity rules, and data protection in the health industry? Our newest episode breaks it all down.

Have a listen

Latest EDPS newsletter out now

20 April 2026

Welcome to this edition of the EDPS Newsletter, focusing on new regulatory roles and essential safeguards. Catch up on our joint opinions regarding the European Biotech Act and the EU's cybersecurity package, the launch of the EDPS Compass for the AI Act, reflections on the value of prior consultations in law enforcement, and details on our upcoming high-level debate and Europe Day celebrations!

Agenda

13 May 2026

Leonardo Cervera Navas patricipates in the Information Law Committee of the German Bar Association, Brussels, Belgium

11 May 2026

119th EDPB Plenary, Participation of Wojciech Wiewiórowski, Brussels, Belgium

7 May 2026

Presentation of the EDPS Annual Report 2025 to the LIBE Committee of the European Parliament, Participation of Wojciech Wiewiórowski, Brussels, Belgium

4 May 2026

Wojciech Wiewiórowski meeting with Prof. Juan Manuel Corchado, Rector of the University of Salamanca, Brussels, Belgium

29 April 2026

"The Digital Omnibus: Simplification, Deregulation or Better Regulation?", Participation by Wojciech Wiewiórowski (online) in BRIDGE Symposium organised by the University of Łódź, Brussels, Belgium