An introduction to federated learning in an industrial context: Fundamentals

25.8.2023 | 7 minutes reading time

With the help of data, companies are able to make more informed decisions, optimize their workflows and gain an edge in the competitive world of business using the power of Machine Learning (ML). However, handling data has become increasingly difficult. Today, firms face the challenge of adapting to ever-changing regulatory and cybersecurity requirements, as well as ensuring the privacy of data owners. Due to these challenges, many industries still have limited access to the cutting-edge data technologies of the 21st century.

So what can we do about it? Well, if we cannot move the data to the ML models, then how about moving the ML models to the data? This is exactly what federated learning aims to achieve.

Federated learning is all about moving machine learning operations from the cloud to the edge devices and interacting with the data locally. A federated learning setup has two basic components: a server and multiple clients. Below you can see the fundamental steps of a federated learning system:

Figure: Fundamental steps of a federated learning system (author's own work)

As shown in the graphic (author's own work), we start by initializing a global model. We then send this model to the individual clients, which train it on their local data. The clients then return their updates to the server, where they are aggregated. At this point, one iteration is complete and we return to the first step: the server sends the updated global model back to the clients and the whole process starts over.
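
To make the loop concrete, here is a minimal, framework-free sketch of these steps. Everything in it is an assumption made for illustration: the "model" is a single weight fitted to y = w * x, the three clients and their data are made up, and the server uses a plain mean as its aggregation rule. A real system would use a proper model and a framework.

```python
def local_train(global_weights, local_data, lr=0.5, epochs=5):
    """Toy client step: fit one weight w so that y is close to w * x."""
    w = global_weights[0]
    for _ in range(epochs):
        # Gradient of the mean squared error of y = w * x on the local data
        grad = sum(2 * (w * x - y) * x for x, y in local_data) / len(local_data)
        w -= lr * grad
    return [w]


def aggregate(client_weights):
    """Toy server step: element-wise mean of the client weights."""
    n = len(client_weights)
    dim = len(client_weights[0])
    return [sum(ws[i] for ws in client_weights) / n for i in range(dim)]


def run_federated_rounds(client_datasets, num_rounds=20):
    global_weights = [0.0]  # initialize the global model
    for _ in range(num_rounds):
        # Send the global model to every client and train it locally;
        # the raw data never leaves the client, only the weights do.
        updates = [local_train(global_weights, data) for data in client_datasets]
        # Aggregate the returned updates into the new global model
        global_weights = aggregate(updates)
    return global_weights


def make_client(xs):
    """Hypothetical private dataset following y = 3 * x."""
    return [(x, 3.0 * x) for x in xs]


clients = [
    make_client([0.1, 0.2, 0.3, 0.4, 0.5]),
    make_client([0.6, 0.7, 0.8, 0.9, 1.0]),
    make_client([-1.0, -0.8, -0.6, -0.4, -0.2]),
]
final = run_federated_rounds(clients)
print(final)  # the single weight ends up very close to 3.0
```

Note that the server only ever sees weights, never the (x, y) pairs held by the clients.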

That's the basic concept of federated learning! As you might guess, there is more to each step in the diagram than meets the eye. The big concepts include strategy, privacy-enhancing techniques (PETs), and secure multiparty computation (SMPC). The goal of this series of two blog posts is to provide a fundamental overview of these topics and of how each of them relates to federated learning.

Strategy for federated learning

What is "strategy" in the context of federated learning? Strategy is about figuring out the most effective way to train the model and aggregate the weights, that is, the parameters the model learns during training.

Let's start with client selection and assume that we have 1000 clients. Should we pick all of them when we start the training? More clients means more data. But is more data always better for ML? Research has shown that using all the clients can actually lead to slower convergence: the model learns more slowly than it would with fewer clients that have high-quality datasets and cause less communication overhead (Németh et al. 4). Keep in mind that this is what the empirical research suggests and not a rule. All the ideas mentioned here should be taken with a grain of salt, as the "best" strategy is heavily context dependent.

So how do we choose the clients we want to include in our update step? Clients differ in their communication and computational capabilities as well as in the data points they have access to. Research has suggested many strategies: some prioritize clients with unique data points (Németh et al. 6), while others choose clients in the most energy-efficient way possible (Németh et al. 5).
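
As a point of reference, the simplest selection strategy is to sample a random fraction of the clients each round. The sketch below assumes exactly that. The function name, the `fraction` and `min_clients` parameters and the client IDs are hypothetical, and the smarter strategies from the literature would replace the random draw with a scoring rule.

```python
import random


def sample_clients(client_ids, fraction, min_clients=2, seed=None):
    """Pick a random subset of the available clients for one round."""
    rng = random.Random(seed)
    # Take the requested fraction, but never fewer than min_clients
    # and never more clients than actually exist.
    k = max(min_clients, round(len(client_ids) * fraction))
    k = min(k, len(client_ids))
    return rng.sample(client_ids, k)


all_clients = [f"client-{i}" for i in range(1000)]
selected = sample_clients(all_clients, fraction=0.1, seed=42)
print(len(selected))  # 100 of the 1000 clients take part in this round
```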

Now let's have a look at it from the server (aggregator) side. The number of data points available to each client varies and, therefore, the trained weights also have different “experience levels”. A model that has been trained on 1000 data points is at a different stage than one that has merely seen 100 data points. The aggregation rule decides how much each update counts. The FedAvg algorithm, for example, averages the model updates coming from the clients and weights each update by the number of data points it was trained on, so the model trained on 1000 data points has more influence on the global model than the one trained on 100.
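
The effect of the weighting is easy to see with numbers. The following sketch implements the example-weighted average that FedAvg uses, with made-up weight vectors for the two clients from the example above. The function and variable names are only illustrative.

```python
def fedavg(updates):
    """Average client weights, weighted by each client's number of examples.

    updates: list of (weights, num_examples) pairs.
    """
    total = sum(n for _, n in updates)
    dim = len(updates[0][0])
    return [sum(w[i] * n for w, n in updates) / total for i in range(dim)]


def plain_mean(updates):
    """Unweighted average, shown only for comparison."""
    dim = len(updates[0][0])
    return [sum(w[i] for w, _ in updates) / len(updates) for i in range(dim)]


updates = [
    ([1.0, 2.0], 1000),  # client trained on 1000 data points
    ([0.0, 0.0], 100),   # client trained on 100 data points
]
weighted = fedavg(updates)
unweighted = plain_mean(updates)
print(weighted)    # pulled strongly towards the 1000-example client
print(unweighted)  # both clients count the same
```

With weights of 1000/1100 and 100/1100, the larger client contributes roughly 91% of the aggregated model, whereas the plain mean would give each client 50%.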

Figure: Overview of ideas proposed by researchers (Németh et al. 5)

In the diagram above (Németh et al. 5), you can see the broad range of ideas that researchers have proposed over the years.

Proof of Concept: Predictive Maintenance

Now, let's take a deeper dive in an industrial context. I would like to present a proof of concept that I worked on to give you a glimpse of how federated learning works in practice. The proof of concept is about predictive maintenance. It's also a great example of how federated learning can be used to tackle key industrial challenges by enabling greater use of ML.

Importance of ML-supported predictive maintenance

Machines are at the heart of the industry and their downtimes are associated with considerable costs. Predictive maintenance refers to the concept of predicting when machines will fail and performing maintenance before they do. According to a study by McKinsey & Company, with the help of artificial intelligence, “availability can sometimes increase by more than 20%. Inspection costs may be reduced by up to 25% and an overall reduction of up to 10% of annual maintenance costs is possible.” (McKinsey & Company, Inc. 8).

However, the elephant in the room is the limited availability of data to individual factory owners. How often do modern machines actually fail? Perhaps once or twice a month? And how often do these failures have the same cause? With only a small number of machines available to each organization, it will be hard to collect enough quality failure data.

In a traditional ML environment, after acknowledging the lack of sufficient data, we could look for similar data outside our organization. Understandably, however, factory owners are hesitant to share their data with external organizations. This is a tricky scenario for conventional ML, but not so much for federated learning: it allows a large number of machines to contribute what they learn from their data to one larger central ML model, while the raw data stays where it is and privacy is preserved.

Implementation

We chose the Flower framework as it is very beginner friendly and has an active community that is ready to help with discussions and questions. We used the “Machine Predictive Maintenance Classification” dataset from Kaggle. It is a synthetic dataset and therefore well suited to our proof of concept, since there is not much preprocessing involved. The dataset was partitioned into smaller pieces so that each client had a unique subset available to train the model locally. Let's take a closer look at the server and the client side.
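
The partitioning step can be sketched as follows. This is not the code from the proof of concept. It is a minimal illustration that assumes a shuffled, equally sized split, and it uses row indices as a stand-in for the actual dataset rows. The row count and the number of clients are hypothetical.

```python
import random


def partition(rows, num_clients, seed=0):
    """Split rows into num_clients disjoint, roughly equal subsets."""
    shuffled = rows[:]  # copy so the caller's list stays untouched
    random.Random(seed).shuffle(shuffled)
    # Client i takes every num_clients-th row, starting at offset i
    return [shuffled[i::num_clients] for i in range(num_clients)]


rows = list(range(10000))  # stand-in for the rows of the dataset
parts = partition(rows, num_clients=5)
print([len(p) for p in parts])  # [2000, 2000, 2000, 2000, 2000]
```

Shuffling before splitting keeps the class distribution roughly similar across clients. Real-world clients rarely have such evenly distributed data, so this is the friendliest possible setup.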

Server Side

On the server side, you first define and compile your model as you normally would. As our strategy, we chose FedAvg, which averages the client models weighted by the number of training examples each client used. When the clients hold equally sized partitions, each client therefore has equal influence on the global model. It is not the most sophisticated algorithm, but its simplicity makes it a good fit for our proof of concept. For the initial parameters, we use random values. In a real-world scenario, however, you could ask a single client device to supply the initial weights by running local training on that client only. This approach provides more realistic starting weights, which can lead to faster convergence.

Figure: Server output during training (author's own work)

The image (author's own work) gives you an idea of what the server outputs and the steps it goes through. As you can see, after initialization, the server follows the steps that we described above. Each round, it samples a group of clients and uses them for training (fit_rounds), then aggregates their updates and provides an evaluation of that round.

Client Side

On the client side, we use the same model architecture. You might wonder why we chose the same architecture. The reason is that it would be impossible to aggregate the weights without knowing what kind of model architecture they belong to. Consider the weights as materials for constructing a building. Without the blueprint (the model architecture) of the building, placing the materials in the correct positions would be impossible.
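
The blueprint analogy can be made concrete with a small sketch. It assumes, purely for illustration, that a model's weights are a list of layers and that each layer is a flat list of numbers. The function name is hypothetical. Averaging position by position only makes sense if every client sends the same layer layout, so the sketch checks this first.

```python
def aggregate_checked(client_weights):
    """Average weights layer by layer after verifying matching layouts."""
    reference = [len(layer) for layer in client_weights[0]]
    for idx, weights in enumerate(client_weights[1:], start=1):
        sizes = [len(layer) for layer in weights]
        if sizes != reference:
            raise ValueError(
                f"client {idx} has layer sizes {sizes}, expected {reference}"
            )
    n = len(client_weights)
    return [
        [sum(w[layer][i] for w in client_weights) / n for i in range(size)]
        for layer, size in enumerate(reference)
    ]


client_a = [[1.0, 2.0], [3.0]]       # two layers: 2 weights and 1 weight
client_b = [[3.0, 4.0], [5.0]]       # same architecture as client_a
client_c = [[1.0, 2.0, 3.0], [4.0]]  # different first layer

merged = aggregate_checked([client_a, client_b])
print(merged)  # [[2.0, 3.0], [4.0]]

try:
    aggregate_checked([client_a, client_c])
except ValueError as err:
    print(err)  # the mismatching architecture is rejected
```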

Figure: Client output during training (author's own work)

In the image (author's own work), you can see that the client first establishes a connection with the server. After that, it trains the model using local data and also reports how the model performs on that data.

Conclusion

Federated learning is a fascinating concept that is interesting not only from a technical perspective but also from a business perspective. The idea of training models directly on users' distributed data sources makes it possible to extend data-intensive ML applications into areas that were previously out of reach due to privacy concerns or limited data access. In the next blog post, we will dive into the advanced techniques and the problems they try to solve.

References:

Németh, Gergely Dániel, et al. "A Snapshot of the Frontiers of Client Selection in Federated Learning." Transactions on Machine Learning Research, 2022, https://openreview.net/forum?id=vwOKBldzFu.
McKinsey & Company, Inc. Smartening up with Artificial Intelligence (AI) - What’s in it for Germany and its Industrial Sector? Digital McKinsey, 2017, www.mckinsey.com/~/media/mckinsey/industries/semiconductors/our%20insights/smartening%20up%20with%20artificial%20intelligence/smartening-up-with-artificial-intelligence.ashx. Accessed 24 July 2023.

Blog author

Ihsan Kisi
