Using Apache PLC4X and ElasticSearch for IIoT monitoring and anomaly detection

7.10.2019 | 6 minutes reading time

Industrial IoT (IIoT) as a buzzword gained traction within recent years. However, implementing common use cases like real-time monitoring of PLCs may involve a huge amount of money and effort. For example, current approaches implementing such a monitoring solution require complex architectures. Examples and show-cases for real-time monitoring often use OPC-UA as an interface to connect to the PLCs. However, this leaves out a huge number of older PLCs in factories all over the world with no support for OPC-UA.

We are presenting an approach involving less effort and way less money needed to implement near-real-time IIoT monitoring. This is realized by utilizing Apache PLC4X and the ELK Stack . Apache PLC4X is integrated into Logstash as a Logstash plugin. This is used to connect to the PLCs and transfer the incoming data into ElasticSearch. Kibana serves as the data analysis and monitoring user interface.

Showcase overview

The showcase implements a rather simple scenario with two conveyor belts within a factory containing temperature sensors for each conveyor belt. A conveyor belt consists of three workstations (or stages). The conveyor belt processes items through the workstations. The temperature of the processed items is monitored by the sensors at every stage. During each workstation, errors may occur due to high temperatures (it’s a rather theoretical example). These errors may produce faulty material, so we are eager to monitor the temperature on each workstation and get insights if anomalies happen.

The showcase consists of three main components. First, there is a simulated (virtual) factory with two servers as data sources for our scenario. The servers are polled by Logstash (with integrated Apache PLC4X) as the second component. Logstash pushes the data to ElasticSearch, with Kibana as the frontend for data analysis and display.

What is Apache PLC4X?

Apache PLC4X serves as a universal protocol adapter for different types of PLCs. The project’s goal is to easily access many types of PLCs by having a standardized interface for the PLCs’ protocol, e.g. Siemens S7 or Beckhoff ADS.

Logstash integration

The simplicity of the presented approach is achieved by the integration of Apache PLC4X into Logstash as a plugin. We implemented an easy and convenient way to leverage the power of PLC4X with a well-known ETL tool like Logstash. This blog post (German only) presents our challenges in building the Logstash integration.

The showcase

The showcases example implementation is available within a GitHub repository . You can run and try out the example with all components in a few seconds by executing docker-compose.

This will spin up several Docker containers for ElasticSearch, Kibana, Logstash, and 2 simulated OPC-UA servers. Everything is preconfigured to show the demo.

The simulated OPC-UA server

For the first iteration of our showcase, we quickly needed a simulated PLC to demonstrate the capabilities of Apache PLC4X with Logstash. As PLC4X’s greatest advantage over OPC-UA is that it seamlessly supports different kinds of legacy PLCs (i.e. PLCs without support for OPC-UA), we wanted to use a simulated PLC. However, we didn’t have such a simulated PLC at hand. Therefore, we used a free implementation of an OPC-UA server and because of the OPC-UA protocol support of Apache PLC4X, we can demonstrate our use case.

The server consists of three temperature sensors, continuously emitting temperature data with little variations in the temperature value (Gaussian distributed) and introduces randomized temperature variations into the data.

Demo Implementation

The implementation of our showcase is done by configuring the Logstash PLC4X plugin within a pipeline:

input {
    plc4x {
        jobs => {
            job1 => {
                rate => 200
                sources => ["sensors1", "sensors2"]
                queries =>  {
                    PreStage => "ns=2;i=3"
                    MidStage => "ns=2;i=4"
                    PostStage => "ns=2;i=5"
                    ConveyorBeltTimestamp => "ns=2;i=7"
                }
            }
        }
        sources => {
            sensors1 => "opcua:tcp://opcua-server-1:4840/freeopcua/server/"
            sensors2 => "opcua:tcp://opcua-server-2:4841/freeopcua/server/"
        }
    }
}

Afterward, we configured three Timelion visualizations to display the three different temperature sensors for each conveyor belt. You can configure many more useful visualizations if needed. The picture below shows the configuration of a time series diagram for one stage.

The 3 Timelion visualizations are combined within a dashboard and already present a nice overview of the temperature data.

You may already see some of the detected anomalies at first sight. However, smaller anomalies, with narrower outliers, are harder to manually discover within such a view. This is where Kibana’s machine learning features (part of the platinum license) come in place. We are able to automatically detect anomalies within our factory, simply by configuring a new machine learning job.

You can configure the machine learning job by navigating to the machine learning tab and clicking on the “create new job” button. Then you have to select the plant index and choose the wizard for multi-metric anomaly detection. Next, choose the time range for your job data. In our case, we had the example running for about four hours, which produced around 140.000 data points.

In the job settings area, select the fields on which the job runs: values.PreStage, values.MidStage and values.PostStage with the max aggregation operation. As the split data field, select the sensor name by choosing the sourceName field.

The bucket span describes the size of the generated buckets for the max aggregation and should be set to one second for a finer resolution. Last but not least, configure a job name and description. By clicking on the “create job” button, the machine learning job gets started.

After the job finished, you can view the results. As you can see in the screenshot, the model detected several anomalies and even scores them with a severity measure.

It’s also possible to configure a watcher (alerting) for a continuously running job. This allows for e-mail notifications when anomalies occur.

Key takeaways

In this blog post, we built a near-real-time monitoring and anomaly detection for a simulated factory showcase.

The benefits of this solution are its low cost and effort compared to similar solutions on the market. Although we used a simulated PLC, the demo is easily applicable to a real PLC. Monitoring, alerting, and machine learning is usually just a matter of configuring the available components and therefore quickly implemented.

For the demo setting with only three sensors on two servers, machine learning seems a bit overpowered. However, with thousands of PLCs within multiple factories, manually configuring alerting and thresholds can be a tedious task. Scaling up our example is easily manageable, all it takes is an ElasticSearch cluster and Logstash with the PLC4X plugin on small machines close to the PLCs.

So what’s next? We want to improve our showcase by using real or simulated PLCs without OPC-UA to demonstrate the real power of Apache PLC4X. We will further improve the PLC4X Logstash plugin and get it ready for production. It is on our roadmap to extend the showcase scenario to a more production-like setting with an ElasticSearch cluster and a scalable, variable amount of PLCs.

Was this post helpful?

Blog author

Stefan Herrmann

Do you still have questions? Just send me a message.

fromStefan Herrmann

Failure on demand – Scenes from an agile transformation

In this blog post, we want to show why agile transformations fail, illustrating various situations that unfortunately still occur far too often in reality today. More and more often, we notice that the company culture lived by the management and the...

Agile transformation
Agile

30.7.2020 | 12 minutes reading time

Stefan Herrmann

Your job at codecentric?

Jobs

Agile Developer und Consultant (w/d/m)

Alle Standorte

IoT fleet management: A comparison of balena and Portainer

When your system contains many IoT devices that are scattered over a large production facility or even distributed over multiple facilities, it is important that you can manage and update the deployed software, access logs and easily provision new devices...

IoT
IIoT
DevOps
Container
Raspberry Pi

10.1.2023 | 8 minutes reading time

Florian Lüdiger

Toit will bring your IoT projects up to speed

If you have ever created an application for a microcontroller such as the ESP32, you might have noticed that it’s quite different to most of the software development we as IT consultants are doing most of the time. Using C/C++ and ancient tooling for...

29.8.2022 | 6 minutes reading time

Florian Lüdiger

The universal recommender in Action(ML)

IntroductionRecommender systems have become crucial for many different businesses. E-commerce uses recommenders to guide their customers in finding the right products and to assure they stay on the site. Newspapers or entertainment websites want to keep...

AI
NoSQL
Data
Machine Learning
Python

18.4.2021 | 11 minutes reading time

Francesca Diana

A different flavour of IIoT: Recipes for the plant thermomix

In this article we explain our IIoT solution for autonomous, declarative plant growing. This article is the second in our series on IIoT. The first article dealt with general questions and problems about IIoT. Growing plants offers unique requirements...

Software architecture
IIoT
IoT
Software development

3.3.2021 | 8 minutes reading time

Marcus Hanhart

IIoT product development: lessons from past projects

In this overview article on industrial IoT product development we will guide you along the essential questions and directions to consider. We will go with you along the relevant topics and preconditions, when you start to connect large numbers of small...

IIoT
IoT
Python

11.11.2020 | 10 minutes reading time

Lifting an electric vehicle charger into the cloud

With the increasing popularity and availability of battery electric vehicles, privately-owned charger infrastructure at home or on company premises has become more and more common.A vehicle charger is much more than a simple mains socket – it contains...

18.10.2020 | 12 minutes reading time

Processing protobufs messages with AWS IoT Core

IntroductionThe Internet of Things (IoT) is gradually changing an ever increasing number of aspects of modern day life. From connected vehicles to sensors monitoring all sorts of metrics in our homes: chips can be put to use almost everywhere. They are...

AWS
Go
IoT
Serverless

2.7.2020 | 15 minutes reading time

Kick-start your microservice project with JHipster

I recently looked for a solution on how to prototype a customer project in a short time and came across JHipster. The target architecture used Spring Boot in the backend and an Angular frontend. JHipster can scaffold this in its simplest variant as...

Node.js
Angular
Software development
Container
NoSQL
Cloud
JavaScript
Java
Keycloak
Kubernetes
Microservices
IT-Security
Open Source
React
Spring

12.5.2020 | 13 minutes reading time

Jörg Riegel

Golang, Gin & MongoDB – Building microservices easily

Golang, a.k.a. Go, has been around in the industry for quite some time now, but people are still reluctant to just go ahead and use it. To help you get started, follow me on this journey and create your first microservice using Golang, Gin and Docker...

Cloud
Container
Go
Microservices
NoSQL

21.4.2020 | 10 minutes reading time

Selecting the right hardware for embedded development of open-source protocol...

With Apache PLC4X we are currently able to access almost any industrial PLC from the Java world. This is great, but we do know that there is a world outside of the Java ecosystem. Especially when it comes to embedded development, Java won’t get you far...

IoT
IIoT

15.4.2020 | 13 minutes reading time

From PDF data sheets to shared understanding with serverless SHACL

Knowledge contained in PDF filesWhen crawling the web for information about products of a specific category, may it be instances of industrial machine parts, chemical components, or even household goods, manufacturers of such goods often provide the ...

NoSQL
AWS
Big Data
Data
API
Microservices
Python
Serverless
Webdevelopment

1.4.2020 | 12 minutes reading time

Physical regression testing for the Thermomix

Automating physical regression testing of products with computer vision and roboticsTesting a physical product can be a highly manual task. The advances in Deep Learning techniques and computer vision have led to a situation where we can start to strive...

AWS
IoT
Computer Vision
Product management
AI
Testing

31.3.2020 | 8 minutes reading time

IoT from sensor to browser: Retrofitting machines

IoT is broadening its place in the industry. We can see many industry level devices available on the market. New technologies, such as AI, computer vision, 5G, VR/AR, microcontrollers, sensors, robotics and more, are pushing IoT forward and growing its...

IIoT
IoT

30.9.2019 | 3 minutes reading time

Apache PLC4X, the missing link for industrial innovation

When reading the current media or talking directly to people in the industry, it seems the industry is currently undergoing one of its greatest revolutions. It’s all about “Industry 4.0”, “smart factories” and “digitization” in general.Usually this implies...

22.4.2019 | 9 minutes reading time

Convolutional neural networks for damage detection

Damage detection from sensor data is at the basis of predictive maintenance . Mainly, one needs to discriminate the normal from the anomalous (damaged) status, and estimate the severity of the damage to forecast the right course of action (maintenance...

AI
Machine Learning
IoT

11.3.2019 | 10 minutes reading time

Fixing history — An event sourcing journey

Introduction Elescore, a platform built by me that tracks elevator disruptions, integrates multiple external data sources. One of these sources is the DB FaSta API , providing disruption information for all facilities operated by Deutsche Bahn. In ....

Open Source
Event Sourcing
Functional programming
IoT
Data Science

30.11.2018 | 15 minutes reading time

Book review: Smarter Homes – How Technology Will Change Your Home Life

At a recent IoT Hessen meetup in Frankfurt, where Alexandra Deschamps-Sonsino presented her brand-new book Smarter Homes, she told the audience that writing a book had been an item on her bucket list for a long time. So when she received an offer to...

19.11.2018 | 5 minutes reading time

Predictive Maintenance as business driver

When the sensor calls It is like a dream come true.Running factories with hundreds of machines for forging, cutting, melting, extruding, all monitored continuously 24/7 by many more sensors. Gigabytes of data streamed directly in the cloud. Every alteration...

Data
IoT

14.10.2018 | 11 minutes reading time

Understanding IoT (Part 2)

Summary of Part 1In the first part of this blog post we postulated that an IoT device in general is an abstract real-world interface. Subsequently, a general definition of the concept of innovation was elaborated and we found that three aspects in particular...

IoT
Software development

28.8.2018 | 11 minutes reading time

Understanding IoT (Part 1)

IntroductionThe Internet of Things, short “IoT”, belongs, without a doubt, to the trend topics (not to say hype topics) of the present time. Who doesn’t know the studies predicting the explosion of internet-connected devices in the near future (cf. e...

IoT
Software development

21.8.2018 | 8 minutes reading time