Realtime face detection and filtering with the Coral USB accelerator

8.11.2019 | 9 minutes reading time

In this blog post we explain how you can build your own face detection application without much machine learning knowledge. Why? At codecentric everyone has one day per week for professional development and training. Among other things we use this time to get in touch with new technologies and build cool stuff. This time we decided to have a closer look at the Coral USB accelerator. You can see the outcome in the following video.

By loading the video, you agree to YouTube’s privacy policy.
Learn more

Load video

Always unblock YouTube

The application detects faces based on a pre-trained neural network and overlays them with face filters. In order to keep the face filters assigned to individual faces, even if multiple people appear in the video, it tracks the detected faces over time. In this blog post we explain how it works and how you can build your own face detection application with low cost consumer hardware and without much machine learning knowledge.

Our hardware setup

We used the following hardware components:

Coral USB Accelerator
Raspberry PI 4
A webcam with 60fps – have a look at webcams which are compatible with a Raspberry Pi

The Coral USB accelerator connected to a Raspberry Pi 4 is the heart of the setup. The accelerator contains an edge TPU (Tensor Processing Unit) coprocessor which is optimized to process matrix operations. It currently only supports pre-compiled TensorFlow Lite models. It can perform 4 trillion operations per second. Therefore, it allows high inference speed for image classification and object detection using neural networks. When we ran our experiments on the CPU of the Raspberry Pi 4 without the Coral USB accelerator, the application could process between 0.5 and 1.5 frames per second. Using the accelerator, we achieved between 10 and 25 frames per second depending on how much image manipulation features we added and which image resolution we used.

The USB accelerator is connected with the Raspberry Pi 4 via the USB 3.0 Type C interface. While the accelerator also supports USB 2.0, it is recommended to use USB 3.0 to ensure sufficient data transfer rates. The Pi 4 is the first Pi which has USB 3.0 on board. You can also use older Raspberry Pi versions but expect USB 2.0 to be a bottleneck which will substantially lower the achievable framerate. Have a look at the framerate from our experiment we did two years ago with a Pi 3 and the Movidius stick, which was connected via USB 2.0.

We used a Logitech C920 HD Pro webcam for the setup but as we mentioned earlier, many webcams should work and should lead to similar results. After connecting the devices as shown below we can start to install the needed drivers and libraries.

Setup and installation

First you should install a clean Raspbian distribution on your Raspberry Pi using the Noobs installer. A detailed guide can be found here . With your Raspberry Pi up and running you can install git and clone the repository we prepared to help you get started fast. Installing all needed dependencies for the USB accelerator is quite some work. To save you the time and effort we wrote a script which automates the installation. You can find it in the root folder of the repository. Simply run install.sh, which is located in the root folder and it will install all dependencies. The installation will take a while. After the installation unplug and replug the Coral USB Accelerator once. Now you should be able to run the face replace demo with python3.7 -m face_replace.

Face detection

First we have to initialize the detection engine with the pre-trained model contained in the repository.

1from edgetpu.detection.engine import DetectionEngine
2
3face_detection_engine = DetectionEngine(FACE_DETECTION_MODEL_PATH)

Now we can read the video stream from the webcam. Therefore, we are using the image-utils library. To allow the camera sensor to warm up we wait 1 second before we start processing the stream.

1from imutils.video import VideoStream
2
3video_stream = VideoStream(src=0).start()
4time.sleep(1.0)

The next step is to read the single frames from the video stream and preprocess them. Since the video is recorded mirrored, we flip each frame horizontally using the computer vision library opencv . This code, together with the rest of the frame processing, happens inside a loop which runs infinitely until the user stops the application.

1import cv2
2
3while True:
4    input_frame = cv2.flip(video_stream.read(), 1) 
5    frame_as_image, resized_frame = frame_processor.preprocess(input_frame)

As the first step of the image preprocessing, we resize the frame. The frame we captured from the video is a numpy ndarray. The color model of the pixel stored in this array is BGR. Since we need an RGB image as input for the face detection engine, we have to convert the colors from BGR to RGB and create an image out of the array using the imaging library Pillow (PIL).

1import imutils
2from PIL import Image
3
4def preprocess(frame):
5    resized_frame = imutils.resize(frame, width=IMAGE_WIDTH)
6    rgb_array = cv2.cvtColor(resized_frame, cv2.COLOR_BGR2RGB)
7    frame_as_image = Image.fromarray(rgb_array)
8    return frame_as_image, resized_frame

Using the preprocessed image and the previously initialized model we can now start to run the face detection. Therefore, we call the method detect_with_image on the previously initialized model. This will run the inference, which means it will produce the predicted faces. The method takes multiple inputs: the image, a threshold which defines the minimum confidence for the detected faces and the top_k parameter, which defines the maximum number of faces the model should detect.

1face_detection_engine.detect_with_image(
2    frame_as_image,
3    threshold=confidence,
4    keep_aspect_ratio=True,
5    relative_coord=False,
6    top_k=MAX_FACES,
7)

The detected faces are a list of DetectionCandidates where each entry provides the bounding box of the detected face.

Face overlay

In the next step we iterate over the detected faces, extract each bounding box and use it to overlay the faces with a face filter. How we determine the face filter will be explained in the next chapter. Notice, we apply the face filter on the resized frame (ndarray) instead of the Pillow image.

1for face in detected_faces:
2    bounding_box = face.bounding_box.flatten().astype("int")
3    face_filter = cache.update(bounding_box)
4    frame = frame_processor.replace_face(
5        bounding_box, resized_frame , face_filter
6    )

We achieve this by extracting the coordinates from the bounding box and resizing the face filter to the size of the bounding box.

1(bbox_x1, bbox_y1, bbox_x2, bbox_y2) = bounding_box
2width = bbox_x2 - bbox_x1
3height = bbox_y2 - bbox_y1
4face_filter_resized = cv2.resize(
5    face_filter, (width, height), interpolation=cv2.INTER_AREA
6)

Afterwards, we can overwrite the original face with the resized face filter. We rewrite all pixels inside the bounding box. Since our face filters are PNG images and want to keep the transparent regions of the images we have to take the alpha value into account and draw the original image with the inverted alpha value of the face filter image.

1face_filter_alpha = face_filter[:, :, 3] / 255.0
2inverted_alpha = 1.0 - face_filter_alpha
3for colour_index in range(0, 3):
4    frame[bbox_y1:bbox_y2, bbox_x1:bbox_x2, colour_index] = (
5        face_filter_alpha * face_filter[:, :, colour_index]
6        + inverted_alpha * frame[bbox_y1:bbox_y2, bbox_x1:bbox_x2, colour_index]
7    )

The last step is to display the manipulated frame. We can easily do this by calling the method imshow and providing the window name and the frame inside the constructor.

1cv2.imshow(window_name, frame)

Face tracking

Until now, we didn’t explain how we keep track of faces and cover a face with the same face filter over time. Let’s say you would choose a filter randomly. Since the application does not yet know any time dependency, different face filters would be randomly chosen for the same person for every frame, which would result in a very chaotic filter flickering. Instead, our goal should be to track faces. That’s why we implemented a simple tracking algorithm which assigns a specific face filter to each face even when the position of the face changes from frame to frame.

Caching – keeping faces in memory

If you already worked with face detection the first idea which might come to your mind is to apply feature recognition to each detected face and compare the features from frame to frame, which would allow you to track the face. On the one hand this approach would be very expensive computationally speaking, on the other hand it would also be complex to implement.

Instead, we came up with the idea to store the bounding boxes of the detected faces, together with the related face filter, inside a cache. Using this cache, we calculate the nearest bounding box from frame to frame in order to rediscover the face related to the bounding box. This approach is much easier to implement and requires significantly fewer calculations. Though, it leads to an unwanted feature. When person B walks in front of person A, they might be able to “steal” the face filter from person A, since person A’s face won’t be visible while covered by person B and the closest face to A’s filter will then be the face of person B. We did not mind this feature for our experiments ;-).

Each entry of the cache contains the bounding box of the detected face, a face filter randomly chosen from the available face filter collection and the age of the entry

1class Cache:
2   def __init__(self, face_filters):
3        self.entries = []
4        self.available_face_filters = face_filters

For each detected face we update the cache with its bounding box and get the face filter for the face.

1def update(self, bbox):
2    if len(self.entries) == 0:
3        return self._add_new_entry(bbox)
4    nearest_bbox_distance, nearest_bbox_index = self._nearest_bounding_box(bbox)
5    if nearest_bbox_distance > Cache.MAX_BBOX_DISTANCE:
6        return self._add_new_entry(bbox)
7    return self._update_entry(bbox, nearest_bbox_index)

If the cache is empty or the distance of the new bounding box to the cached bounding box exceeds a defined threshold, we add a new entry to the cache and return a new randomly chosen face filter.

1def _add_new_entry(self, bbox):
2    face_filter = random.choice(self.available_face_filters)
3    self.entries.append([bbox, face_filter, Cache.INITIAL_AGE])
4    return face_filter

Otherwise, we update the cache entry with the nearest bounding box. This means we overwrite the bounding box of the cached entry with the new one, reduce the age of the entry and return the previously applied face filter.

1def _update_cache_entry(self, bbox, nearest_bbox_index):
2    self.entries[nearest_bbox_index][0] = bbox
3    self.entries[nearest_bbox_index][2] -= Cache.REJUVENATE
4    return self.entries[nearest_bbox_index][1]

To find the nearest bounding box we perform a nearest neighbor lookup using the k dimensional search tree provided by the scipy library .

1from scipy.spatial import cKDTree
2
3def _nearest_bounding_box(self, bbox):
4    bb_matrix = [entry[0] for entry in self.entries]
5    nearest_bbox_distance, nearest_bbox_index = cKDTree(bb_matrix).query(bbox, k=1)
6    return nearest_bbox_distance, nearest_bbox_index

Cache invalidation

Our caching approach allows us to track faces but when the application is running, the cache will grow and use up an increasing amount of memory. Furthermore, it will contain bounding boxes of faces which left the captured area of the camera. For these reasons we decided to invalidate the cache each 10 iterations of the video loop.

1if num_iterations % 10 == 0:
2    cache.invalidate()

The invalidate method first increases the age of each cache entry and then drops all entries whose age is equal or bigger than the maximum age.

1def invalidate(self):
2     aged_entries = [
3         [entry[0], entry[1], entry[2] + Cache.AGING] for entry in self.entries
4     ]
5     self.entries = [entry for entry in aged_entries if entry[2] < Cache.MAX_AGE]

Your idea?

In this blog post we showed how you can build your own AI based face detection application using low cost consumer hardware and little machine learning knowledge.
Now that you know how it works, you could try to build your own applications. For example, have a look at the pong game .

We hope that we inspired you to start your own experiments. Share your ideas or results in the comments below and let us know which experiments we should conduct next!

We thank our colleague Marcel Mikl for his support during the implementation of the demo.

Was this post helpful?

Blog authors

Christoph Knauf

IT Consultant

Do you still have questions? Just send me a message.

Paul Strobel

Do you still have questions? Just send me a message.

fromChristoph Knauf & Paul Strobel

Architecture docs as code with Structurizr & Asciidoctor. Part 5: Generating...

You are reading the final part of this article series about architecture documentation as code. In the previous articles a workflow was implemented that aims to reduce the efforts for maintaining long-living architecture documentation, keep it up to ...

Software architecture
Documentation

20.12.2022 | 18 minutes reading time

Christoph Knauf

Architecture docs as code with Structurizr & Asciidoctor. Part 4: Publishing

You are reading the fourth part of this article series about architecture documentation as code. If you worked through the previous articles, you already automated the generation of your architecture documents using Asciidoctor and integrated the diagrams...

Software architecture
Documentation

28.10.2022 | 6 minutes reading time

Christoph Knauf

Architecture docs as code with Structurizr & Asciidoctor. Part 3: Structurizr

You are reading the third part of this article series about architecture documentation as code. In this article, we will implement the Structurizr-related part of the workflow highlighted in the following figure. ...

Software architecture
Documentation

21.10.2022 | 15 minutes reading time

Christoph Knauf

Architecture docs as code with Structurizr & Asciidoctor. Part 2: Asciidoctor

You are reading the second part of this article series about architecture documentation as code. In this article, we will implement the Asciidoctor-related part of the workflow highlighted in the following figure. ...

Software architecture
Documentation

13.10.2022 | 9 minutes reading time

Christoph Knauf

Architecture docs as code with Structurizr & Asciidoctor. Part 1: Workflow...

In this article series, we learn how to generate and publish HTML architecture documentation from code with Structurizr and Asciidoctor. The goal of this approach is to reduce the efforts for maintaining long-living architecture documentation, keep it...

Software architecture
Documentation

25.8.2022 | 9 minutes reading time

Christoph Knauf

Tackling climate change with machine learning [part 6] – Datasets & further...

Before we get started with this chapter, here is the full summary video, containing all 5 previous parts, enjoy! By loading the video, you agree to YouTube's privacy policy. Learn more Load video Always unblock YouTube The first 5 chapters of...

Data
AI
Machine Learning

26.9.2019 | 4 minutes reading time

Paul Strobel

Tackling climate change with machine learning [part 5] – Industry & carbon...

By loading the video, you agree to YouTube's privacy policy. Learn more Load video Always unblock YouTube On 10th of June, 2019, twenty-two AI researchers, including Andrew Ng and Yoshua Bengio, published a paper on how climate change can be...

Data
AI
Machine Learning

25.9.2019 | 5 minutes reading time

Paul Strobel

Tackling climate change with machine learning [part 4] – Farms & Forests

Data
AI
Machine Learning

24.9.2019 | 4 minutes reading time

Paul Strobel

Tackling climate change with machine learning [part 3] – Buildings & Cities

Data
AI
Machine Learning

23.9.2019 | 6 minutes reading time

Paul Strobel

Tackling climate change with machine learning [part 2] – Transportation

Data
AI
Machine Learning

22.9.2019 | 7 minutes reading time

Paul Strobel

Tackling climate change with machine learning [part 1] – Electricity systems

By loading the video, you agree to YouTube's privacy policy. Learn more Load video Always unblock YouTube On 10th of June, 2019, twenty-two AI researchers, including Andrew Ng, David Rolnick and Yoshua Bengio, published a paper on how climate...

Data
AI
Machine Learning

19.9.2019 | 7 minutes reading time

Paul Strobel

Your job at codecentric?

Jobs

Agile Developer und Consultant (w/d/m)

Alle Standorte

Pull off Architecture Reviews at Light-Speed with LASR!

Foreword: This blog is loosely based on a recent project experience. All persons, companies and names are fictitious, as to make them NDA compliant. Any resemblance to a person, existing company or brand is purely coincidental and unintentional.For most...

Software architecture

4.4.2025 | 13 minutes reading time

Feature-Sliced Design and what we need for good frontend architecture

Feature-Sliced Design and what we need for good frontend architecture While a lot has been published on the topic of software architecture in the backend, and there are well-established best practices, this topic is less prominent for frontend applications...

Software architecture
Frontend

23.1.2025 | 10 minutes reading time

Hexagonal Architecture is just an island

Imagine an island called "Alistair Island." This island is a vibrant place with houses, fertile soil, and a well-coordinated community of residents who live by well-defined routines. Every activity on the island has significance and serves a specific...

Software architecture
Testing
Software development

22.1.2025 | 10 minutes reading time

Danny Steinbrecher

Modularization the easy way: Spring Modulith with Kotlin and Hexagonal...

Modularization the easy way: Spring Modulith with Kotlin and Hexagonal Architecture Modularization is a key concept in modern software development to make applications maintainable, testable and flexible. In this article we will see how Spring Modulith...

Software architecture
Kotlin
Spring

14.1.2025 | 9 minutes reading time

Danny Steinbrecher

Charge your APIs Volume 36 - Trends for 2025

As 2025 approaches, new trends are emerging in the world of APIs. After 2024 was user-centric, the focus is now shifting back to developer needs and increasing productivity. APIs are evolving and the technologies surrounding them are becoming more powerful...

Integration
API
Data
Software architecture

11.12.2024 | 5 minutes reading time

Daniel Kocot

ArchUnit in practice: Keep your Architecture Clean

Who hasn’t been there: A new project kicks off or the old code finally needs a cleanup. A big meeting with all the developers is called: “This time, we’ll do it right—clean, correct, and structured!” Architecture Decision Records (ADRs) are created to...

Software architecture
Java
Kotlin
Software development

20.9.2024 | 18 minutes reading time

Danny Steinbrecher

Charge your APIs Volume 30 - Gateway to Success: Understanding and Choosing...

API gateways are essential for managing and securing data flow between services. As software architectures evolve, different types of API gateways have emerged to address specific challenges: Legacy, Agnostic, and Kubernetes-native. Drawing on insights...

API
Software architecture
Infrastructure
Integration

21.8.2024 | 12 minutes reading time

Daniel Kocot

When Business Meets Technology: From Data Product to Data Architecture...

Abstract The Data Product Canvas (DPC) is a tool for the lightweight and iterative definition of data products. It increases the efficiency of product definition by clearly presenting the key impact areas on data products. Additionally, the DPC motivates...

Software architecture
Data
DDD
Digital product developement

6.8.2024 | 24 minutes reading time

Dr. Florian Rademacher

Exploring Dapr: A Deep Dive into Distributed Application Runtime

In a recent blog post, we introduced Dapr (Distributed Application Runtime) and highlighted its potential as a valuable tool for cloud-native applications, in combination with Aspire. This post dives deeper into the inner workings of Dapr, explaining...

Software development
Cloud native
Software architecture
Open Source

10.7.2024 | 10 minutes reading time

Manuel Zapf

Spring Boot and HTMX: The boring app

Motivation Most apps I touched in the wild follow the same two tiered approach. A backend delivering JSON (some may call this REST) and a frontend framework, consuming JSON from the backend converting it to the HTML displayed to the user. Worst case,...

Software architecture
Software development
Spring
Kotlin

28.6.2024 | 16 minutes reading time

Modern Microservices: Unleashing the Power of .NET Core, Aspire, and Dapr

I recall the days when writing a web application in C# with .NET meant deploying it on an IIS web server for accessibility. Today, this approach seems outdated, especially with the shift towards microservice-based architectures. Fortunately, Microsoft...

Software architecture
Open Source
Cloud
Microservices
Infrastructure as Code
.NET
Cloud native

27.6.2024 | 8 minutes reading time

Manuel Zapf

Zero Trust Azure Identity & Access Architecture

Falko Lehmann and Hendrik Kamp have already explained in their blog post on Zero-trust Architecture why zero-trust security models are preferable to traditional perimeter security models in order to minimize damage from cyber attacks. Falko and Hendrik...

IT-Security
IAM
Azure
Software architecture

4.6.2024 | 14 minutes reading time

Plug-in architectures with WebAssembly

Plug-in architectures are an essential concept for developing customizable software. In a plug-in architecture, the application logic is split into a host (or core) system and a number of plug-in components. These plug-ins enable customers to tailor ...

Software architecture
Webdevelopment
Backend

3.11.2023 | 13 minutes reading time

An introduction to federated learning in an industrial context: Advanced

In the Machine Learning space, it was long believed that sharing learnings or weights was safe in the sense that the input data couldn't be extracted. However, this belief has been challenged by researchers coming out over the years. Nowadays, numerous...

Machine Learning
Big Data
Data Science
Data

18.9.2023 | 9 minutes reading time

Charge your APIs Volume 15: API Gateways - Navigating the Agony of Choice...

In the dynamic world of APIs, our previous exploration into API Managment and APIOps shed light on the intricacies of managing and streamlining API operations. As we delve deeper into this realm, another critical component emerges at the forefront: API...

API
Software architecture

7.9.2023 | 7 minutes reading time

Daniel Kocot

An introduction to federated learning in an industrial context: Fundamentals

With the help of data, companies are able to make more informed decisions, optimize their workflows and gain an edge in the competitive world of business using the power of Machine Learning (ML). However, handling data has become increasingly difficult...

Machine Learning
Data Science
Data
Big Data

25.8.2023 | 8 minutes reading time

How to combine Poetry, TensorFlow, and the power of the Apple M1 GPU

In this article, we'll explore how to use the Poetry package manager to manage the dependencies of a machine learning project that makes use of the M1 GPU for TensorFlow training. We'll cover the motivation for using Poetry in this context, and we'll...

Machine Learning
Apple
Data
AI
Python

11.1.2023 | 3 minutes reading time

Denis Stalz-John

Architecture docs as code with Structurizr & Asciidoctor. Part 5: Generating...

Software architecture
Documentation

20.12.2022 | 19 minutes reading time

Christoph Knauf

Architecture docs as code with Structurizr & Asciidoctor. Part 4: Publishing

Software architecture
Documentation

28.10.2022 | 7 minutes reading time

Christoph Knauf

Architecture docs as code with Structurizr & Asciidoctor. Part 3: Structurizr

Software architecture
Documentation

21.10.2022 | 16 minutes reading time

Christoph Knauf

Realtime face detection and filtering with the Coral USB accelerator

Our hardware setup

Setup and installation

Face detection

Face overlay

Face tracking

Caching – keeping faces in memory

Cache invalidation

Your idea?

Was this post helpful?

Blog authors

More articles

Architecture docs as code with Structurizr & Asciidoctor. Part 5: Generating...

Architecture docs as code with Structurizr & Asciidoctor. Part 4: Publishing

Architecture docs as code with Structurizr & Asciidoctor. Part 3: Structurizr

Architecture docs as code with Structurizr & Asciidoctor. Part 2: Asciidoctor

Architecture docs as code with Structurizr & Asciidoctor. Part 1: Workflow...

Tackling climate change with machine learning [part 6] – Datasets & further...

Tackling climate change with machine learning [part 5] – Industry & carbon...

Tackling climate change with machine learning [part 4] – Farms & Forests

Tackling climate change with machine learning [part 3] – Buildings & Cities

Tackling climate change with machine learning [part 2] – Transportation

Tackling climate change with machine learning [part 1] – Electricity systems

Your job at codecentric?

Agile Developer und Consultant (w/d/m)

More articles in this subject area

Pull off Architecture Reviews at Light-Speed with LASR!

Feature-Sliced Design and what we need for good frontend architecture

Hexagonal Architecture is just an island

Modularization the easy way: Spring Modulith with Kotlin and Hexagonal...

Charge your APIs Volume 36 - Trends for 2025

ArchUnit in practice: Keep your Architecture Clean

Charge your APIs Volume 30 - Gateway to Success: Understanding and Choosing...

When Business Meets Technology: From Data Product to Data Architecture...

Exploring Dapr: A Deep Dive into Distributed Application Runtime

Spring Boot and HTMX: The boring app

Modern Microservices: Unleashing the Power of .NET Core, Aspire, and Dapr

Zero Trust Azure Identity & Access Architecture

Plug-in architectures with WebAssembly

An introduction to federated learning in an industrial context: Advanced

Charge your APIs Volume 15: API Gateways - Navigating the Agony of Choice...

An introduction to federated learning in an industrial context: Fundamentals

How to combine Poetry, TensorFlow, and the power of the Apple M1 GPU

Architecture docs as code with Structurizr & Asciidoctor. Part 5: Generating...

Architecture docs as code with Structurizr & Asciidoctor. Part 4: Publishing

Architecture docs as code with Structurizr & Asciidoctor. Part 3: Structurizr