Neural Compute Stick: Object Detection with Neural Networks

3.10.2017 | 7 minutes reading time

Convolutional Neural Networks have become the first choice to extract information from visual data. For example, they are used in the Google search engine to classify images. Essentially, they mimic the way a human being recognizes images.
In order to do that, the system learns to recognize certain characteristics of an object. In the case of a person, this could be the limbs or head and face. It then produces a trained model.
What’s great about it: The algorithm learns the characteristics of an object on its own in the training process, there’s no need to point them out manually.
Of course, the system only detects objects it was previously trained with. The downside: You usually need a pretty fast graphics card to train a model or run inferences in it.

In this post, we use neural networks (specificly CNNs) to classify objects in a video stream on a cheap, ordinary PC without a dedicated GPU at all.
This is made possible by the recently released Movidius Neural Compute Stick.

By loading the video, you agree to YouTube's privacy policy.
Learn more

Load video

Always unblock YouTube

Movidius Neural Compute Stick

The Neural Compute Stick (NCS) is a tiny computer meant to accelerate the execution of neural networks. The entire hardware is packed into a USB-stick which is compatible with USB 2.0 or later. However, it is recommended to use the device on a USB 3.0 port if possible.
Internally, the NCS uses a so-called Vision Processing Unit that goes by the name of Myriad 2. This relatively new kind of micro-processor is specifically made for machine-vision related tasks and thus very energy-efficient. According to the manufacturer, the typical power consumption of the VPU is around 1 Watts. The NCS is on sale for around 80 US-Dollars, but only available in a few online-shops as of September 2017.

The Movidius Neural Compute Stick is not a universal solution for deep learning. It is not possible to train a model on it. Instead, you can only run inferences on input data like a video stream with a pre-trained model. This means you will need to train your model first on your computer, which, as already mentioned, can take lots of time. Luckily, the NCS community already provides some pre-trained models that are available for free. Theoretically every model that was made with the deep-learning library Caffe (.caffemodel) is compatible, but you will have to convert it for the NCS. Also, make sure the model uses one input variable only. The NCS is currently limited to that.
In the screenshot below we see the classification of images from a webcam input stream. This example uses the GoogLeNet.

Classification of images from a webcam stream on the NCS running GoogLeNet.

Preparation and setup:

The software you need to program the Neural Compute Stick and use it on the target platform is available for download on the Movidius page. That includes some example applications like the code for the application in the screenshot above. The models for that are also available for download, among which are some widely-known ones like the AlexNet or the GoogLeNet.

Requirements for the development platform:

x64-PC running a native Ubuntu Linux, virtual machines are not supported
Windows and MacOS are (currently) not supported
The Linux distribution has to be an Ubuntu 16.04 LTS.
On Ubuntu you need to have Python version 3.5.2 installed.

Note: The target platform does not have to meet these requirements. In case you are wondering: Yes, you can also plug it into your Raspberry Pi. Movidius mentions explicitly that the NCS is compatible.

Requirements to run the example code for the NCS:

You need to have a webcam (internal or external)
Gstreamer 1.0 and its Plugins “Basic”, “Good”, “Bad” and “Ugly” have to be installed.
Installation instructions for Gstreamer: https://wiki.ubuntu.com/Novacut/GStreamer1.0#Installing_GStreamer_1.0_packages .

Installing Toolkit and API:

Before we can use the Movidius Neural Compute Stick, the API and Toolkit have to be installed. The Toolkit is used to convert or test a model for the NCS, the Movidius API allows you to access the functionality of the Neural Compute Stick, for example wih Python. Here is the download page: https://ncsforum.movidius.com/discussion/98/latest-version-movidius-neural-compute-sdk .
The installation is done by simply running a Bash-script in the unzipped Toolkit/API folder.

$ ./setup.sh

Keep in mind that the script, especially the one for the Toolkit, can take a lot of time to complete (15-30 minutes) and needs a stable internet connection.

Before we can use a model with the stick, we have to convert it with this script from the Toolkit:

$ python3 ./mvNCCompile.pyc sample_network.prototxt -w sample_network.caffemodel -s 12 -o name_of_outputfile

We can now use the generated “graph” file (default name, if not specified) in an application for the NCS.

Object detection with Tiny YOLO

Now comes the hard part. We want to be able to find various objects in a given scene and identify what they are. The provided models by Movidius are not up to this task.
They can only classify one object at a time respecitvely per frame. They are based on the assumption that an input image shows only one relevant object in close-up.
YOLO (You Only Look Once) in contrast is a neural network which can detect and localize multiple objects in one frame.
On top of that, YOLO can tell persons apart from objects in a given scene.
Tiny YOLO is the small brother of YOLO, a resource saving alternative for weaker devices. Thanks to various optimizations it enables the NCS to run object detection almost in realtime (approximately 0.2s processing time per frame). Naturally, this comes at a cost and so the error rate increases noticeably. However, for our purposes the detection is still sufficiently accurate.
A few developers on the Movidius Forums have already ported Tiny YOLO to the NCS. The result of their efforts is available for download on GitHub: https://github.com/gudovskiy/yoloNCS .
Here you can find some example code written in Python and the Tiny YOLO model itself.

Setup Tiny YOLO on the NCS

To run the sample code for Tiny YOLO, some additional software is required on the development and target platform. You have to build OpenCV as well as ffmpeg from source on your platform. A simple installation via Python’s own package manager pip is not sufficient, because this “light” edition of OpenCV is missing some important parts.
This means, OpenCV can probably not access your camera and start the video-stream. Furthermore, you need a webcam with Linux-compatible drivers.

Let’s start with ffmpeg. To install ffmpeg under Linux, follow the instructions in the official guide: https://trac.ffmpeg.org/wiki/CompilationGuide/Ubuntu .
Once the installation is finished, use the command below in a new terminal to check whether ffmpeg can find and use your camera.
Replace /dev/video0 with your video source.

$ ffmpeg -f v4l2 -list_formats all -i /dev/video0

The output should look something like this:

[video4linux2,v4l2 @ 0x2753960] Raw : yuyv422 : YUYV 4:2:2 : 640x480 352x288 320x240 176x144 160x120 1280x720 640x480
[video4linux2,v4l2 @ 0x2753960] Compressed: mjpeg : Motion-JPEG : 640x480 352x288 320x240 176x144 160x120 1280x720 640x480

After that, we’re done with ffmpeg. Let’s move on to OpenCV: http://docs.opencv.org/2.4/doc/tutorials/introduction/linux_install/linux_install.html .
To roughly check if OpenCV has been installed sucessfully, you can run some samples in your installation folder under /bin if you like.

As already mentioned in the introduction to the Movidius-Stick, the Caffemodel of Tiny YOLO needs to be converted in a format that is compatible with the NCS. The software for this task is included in the Movidius Toolkit.
The exact command is:

$ python3 ./mvNCCompile.pyc your_path/yolo_tiny_deploy.prototxt -s 12

Before you execute it, make sure the Caffemodel and the corresponding Prototxt-file are in the same folder. Both files must have the same name, otherwise the conversion will silently fail without throwing an error. The generated model is then useless.

Tiny YOLO on the NCS

Now we are finally ready to see Tiny YOLO in action. As a last step, move the “py_examples” folder from the “yoloNCS” repo to the “ncapi” (the unpacked NCS-API) folder. Alternatively, you can create a symlink. The example code includes these two samples:

yolo_example: Will detect objects with the Tiny YOLO model in an .jpg image and highlights found objects in the image.
yolo_object_detection_app: Will detect objects in a video stream from your webcam and highlights found objects in a video.

Let’s start the “object_detection_app” with Python 3. If everything is set up correctly, you will now see the video stream of your webcam in which Tiny YOLO highlights objects it has learned. The numbers state to what extent the detected objects resemble the trained template. The level of similarity from which the system considers an object as “detected” is configurable. This means we either end up with more false positives or false negatives, depending on our requirements.
This last screenshot and the video at the top of this page demonstrate what the Tiny YOLO object detection looks like with a webcam.

Object detection in a webcam stream on the NCS running the Tiny YOLO model.

The model, as it is trained now, detects 20 different things, among which are persons and a bunch of animals. The detection of human beings works best. “Lifeless” items however are sometimes not recognized.

Was this post helpful?

Blog author

Dominique Dorscheid

Do you still have questions? Just send me a message.

Your job at codecentric?

Jobs

Agile Developer und Consultant (w/d/m)

Alle Standorte

Open Source hits Billion-Dollar Market: DeepSeek-R1 is shaking up the ...

On January 27, 2025, the technology stock exchange experienced an unexpected crash: The NVIDIA stock price plummeted by over 17%, temporarily wiping out nearly $600 billion in market value and setting a new historical record in the stock market. Many...

AI
Generative AI
LLM

29.1.2025 | 8 [Missing String "readingTime"]

How we can hack an AI with just a few words

How we can hack an AI with just a few words Artificial intelligence (AI) has undergone an astonishing transformation in recent years and is now present in many areas of life. Whether in the form of chatbots that help us with everyday questions or generative...

IT-Security
AI

27.1.2025 | 4 [Missing String "readingTime"]

Simplifying LLM Application Development: A Newcomer's Perspective

I. Introduction Large Language Models (LLMs) have become highly popular due to their transformative impact on various fields, especially within IT. They enable developers to create innovative software applications centered around AI interactions, offering...

Generative AI
AI

6.12.2024 | 13 [Missing String "readingTime"]

Function Calling with GPT Models

GenAI is a powerful tool for generating content and interacting with applications using natural language. However, this tool also has significant limitations when you plan to use it in your own software. GenAI's knowledge is limited to information that...

Generative AI
AI
LLM

6.9.2024 | 5 [Missing String "readingTime"]

Answer questions about your documents with OpenAI and Pinecone

In recent years, large language models (LLMs) have made remarkable progress in interacting with humans, showcasing their ability to answer a wide array of questions. Trained on publicly accessible internet content, these models have broad knowledge across...

13.11.2023 | 12 [Missing String "readingTime"]

Lukas Lehmann

An introduction to federated learning in an industrial context: Advanced

In the Machine Learning space, it was long believed that sharing learnings or weights was safe in the sense that the input data couldn't be extracted. However, this belief has been challenged by researchers coming out over the years. Nowadays, numerous...

Machine Learning
Big Data
Data Science
Data

18.9.2023 | 9 [Missing String "readingTime"]

An introduction to federated learning in an industrial context: Fundamentals

With the help of data, companies are able to make more informed decisions, optimize their workflows and gain an edge in the competitive world of business using the power of Machine Learning (ML). However, handling data has become increasingly difficult...

Machine Learning
Data Science
Data
Big Data

25.8.2023 | 8 [Missing String "readingTime"]

Fighting Gandalf with magic spells (the spells are prompt injections) ...

Note: Do not attack any systems for which you do not have explicit permission to do so. In this article, I will recount the tale of outwitting a large language model by performing prompt injection attacks. Before we start, let's establish a common baseline...

IT-Security
AI

10.7.2023 | 12 [Missing String "readingTime"]

Michael Wagner

How to combine Poetry, TensorFlow, and the power of the Apple M1 GPU

In this article, we'll explore how to use the Poetry package manager to manage the dependencies of a machine learning project that makes use of the M1 GPU for TensorFlow training. We'll cover the motivation for using Poetry in this context, and we'll...

Machine Learning
Apple
Data
AI
Python

11.1.2023 | 3 [Missing String "readingTime"]

Denis Stalz-John

Python on an M1 chip: Running smoothly using Docker

I have been working as a data scientist at codecentric for several years now. Thus, my language of choice is Python and I am using it in several projects on a daily basis. Last year, I got pretty excited about the announcement of the new versions of ...

Data
Machine Learning
Apple
Python

14.2.2022 | 6 [Missing String "readingTime"]

Denis Stalz-John

Evaluating machine learning models: Establishing quality gates

The quality or usefulness of machine learning models can be evaluated using test data and metrics. However, to what extent? Manually, automated, once, regularly? Manually, the first models as the result of a proof of concept can certainly still be evaluated...

Data
Machine Learning
Software development
CI/CD

7.12.2021 | 8 [Missing String "readingTime"]

Berthold Schulte

How to use Java classes in Python

There is an old truism: “Use the right tool for the job.” However, in building software, we are often forced to nail in screws, just because the rest of the application was built with the figurative hammer Java. Of course, one of the preferred solutions...

AI
Java
Python

15.11.2021 | 8 [Missing String "readingTime"]

The universal recommender in Action(ML)

IntroductionRecommender systems have become crucial for many different businesses. E-commerce uses recommenders to guide their customers in finding the right products and to assure they stay on the site. Newspapers or entertainment websites want to keep...

AI
NoSQL
Data
Machine Learning
Python

18.4.2021 | 11 [Missing String "readingTime"]

Francesca Diana

NER with little data? Transformers to the rescue!

How do you solve deep learning problems with too little labelled data? The answer, of course, is transfer learning. In this post, we will apply this concept to named entity recognition (NER) andfine-tune a pre-trained BERT to extract information from...

Data
Machine Learning
AI
NLP
Agile transformation

14.12.2020 | 8 [Missing String "readingTime"]

Take control of named entity recognition with your own Keras model!

This post shows how to extract information from text documents with the high-level deep learning library Keras : we build, train and evaluate a bidirectional LSTM model by hand for a custom named entity recognition (NER) task on legal texts.In a previous...

Data
Python
AI
NLP
Machine Learning

13.11.2020 | 9 [Missing String "readingTime"]

NER @ CLI: Custom-named entity recognition with spaCy in four lines

Named entity recognition is a technical term for a solution to a key automation problem: extraction of information from text. Applications includeautomation of business processes involving documentsdistillation of data from the web by scraping websitesindexing...

Data
AI
NLP
Machine Learning

6.11.2020 | 9 [Missing String "readingTime"]

DISH-O-TRON – Train that vision model!

With this article we continue our endeavor of building dish-o-tron – an AI system designed to prevent the sudden appearance of dirty dishes in the community kitchen sink, and hence turning the community kitchen into a place of peace and harmony.This ...

AI
Computer Vision

11.10.2020 | 11 [Missing String "readingTime"]

Marcel Mikl

DISH-O-TRON – Gather that DATA you must!

This is the second article in our dish-o-tron series (a non-standard Deep Learning tutorial) in which we tackle one of the biggest problems in community kitchens: coming across someone else’s dirty dishes. We are facing this problem by building a state...

AI
Computer Vision
Machine Learning

24.9.2020 | 11 [Missing String "readingTime"]

Marcel Mikl

DISH-O-TRON – No more dirty dishes thanks to AI

Sadly, to tell you the truth, doing dishes is still a thing. However, so far most of our readers still like our non-standard Deep Learning tutorial.Typically, AI is demonstrated as solving various toy problems. AI plays chess and Go, AI plays video games...

10.9.2020 | 7 [Missing String "readingTime"]

Marcel Mikl

Why user-oriented development is so important – the story of tactics.ai

In this blog post, we want to give you an insight into the product development of tactics.ai. Our initial idea was a data-driven football analysis tool that applies machine learning techniques to analyze the strengths and weaknesses of opponents and ...

Agile
AI
Startup
Machine Learning
Product management

23.8.2020 | 8 [Missing String "readingTime"]

Denis Stalz-John

Neural Compute Stick: Object Detection with Neural Networks

Movidius Neural Compute Stick

Preparation and setup:

Requirements for the development platform:

Requirements to run the example code for the NCS:

Installing Toolkit and API:

Object detection with Tiny YOLO

Setup Tiny YOLO on the NCS

Tiny YOLO on the NCS

Was this post helpful?

Blog author

Your job at codecentric?

Agile Developer und Consultant (w/d/m)

More articles in this subject area

Open Source hits Billion-Dollar Market: DeepSeek-R1 is shaking up the ...

How we can hack an AI with just a few words

Simplifying LLM Application Development: A Newcomer's Perspective

Function Calling with GPT Models

Answer questions about your documents with OpenAI and Pinecone

An introduction to federated learning in an industrial context: Advanced

An introduction to federated learning in an industrial context: Fundamentals

Fighting Gandalf with magic spells (the spells are prompt injections) ...

How to combine Poetry, TensorFlow, and the power of the Apple M1 GPU

Python on an M1 chip: Running smoothly using Docker

Evaluating machine learning models: Establishing quality gates

How to use Java classes in Python

The universal recommender in Action(ML)

NER with little data? Transformers to the rescue!

Take control of named entity recognition with your own Keras model!

NER @ CLI: Custom-named entity recognition with spaCy in four lines

DISH-O-TRON – Train that vision model!

DISH-O-TRON – Gather that DATA you must!

DISH-O-TRON – No more dirty dishes thanks to AI

Why user-oriented development is so important – the story of tactics.ai