Portability between deep learning frameworks – with ONNX

27.8.2019 | 6 minutes reading time

In recent years, the number of frameworks for deep learning has exploded. Companies such as Google, Facebook and Amazon have made their deep learning frameworks TensorFlow , PyTorch and MXNet available open-source or are actively involved in developing them. Each of these frameworks has different advantages and disadvantages, which have different consequences for development and commissioning. This article introduces Open Neural Network Exchange (ONNX) a model standard that makes it possible to exchange models between frameworks. Through the interoperability, we can use the advantages of a framework based on the situation to the fullest.

Deep learning frameworks: Background

Framework trends across PyTorch, Caffe2, TensorFlow, Theano

Deep Learning Framework trends: PyTorch, Caffe2, TensorFlow, Theano

In recent years the framework Theano has been heavily used. Nowadays, there isn’t any further development of the framework. Currently, we don’t know which frameworks which will establish themselves and which will disappear. Every framework has a different background and purpose of solving. Some of the frameworks were designed for research, while others were intended for production purpose. Besides the deep learning libraries, there are numerical frameworks to optimise the operations based on the hardware. The choosing of the numerical library will have an impact on the runtime of the models.

Hard manufacturers such as NVIDIA and INTEL are developing the frameworks to run the models as efficiently as possible on GPUs or CPUs.

Companies that want to implement deep learning in their daily business are overwhelmed by the range of possibilities. The selection of a framework can have severe consequences for different areas of the company. The speed of innovation can suffer significant losses, as the commissioning of a model may be delayed after model development. One reason for this may be that the chosen framework is designed more for development than for production.

Deep Learning Zoo

The graphic above shows a small selection of the deep learning framework Zoo and its technical possibilities. A general problem among the frameworks is the portability of the models to another framework. The interoperability allows the advantages of the different frameworks to be used depending on the phase, whether development or commissioning. For example, PyTorch is ideally suited for prototype development and experimentation of the models, while TensorFlow Serving provides an easy way to deploy a TensorFlow model.

Open Neural Network Exchange (ONNX)

Framework interoperability with ONNX

In 2017, Microsoft, Facebook and Amazon joined forces to solve the challenge of model portability. The result is the new standard Open Neural Network Exchange (ONNX). The vision behind ONNX is to export a model developed with framework A and import it into framework B without any problems. Here you can find a list of supported frameworks.

Seeing deep learning libraries from a very abstract perspective, one of the main difference is the way data is flowing through the operations. In TensorFlow and Caffe2 we are using a static graph to run computations. In PyTorch we are using a dynamic graph. The choose of the computation model can lead to some differences in programming and runtime. However, this is not an issue for the ONNX standard. Through the interfaces of the libraries, the relevant information like structure and weights can be extracted and transformed. The ONNX specification consists of these three essential components that enable import and export:

An extensible calculation graph
Fixed operators and functions
Defined standard data types

The exact definition with its details can be found inside the Github repository onnx/onnx .

MNIST Example

MNIST trained model from PyTorch to TensorFlow with ONNX

To get to know ONNX a little better, we will take a look at a practical example with PyTorch and TensorFlow. We are training a model in PyTorch that we convert to ONNX. Then the ONNX transformed model is loaded into TensorFlow to run inference. We are using MNIST dataset . Python3 and pip3 are required to perform the tutorial. We are installing the needed packages with pip3:

First, we define the neural network architecture with PyTorch. Our chosen architecture consists of two convolutional layers and two fully connected layers. We are using the activation function ReLU and a max pooling layer. The input data is an image with only one colour channel.

In the main() function, we are putting the essential parts together. It is necessary to save the weights with torch.save(model.state_dict(), file) after the training. The full training, test and main() functions can be read in the repository .

Before we export the model to ONNX, we need to read it back into PyTorch. Then it is necessary to define a dummy_input as the input vectors of the model. The dummy_input is required since PyTorch is using a dynamic input and ONNX requires a static one.

The model can be read by onnx.load(file). Via the prepare(model)-method of the onnx/onnx-tensorflow package the weights are bound to a static graph.

Afterwards, we can run to predictions in the TensorFlow runtime environment. For the preprocessing, we need to scale the image to 28×28 pixels and converted to Greyscale. Then we convert the datatype of the array to Float32 and transform the axes to the required dimensions of the input tensor.

Limits of ONNX

At first glance, the ONNX standard is an easy-to-use way to ensure the portability of models. The use of ONNX is straightforward as long as we provide these two conditions:

We are using supported data types and operations of the ONNX specification.
We don’t do any custom development in terms of specific custom layers/operations.

Furthermore, we need to double-check that the used operations and functions are implemented in the backends for the export and import.

The ONNX project is developing at a rapid pace and is continually releasing new versions that enhance the compatibility between the frameworks. If a project is carried out within this framework, the use of ONNX is entirely unproblematic.

If these conditions are not met, the functionality has to be implemented in the ONNX backends themselves to use it. The custom implementation can turn out to be very time-consuming and laborious.

Summary

The need for model portability is greater than ever. There are more and more deep learning frameworks on the market and the portability allows the advantages of the individual frameworks to be better exploited. ONNX is an easy-to-use framework that has a lot of potentials to be the standard for exchanging models between libraries. This ensures that developed models can be used flexibly and over the long term. Furthermore, the results of the research can go into production faster as long as the supported data types and operations are used by ONNX. Otherwise, they must be implemented in ONNX.

The German version of this post can be found here . Check out more posts on deep learning on our blog .

Was this post helpful?

Blog author

Nico Axtmann

Do you still have questions? Just send me a message.

fromNico Axtmann

Core ML – inference on iOS

In machine learning, we are training a model for a particular task, e.g. distinguishing dogs and cats in pictures. Inference refers to the application of the model. Most of the inference applications are addressed via a client-server API or used in batch...

AI
Data
iOS
Machine Learning
Mobile

19.8.2019 | 7 minutes reading time

Nico Axtmann

Your job at codecentric?

Jobs

Agile Developer und Consultant (w/d/m)

Alle Standorte

Using Dagster with DuckDB

DuckDB has rapidly emerged as a popular in-process analytics database. Dagster, on the other hand, is a modern data orchestration framework that makes it easy to build and manage data pipelines. Combining Dagster with DuckDB allows data engineers to ...

Data

16.5.2025 | 4 minutes reading time

Hendrik Kamp

Querying Databricks Delta Tables in Motherduck

Intro In a previous article, my colleague Matthias Niehoff demonstrated how duckdb can serve as a viable alternative to Spark for processing data stored in Databricks, specifically by directly accessing the Unity Catalog. Building upon that, a next ...

Data

25.4.2025 | 4 minutes reading time

Hendrik Kamp

Introducing Data Interface Quadrants (DIQs)

In today’s rapidly evolving, data-driven world, organisations face an increasingly complex challenge: how to design, implement, and manage data interfaces that meet both immediate operational demands and long-term strategic business objectives. A data...

API
Data

30.1.2025 | 8 minutes reading time

Daniel Kocot

Miriam Greis

Open Source hits Billion-Dollar Market: DeepSeek-R1 is shaking up the ...

On January 27, 2025, the technology stock exchange experienced an unexpected crash: The NVIDIA stock price plummeted by over 17%, temporarily wiping out nearly $600 billion in market value and setting a new historical record in the stock market. Many...

AI
Generative AI
LLM

29.1.2025 | 8 minutes reading time

How we can hack an AI with just a few words

How we can hack an AI with just a few words Artificial intelligence (AI) has undergone an astonishing transformation in recent years and is now present in many areas of life. Whether in the form of chatbots that help us with everyday questions or generative...

IT-Security
AI

27.1.2025 | 4 minutes reading time

Access Databricks UnityCatalog from duckdb

Databricks is a great platform when it comes to data management and governance, mostly due to the unity catalog. But Spark as an engine for processing the data is just ok'ish, especially when data is not really big. New engines like polars, datafusion...

Data

20.1.2025 | 5 minutes reading time

Matthias Niehoff

Charge your APIs Volume 36 - Trends for 2025

As 2025 approaches, new trends are emerging in the world of APIs. After 2024 was user-centric, the focus is now shifting back to developer needs and increasing productivity. APIs are evolving and the technologies surrounding them are becoming more powerful...

Integration
API
Data
Software architecture

11.12.2024 | 5 minutes reading time

Daniel Kocot

Simplifying LLM Application Development: A Newcomer's Perspective

I. Introduction Large Language Models (LLMs) have become highly popular due to their transformative impact on various fields, especially within IT. They enable developers to create innovative software applications centered around AI interactions, offering...

Generative AI
AI

6.12.2024 | 13 minutes reading time

Function Calling with GPT Models

GenAI is a powerful tool for generating content and interacting with applications using natural language. However, this tool also has significant limitations when you plan to use it in your own software. GenAI's knowledge is limited to information that...

Generative AI
AI
LLM

6.9.2024 | 5 minutes reading time

When Business Meets Technology: From Data Product to Data Architecture...

Abstract The Data Product Canvas (DPC) is a tool for the lightweight and iterative definition of data products. It increases the efficiency of product definition by clearly presenting the key impact areas on data products. Additionally, the DPC motivates...

Software architecture
Data
DDD
Digital product developement

6.8.2024 | 24 minutes reading time

Dr. Florian Rademacher

Charge your APIs Volume 28: Empowering application and data integration...

In today's fast-paced world, seamless application and data integration is crucial for organisational success. This blog explores how frameworks like Maslow's Pyramid, Team Topologies, Evolutionary Architectures, API Federation, and API Marketplaces, ...

API
Data
Integration

25.7.2024 | 8 minutes reading time

Daniel Kocot

Data for the Masses Volume 2: Data Products, Data Contracts and API Contracts

The pillars of modern data architectures as success factors for organisations In the digital economy, a well-thought-out data architecture and the efficient use of data are crucial for organisational success. Data products, data contracts and API contracts...

Data
API

13.6.2024 | 7 minutes reading time

Daniel Kocot

Becoming a Data-Driven Company with Applied Data Products

In recent years, the hype surrounding the value of data has grown continuously, and a multitude of concepts and methods have emerged on how companies can become 'data-driven'. From strategic top management to detail-oriented data analysts attempts are...

Agile
Big Data
Data
Product management
Digitalization
Data Science
Business Intelligence

18.5.2024 | 9 minutes reading time

Dr. Florian Rademacher

A/B Testing: Tool support and testing GrowthBook

In the previous blog post we introduced some general concepts of A/B testing: we explored the main aspects, defined test types and explained the most common statistical methods. Now we want to explore the areas in which A/B testing tools can provide...

Testing
Python
Data
UX/UI
Analysis
JavaScript

18.3.2024 | 20 minutes reading time

Francesca Diana

A/B Testing: An introduction

This blog series aims to aid teams who are contemplating adding A/B testing to their toolkit but are unsure of which tool to use. In addition to helping with tool selection, the series also provides the entire team with a consistent initial understanding...

Testing
Data
UX/UI
Analysis

6.2.2024 | 29 minutes reading time

Francesca Diana

Data for the Masses Volume 1: The Digital Product Passport - A Key Element...

The Digital Product Passport represents a significant shift for digital units within organisations, compelling them to ensure comprehensive data transparency. This tool not only serves as a product's digital fingerprint but also opens up new dimensions...

Data
Product management

25.1.2024 | 7 minutes reading time

Daniel Kocot

Answer questions about your documents with OpenAI and Pinecone

In recent years, large language models (LLMs) have made remarkable progress in interacting with humans, showcasing their ability to answer a wide array of questions. Trained on publicly accessible internet content, these models have broad knowledge across...

13.11.2023 | 12 minutes reading time

Lukas Lehmann

Charge your APIs: NordicAPIs Platform Summit Edition - API first ... not...

In the ever-evolving landscape of software development, buzzwords and paradigms come and go. One such term that has gained significant traction in recent years is "API-First Development." It's been hailed as the holy grail of modern software engineering...

API
Data

19.10.2023 | 5 minutes reading time

Daniel Kocot

An introduction to federated learning in an industrial context: Advanced

In the Machine Learning space, it was long believed that sharing learnings or weights was safe in the sense that the input data couldn't be extracted. However, this belief has been challenged by researchers coming out over the years. Nowadays, numerous...

Machine Learning
Big Data
Data Science
Data

18.9.2023 | 9 minutes reading time

An introduction to federated learning in an industrial context: Fundamentals

With the help of data, companies are able to make more informed decisions, optimize their workflows and gain an edge in the competitive world of business using the power of Machine Learning (ML). However, handling data has become increasingly difficult...

Machine Learning
Data Science
Data
Big Data

25.8.2023 | 8 minutes reading time

Portability between deep learning frameworks – with ONNX

Deep learning frameworks: Background

Open Neural Network Exchange (ONNX)

MNIST Example

Limits of ONNX

Summary

Was this post helpful?

Blog author

More articles

Core ML – inference on iOS

Your job at codecentric?

Agile Developer und Consultant (w/d/m)

More articles in this subject area

Using Dagster with DuckDB

Querying Databricks Delta Tables in Motherduck

Introducing Data Interface Quadrants (DIQs)

Open Source hits Billion-Dollar Market: DeepSeek-R1 is shaking up the ...

How we can hack an AI with just a few words

Access Databricks UnityCatalog from duckdb

Charge your APIs Volume 36 - Trends for 2025

Simplifying LLM Application Development: A Newcomer's Perspective

Function Calling with GPT Models

When Business Meets Technology: From Data Product to Data Architecture...

Charge your APIs Volume 28: Empowering application and data integration...

Data for the Masses Volume 2: Data Products, Data Contracts and API Contracts

Becoming a Data-Driven Company with Applied Data Products

A/B Testing: Tool support and testing GrowthBook

A/B Testing: An introduction

Data for the Masses Volume 1: The Digital Product Passport - A Key Element...

Answer questions about your documents with OpenAI and Pinecone

Charge your APIs: NordicAPIs Platform Summit Edition - API first ... not...

An introduction to federated learning in an industrial context: Advanced

An introduction to federated learning in an industrial context: Fundamentals