
True KVM Live Migration with OpenStack Icehouse and Ceph based VM storage

16.3.2015 | 12 minutes of reading time

Intro

As mentioned before, for example in Fabian's post The CenterDevice Cloud Architecture Revisited from December 2014, our document management product CenterDevice runs on top of infrastructure virtualized with OpenStack.
Where that older post was more application-focused, this one covers a particularly nasty problem that plagued us for some time: being unable to migrate virtual machines from one bare metal hypervisor host to another without interruption. By the end of this article you will see how we overcame a series of obstacles on the way to successful live migrations in OpenStack Icehouse for KVM virtual machines using Ceph/Rados Block Device based volumes for data storage.

System setup

At the time of writing our cluster sports 12 bare metal servers: 6 dedicated OpenStack compute nodes, 4 Ceph storage cluster nodes, and 2 OpenStack controllers.
All storage is provided to virtual machines as OpenStack Cinder volumes backed by Ceph virtual block devices. One of the main reasons for this setup is that it allows virtual machines to be moved easily from one physical host to the next without also having to drag large amounts of storage across the network.
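For reference, the relevant part of such a Ceph backed Cinder setup looks roughly like the following cinder.conf excerpt. This is only a minimal sketch; the pool name, user name and secret UUID are placeholders, not necessarily our actual values:

volume_driver=cinder.volume.drivers.rbd.RBDDriver
rbd_pool=volumes
rbd_user=cinder
rbd_secret_uuid=<libvirt secret UUID>
# Lets Cinder create copy-on-write clones from Glance images stored in RBD
glance_api_version=2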

Migrating Virtual Machines

OpenStack by default enables “regular” migrations, i.e. migrations where a virtual machine needs to be shut down and then rebooted on another host. This incurs a service interruption inside the virtual machine. Ideally you would want to be able to seamlessly move the VM across physical servers without the OS and software inside it even noticing. Depending on the hypervisor type and the surrounding setup this is generally feasible.
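For completeness, such a regular (cold) migration can be triggered with the standard nova client; roughly like this, using the same example instance that appears further below:

# Cold migration: Nova shuts the instance down, moves it and boots it elsewhere
nova migrate dstest
# Once the instance has been verified on its new host, confirm the move
nova resize-confirm dstest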

With the instance to be migrated (the source VM) still running, its memory content is sent to the destination host. The source hypervisor keeps track of which memory pages are modified on the source while the transfer is in progress. Once the initial bulk transfer is complete, pages changed in the meantime are transferred again. This is done repeatedly with (ideally) ever smaller increments.

Provided these increments can be transferred faster than the source VM dirties its memory, the remaining difference eventually becomes small enough that the source VM is briefly suspended. The final differences are sent to the target host and an identical machine is started there. At the same time the virtual network infrastructure takes care of directing all traffic to the new virtual machine. Once the replacement machine is running, the suspended source instance is deleted. Usually the actual handover happens so quickly and seamlessly that only very time-sensitive applications notice anything at all.

Since only memory is transferred, a prerequisite for this kind of live migration is shared storage, for which we use Ceph. OpenStack supports this, but you need to enable “true live migration” as described in the OpenStack Admin Guide. It boils down to adding the following to the /etc/nova/nova.conf file:

live_migration_flag=VIR_MIGRATE_UNDEFINE_SOURCE,VIR_MIGRATE_PEER2PEER,VIR_MIGRATE_LIVE,VIR_MIGRATE_TUNNELLED

Sounds easy enough, so where’s the catch?

Problem #1

With our cluster set up as described above, this is what happened when I tried to live-migrate a VM from one host to the next:

[daniel.schneller@control01]➜ nova list
+--------------+--------+--------+------------+-------------+---------------------+
| ID           | Name   | Status | Task State | Power State | Networks            |
+--------------+--------+--------+------------+-------------+---------------------+
| a1564ec8-... | dstest | ACTIVE | -          | Running     | testnet=192.168.1.2 |
+--------------+--------+--------+------------+-------------+---------------------+

[daniel.schneller@control01]➜ nova live-migration dstest node10

[daniel.schneller@control01]➜ tail -n20 /var/log/nova/nova-compute.log
Live migration of instance a1564ec8-... to host node10 failed
Traceback (most recent call last):
  File "/usr/lib/python2.7/dist-packages/nova/api/openstack/compute/contrib/admin_actions.py", line 282, in _migrate_live
    disk_over_commit, host)
  File "/usr/lib/python2.7/dist-packages/nova/compute/api.py", line 94, in inner
    return f(self, context, instance, *args, **kw)
  File "/usr/lib/python2.7/dist-packages/nova/compute/api.py", line 1960, in live_migrate
    disk_over_commit, instance, host)
  File "/usr/lib/python2.7/dist-packages/nova/scheduler/rpcapi.py", line 96, in live_migration
    dest=dest))
  File "/usr/lib/python2.7/dist-packages/nova/openstack/common/rpc/proxy.py", line 80, in call
    return rpc.call(context, self._get_topic(topic), msg, timeout)
  File "/usr/lib/python2.7/dist-packages/nova/openstack/common/rpc/__init__.py", line 102, in call
    return _get_impl().call(cfg.CONF, context, topic, msg, timeout)
  File "/usr/lib/python2.7/dist-packages/nova/openstack/common/rpc/impl_kombu.py", line 712, in call
    rpc_amqp.get_connection_pool(conf, Connection))
  File "/usr/lib/python2.7/dist-packages/nova/openstack/common/rpc/amqp.py", line 368, in call
    rv = list(rv)
  File "/usr/lib/python2.7/dist-packages/nova/openstack/common/rpc/amqp.py", line 336, in __iter__
    raise result
RemoteError: Remote error: InvalidCPUInfo_Remote Unacceptable CPU info: CPU doesn't have compatibility.

Notice the last line: apparently there is some difference between the CPUs. So let us see what kinds of CPUs the hypervisors have (some lines removed for brevity). First, the source host the virtual machine currently lives on:

[daniel.schneller@node05]➜ cat /proc/cpuinfo
vendor_id       : GenuineIntel
cpu family      : 6
model           : 45
model name      : Intel(R) Xeon(R) CPU E5-2670 0 @ 2.60GHz
stepping        : 7
cpuid level     : 13
flags           : fpu … (many more)

Then its designated new home:

[daniel.schneller@node10]➜ cat /proc/cpuinfo
vendor_id       : GenuineIntel
cpu family      : 6
model           : 44
model name      : Intel(R) Xeon(R) CPU           X5650  @ 2.67GHz
stepping        : 2
cpuid level     : 11
flags           : fpu … (not as many as above)

The source host CPU is of more recent vintage. Unless configured otherwise, KVM will map the underlying host CPU’s features into any virtual machine that gets started on it. This is good for performance, because the guest OS can better leverage the hardware’s power. The downside is that for a live migration only hosts with an identical or more capable CPU qualify as migration targets; otherwise the guest operating system, not knowing the hardware was “hot swapped” underneath, might try to access features not present on the new host, leading to crashes. For regular migrations this is not a problem, because they involve rebooting the guest.
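Which CPU definition a particular guest was started with can be seen in its libvirt domain XML. A quick check might look like this (the domain name instance-00000042 is just an example):

# Show the <cpu> element of a running domain's libvirt definition
sudo virsh dumpxml instance-00000042 | xmllint --xpath "/domain/cpu" -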

Fix #1

In our case we gladly trade a slightly smaller CPU feature set for full migration flexibility, accepting the potentially slightly lower performance. To ensure CPU compatibility across all VMs and hypervisors we can instruct Nova/KVM to report a specific CPU model and set of features to guests. We can figure out which model that would ideally be with the following set of commands.

[daniel.schneller@control01] ~/tmp ➜ pdsh 'node0[1-9],node10' 'sudo virsh capabilities | xmllint --xpath "/capabilities/host/cpu" - > ~daniel.schneller/tmp/$(hostname).xml'
[daniel.schneller@control01] ~/tmp ➜ cat node*.xml >> all-cpus.xml
[daniel.schneller@control01] ~/tmp ➜ sudo virsh cpu-baseline all-cpus.xml
<cpu mode='custom' match='exact'>
  <model fallback='allow'>Westmere</model>
  <vendor>Intel</vendor>
  ...
</cpu>

The first command assumes a few things:

  1. I can connect to all relevant hypervisor hosts via SSH
  2. I can do password-less sudo there
  3. xmllint is installed on them
  4. My home directory resides on shared storage
  5. ~/tmp exists.

On each node it queries the hypervisor with virsh capabilities, then extracts only the relevant CPU element from the XML. The result is written into a file per host. The second command then combines all the separate XML files into a single one. The third command then uses virsh’s built-in mechanism to resolve multiple sets of CPU capabilities into a baseline that they all support.
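If the assumptions above do not hold, for example because pdsh or the shared home directory is not available, a plain SSH loop achieves the same result. A rough sketch (the host list is made up to match our naming scheme):

# Collect each hypervisor's CPU definition into a local file, then compute the baseline
for host in node01 node02 node05 node10; do
  ssh "$host" 'sudo virsh capabilities | xmllint --xpath "/capabilities/host/cpu" -' > "$host.xml"
done
cat node*.xml > all-cpus.xml
virsh cpu-baseline all-cpus.xml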

In our case we learned that Westmere describes the intersection of all host CPU features. So using Ansible I made sure that all hypervisors had the following entries in /etc/nova/nova-compute.conf:

[DEFAULT]
compute_driver=libvirt.LibvirtDriver
[libvirt]
virt_type=kvm
# Define custom CPU restriction to the lowest
# common subset of features across all hypervisors.
# Otherwise live migrations will fail when trying
# to move from a more modern CPU to an older one.
cpu_mode=custom
cpu_model=Westmere

After that the nova-compute service needs to be restarted on all hypervisors. This can be done without affecting running virtual machines, because it only restarts the service that is (among other things) responsible for spawning new instances; it is not required for already running VMs to keep working.
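On our Ubuntu based hosts this amounts to something along these lines, reusing the pdsh invocation from above:

pdsh 'node0[1-9],node10' 'sudo service nova-compute restart'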

Problem #2

Unfortunately, even with this obstacle out of the way the migration would still fail with the same error message as before. It turns out there is a problem with the CPU comparison code in Nova. See Nova Bug Ticket #1082414 for details. It boils down to the wrong set of CPUs being compared: instead of checking whether the source’s virtual CPU can be supported by the target’s real CPU, the code compares the two physical CPUs for compatibility, bringing us back to square one.

Fix #2

While the bug is going to be fixed in a newer (at the time of writing yet to be released) OpenStack version, the patch is too big to be back-ported to OpenStack Icehouse. So as an interim solution [1] I simply disabled the broken check as discussed in comments #26 and following of the above-mentioned bug.

Important: Once this patch is in place, nothing prevents you from migrating instances to incompatible hosts! Even though we specified a custom CPU model earlier (fix #1), virtual machines that were launched prior to that change cannot know about the new limitations! Before live-migrating any virtual machines you must make sure to reboot them once to make them pick up the new CPU type!
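To find instances on a given hypervisor that have not yet picked up the custom model, a loop along these lines can help (a sketch, to be run on each compute node):

# List running domains whose libvirt CPU definition does not mention Westmere yet
for dom in $(sudo virsh list --name); do
  sudo virsh dumpxml "$dom" | grep -q ">Westmere<" || echo "$dom still needs a reboot"
done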

Problem #3

So. Now we should be able to live-migrate, right? Well… Wrong…!

The next problem came down to an unfortunate oversight on our part. Even though it is listed as a requirement in the Configure migrations chapter, we did not ensure that the Nova instances directory (typically /var/lib/nova/instances) was mounted on shared storage. This led to the following error in the source host's /var/log/nova/nova-compute.log:

RemoteError: Remote error: RemoteError Remote error:
InvalidSharedStorage_Remote node10 is not on shared storage: Live
migration can not be used without shared storage.

To determine the presence of shared storage Nova performs a (too) simple check: It tries to create a temporary file in the instance directory of the virtual machine to be migrated and checks if that file can be seen at the same path on the destination host. In our case that naturally failed, because that path resides on a local drive on each hypervisor, even though the VM volumes reside on shared storage. Same as before, apparently this whole part of the code is going through major refactoring for future OpenStack releases, but that did not exactly help me.
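The check is easy to reproduce by hand, which also makes the failure obvious. Something like this (host names as in the earlier examples):

# Create a marker file in the instances path on the source host ...
[daniel.schneller@node05]➜ sudo touch /var/lib/nova/instances/sharedcheck
# ... and look for it on the destination host. With purely local disks it is not there.
[daniel.schneller@node10]➜ ls /var/lib/nova/instances/sharedcheck
ls: cannot access /var/lib/nova/instances/sharedcheck: No such file or directory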

Fix #3

I was already looking for the right spot to remove that check, too, when I came across this old mailing list thread “Live migration of VM using librbd and OpenStack”, discussing this exact issue. The final message in that thread conveniently has the right place identified already and a valuable hint thrown in for free:

Just for posterity, my ultimate solution was to patch nova on each compute host to always return True in _check_shared_storage_test_file (nova/virt/libvirt/driver.py)

This did make migration work with “nova live-migration”, with one caveat. Since Nova is assuming that /var/lib/nova/instances is on shared storage (and since I hard coded the check to say “yes, it really is”), it thinks the /var/lib/nova/instances/ folder will exist at both source and destination, and makes no attempt to create it on the destination.

This is the complete patch we apply on new compute nodes (including both the CPU check mentioned above and the shared storage workaround):

--- libvirt/driver.py.orig  2014-08-21 19:20:10.000000000 +0200
+++ libvirt/driver.py   2015-02-27 10:09:17.830455657 +0100
@@ -4234,9 +4234,10 @@
             disk_available_mb = \
                     (disk_available_gb * units.Ki) - CONF.reserved_host_disk_mb

-        # Compare CPU
-        source_cpu_info = src_compute_info['cpu_info']
-        self._compare_cpu(source_cpu_info)
+        # Compare CPU -- Daniel Schneller: Disabled due to
+        # https://bugs.launchpad.net/nova/+bug/1082414
+        # source_cpu_info = src_compute_info['cpu_info']
+        # self._compare_cpu(source_cpu_info)

         # Create file on storage, to be checked on source host
         filename = self._create_shared_storage_test_file()
@@ -4399,11 +4400,22 @@

         Cannot confirm tmpfile return False.
         """
-        tmp_file = os.path.join(CONF.instances_path, filename)
-        if not os.path.exists(tmp_file):
-            return False
-        else:
-            return True
+        # Daniel Schneller: Nova assumes live migration also
+        # implies shared storage for instance metadata (libvirt.xml)
+        # and checks this by creating a tempfile in that directory,
+        # verifying it can be seen from source and destination of
+        # the migration. This would prevent live migration for us
+        # unnecessarily. We return True here, no matter what, faking
+        # shared storage. Cleverly Nova itself even seems to copy
+        # the instance metadata over again in a later step.
+        # This will have to be reviewed in later OpenStack versions,
+        # where improved handling has already been announced.
+        return True
+        #tmp_file = os.path.join(CONF.instances_path, filename)
+        #if not os.path.exists(tmp_file):
+        #    return False
+        #else:
+        #    return True

     def _cleanup_shared_storage_test_file(self, filename):
         """Removes existence of the tmpfile under CONF.instances_path."""

As noted in the patch, in Icehouse Nova creates the console.log and libvirt.xml files on the destination hypervisor, provided the instance directory already exists. Also, since it assumes shared storage, it does not clean up the source directory once the migration is complete.

Finally!

With the above patches and modifications in place, live migration now works as follows:

  1. Determine the VM's UUID, e.g. with nova show or nova list.
  2. Pick the new destination host and create /var/lib/nova/instances/<UUID> there.
  3. Ensure the directory has the correct ownership: chown nova:nova /var/lib/nova/instances/<UUID>
  4. Perform the actual migration: nova live-migration <UUID> <destination host>
  5. Remove /var/lib/nova/instances/<UUID> from the old host (a combined example follows below).
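Put together, a complete migration then looks roughly like the following. The UUID and host names are only examples:

UUID=a1564ec8-...          # from "nova list"
DEST=node10                # chosen destination hypervisor

# Prepare the instance directory on the destination host
ssh $DEST "sudo mkdir -p /var/lib/nova/instances/$UUID && sudo chown nova:nova /var/lib/nova/instances/$UUID"

# Trigger the live migration from a controller node
nova live-migration $UUID $DEST

# Once the instance is confirmed running on the destination, clean up on the old host
ssh node05 "sudo rm -rf /var/lib/nova/instances/$UUID"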

The time needed for the migration is usually in the range of several seconds, sometimes up to a few minutes. This primarily depends on the RAM size, its rate of change inside the virtual machine, and the speed of the network connecting source and destination hypervisors.
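While a migration is running, its progress can be watched on the source hypervisor via libvirt's job statistics, for example (domain name hypothetical):

# Shows remaining memory, processed data and transfer rate of the running migration job
watch -n1 sudo virsh domjobinfo instance-00000042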

Limitations / Caveats

While the above procedure generally works flawlessly, the necessity for the manual creation and deletion of directories is unfortunate and a potential source of errors.

The CPU compatibility issue is less likely to cause trouble in the future. As we have full control over the VMs running in our cluster, we can make sure each VM gets rebooted at least once before it is migrated. And because we will most certainly not add new compute nodes with CPUs inferior to the Westmere models we presently have in our servers, the baseline feature set now configured will work fine for the foreseeable future, too.

In the coming months we will therefore probably move /var/lib/nova/instances to CephFS, which at the moment we only use for roaming home directories. Once we do that, the second part of the above patch can be reverted again.
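Conceptually that boils down to mounting a CephFS subtree at the instances path on every compute node, roughly like this (monitor address, subdirectory and credentials are made up for illustration):

# Mount a CephFS directory as the shared Nova instances path (kernel client)
sudo mount -t ceph 10.0.0.1:6789:/nova-instances /var/lib/nova/instances -o name=nova,secretfile=/etc/ceph/nova.secret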

Conclusion

In this post I compiled a comprehensive summary of how to enable true live migration in OpenStack Icehouse for KVM based virtual machines built on Ceph/RBD volumes. While the information presented is mostly available from other places on the Internet, having it all combined in one place will hopefully save someone else the tedious work of compiling it again.

Footnotes

  1. interim, adj. originally “provisional”, “limited”; in IT contexts often referring to the most permanent of all solutions. See also: Prototype 😉.
