OpenShift project backups

13.9.2018 | 7 minutes reading time

Dr Jekyll’s potion famously owes its effectiveness to an ‘unknown impurity’. This is why, at the end of Stevenson’s tale, the protagonist has to confess to himself and the world that he will never regain control of his destructive alter ego. Configuration errors sometimes present a similar problem. It can be very hard to figure out why an earlier, throwaway version of a service worked when our painstaking attempts to recreate it fail. As I hope to show, creating regular backups of our projects can help.

I’d like to distinguish between two kinds of backup here. On the one hand, there’s a spare vial in the fridge. Its contents match the original potion exactly. This is essentially a database snapshot. On the other hand, there’s a laboratory analysis of the original potion, which represents our only chance of identifying the ‘unknown impurity’.

In many cases, the vial in the fridge is what is needed. Its direct equivalent in the Kubernetes world is a database backup of the master’s etcd store. I want to concentrate instead on the laboratory analysis. It is less convenient when time is short, but it does offer a clear, human-readable glimpse of a particular moment in time when our service was working correctly.

While this approach will probably not allow you to restore the entire cluster to a working state, it enables you to look at an individual project, dissect its parts and hopefully identify the tiny, inadvertent configuration change that separates a failed deployment from a successful one.

There is no need to lock the database prior to taking the backup. We are exporting individual objects to pretty-printed JSON, not dumping blobs.

Why, considering our infrastructure is expressed in code, should we go to the trouble of requesting laboratory analyses? Surely the recipe will suffice as everything of consequence is persisted in Git? The reason is that too often the aspiration to achieve parity between code and infrastructure is never realised. Few of us can say that we never configure services manually (a port changed here, a health check adjusted there); even fewer can claim that we regularly tear down and rebuild our clusters from scratch. If we consider ourselves safe from Dr Jekyll’s error, we may well be deluding ourselves.

Project export

Our starting point is the script project_export.sh in the repository openshift/openshift-ansible-contrib. We will use a substantially modified version (see the pull request, now merged).

One of the strengths of the Kubernetes object store is that its contents are serialisable and lend themselves to filtering using standard tools. We decide which objects we deem interesting and we also decide which fields can be skipped. For example, the housekeeping information stored in the .status property is usually a good candidate for deletion.

oc export has been deprecated, so we use oc get -o json (followed by jq pruning) to export object definitions. Take pods, for example. Most pod properties are worth backing up, but some are dispensable: they include not only a pod’s .status, but also its .metadata.uid, .metadata.selfLink, .metadata.resourceVersion, .metadata.creationTimestamp and .metadata.generation fields.
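As a rough illustration of the pruning step, the following sketch deletes the fields listed above from every item of a list returned by oc get -o json. The helper name prune_list is hypothetical and the filter is my own wording rather than the one used in the export script.

```shell
# Sketch only: prune_list is a hypothetical helper, not part of the export script.
# It reads a JSON list (as produced by `oc get <type> -o json`) on stdin and
# removes the housekeeping fields from every item.
prune_list() {
  jq '.items[] |= del(
        .status,
        .metadata.uid,
        .metadata.selfLink,
        .metadata.resourceVersion,
        .metadata.creationTimestamp,
        .metadata.generation
      )'
}

# Usage (requires a logged-in oc client; the output path is an assumption):
#   oc get pods -n myproject -o json | prune_list > myproject/pods.json
```

Deleting a field that is absent is harmless in jq, so the same filter can be applied to resource types that lack, say, a .status property.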

Some caveats are in order. We store pod and replication controller definitions, yet we also store deployment configurations. Clearly the third is perfectly capable of creating the first two. Still, rather than second-guess a given deployment sequence, the backup comprises all three. It is after all possible that the pod definition (its readinessProbe property, for example) has been modified. The resulting history may be repetitive, but we cannot rule out the possibility of a significant yet unseen change.

Another important caveat is that this approach does not back up images or application data (whether stored ephemerally or persistently on disk). It complements full disk backups, but it cannot take their place.

Why not use the original export script? The pull request addresses three central issues: it continues (with a warning) when the cluster does not recognise a resource type, thus supporting older OpenShift versions. It also skips resource types when the system denies access to the user or service account running the export, thus adding support for non-admin users. (Usually the export will be run by a service account, and denying the service account access to secrets is a legitimate choice.) Finally, it always produces valid JSON. The stacked JSON output of the original is supported by jq and indeed oc, but expecting processors to accept invalid, stacked JSON is a risky choice for backup purposes. python -m json.tool, for instance, requires valid JSON input and rejects the output of the original script. Stacked JSON may be an excellent choice for chunked streaming (log messages come to mind) but here it seems out of place.
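To make the difference between stacked and valid JSON concrete, here is one way to turn a stream of stacked documents into a single JSON array using jq's slurp option. This is a sketch under my own assumptions (the helper name stacked_to_array and the file names are hypothetical); the merged pull request may well solve the problem differently.

```shell
# Sketch only: stacked_to_array is a hypothetical helper, not part of the export script.
# Stacked JSON is a sequence of documents with no enclosing array, for example:
#   {"kind":"Pod"}
#   {"kind":"Service"}
# jq's --slurp option reads the whole stream and emits one JSON array.
stacked_to_array() {
  jq --slurp '.'
}

# Usage (file names are assumptions):
#   stacked_to_array < stacked.json > valid.json
#   python -m json.tool valid.json
```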

Backup schedule

Now that the process of exporting the resources is settled, we can automate it. Let’s assume that we want the export to run nightly. We want to zip up the output, add a date stamp and write it to persistent storage. If that succeeds, we finish by rotating the backup archives, that is, deleting all exports older than a week. The parameters (when and how often the export runs, the retention period, and so on) are passed to the template at creation time.
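The archive and rotation steps can be sketched in a few lines of shell. The function names, the directory arguments and the default of seven days are assumptions made for illustration; only the archive naming pattern follows the listing shown later in this post.

```shell
# Sketch only: function names, paths and the 7-day default are assumptions.

# Build a date-stamped archive name such as openshift-backup20180913.zip.
archive_name() {
  printf 'openshift-backup%s.zip\n' "$(date +%Y%m%d)"
}

# Zip up an export directory and write the archive to the backup volume.
# backup_dir must be an absolute path because we change directory first.
archive_export() {
  export_dir=$1
  backup_dir=$2
  (cd "$export_dir" && zip -qr "$backup_dir/$(archive_name)" .)
}

# Delete archives that have outlived the retention period.
# With find, `-mtime +N` matches files whose age in whole days exceeds N.
rotate_archives() {
  backup_dir=$1
  retention_days=${2:-7}
  find "$backup_dir" -name 'openshift-backup*.zip' -type f \
    -mtime +"$retention_days" -exec rm -f {} +
}

# Usage: rotate only if archiving succeeded.
#   archive_export /tmp/export /openshift-backup && rotate_archives /openshift-backup 7
```

Chaining the two calls with && keeps older archives in place whenever the new one could not be written.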

Let’s say we are up and running. What is happening in our backup project?

Fig. 1: Backup service (schematic overview of the backup service)

A nightly CronJob object instantiates a pod that runs the script project_export.sh. Its sole dependencies are oc and jq. It’s tempting at first glance to equip this pod with the ability to restore the exported object definitions, but that would require sweeping write access to the cluster. As mentioned earlier, the pod writes its output to persistent storage. The volume’s access mode is ReadWriteMany, so we can access our files whether an export is currently running or not. Use the spare pod deployed alongside the CronJob object to retrieve the backup archives:

$ oc project cluster-backup
$ POD=$(oc get po | grep Running | cut -d' ' -f1)
$ oc exec ${POD} -- ls -1 /openshift-backup
openshift-backup20180911.zip
openshift-backup20180912.zip
openshift-backup20180913.zip
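Listing the archives is only half of the job. To copy them to a workstation, something along the following lines should work, assuming the spare pod's image supports oc rsync (it needs rsync or tar inside the container). The helper name fetch_backups and the local target directory are hypothetical.

```shell
# Sketch only: fetch_backups and the default target directory are assumptions.
# `oc rsync` copies a directory between a pod and the local machine.
fetch_backups() {
  pod=$1
  dest=${2:-./openshift-backup}
  mkdir -p "$dest"
  oc rsync "${pod}:/openshift-backup/" "$dest"
}

# Usage:
#   fetch_backups "${POD}" ./backups
```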

Policy

The permissions aspect is crucial here. The pod’s service account is granted the cluster-reader role and an additional, bespoke cluster role, secret-reader. It is defined as follows:

kind: ClusterRole
apiVersion: v1
metadata:
  name: ${NAME}-secret-reader
rules:
- apiGroups: [""]
  resources: ["secrets"]
  verbs: ["get", "list"]

Perhaps the greatest benefit of custom cluster roles is that they remove the temptation to grant cluster-admin rights to a service account.

The export should not fail just because we decide that a given resource type (e.g. secrets or routes) is out of bounds. Nor should it be necessary to comment out parts of the export script. To restrict access, simply modify the service account’s permissions. For each resource type, the script checks whether access is possible and exports only resources the service account can view.
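The per-type access check might look roughly like this. It is a sketch rather than the script's actual implementation: the function names and the output file layout are assumptions, and the check relies on oc auth can-i printing "yes" when the action is permitted.

```shell
# Sketch only: can_export and export_types are hypothetical names, and the
# real script may perform the check differently.

# Succeeds if the current user or service account may list the resource type.
can_export() {
  [ "$(oc auth can-i list "$1" -n "$2" 2>/dev/null)" = "yes" ]
}

# Export every permitted resource type; skip the rest with a warning.
# Usage: export_types <project> <resource-type>...
export_types() {
  project=$1
  shift
  mkdir -p "$project"
  for resource in "$@"; do
    if can_export "$resource" "$project"; then
      echo "Exporting '$resource' resources to $project/$resource.json"
      oc get "$resource" -n "$project" -o json > "$project/$resource.json"
    else
      echo "Skipped '$resource': access denied" >&2
    fi
  done
}

# Usage (requires a logged-in oc client):
#   export_types myproject pods services secrets
```

With this arrangement, withdrawing the secret-reader role from the service account is enough to exclude secrets from the next export; the script itself stays untouched.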

Fig. 2: Permissions (an alternative overview centred on rights and permissions)

Administrator permissions are required only to create the project at the outset. The expectation is that this would be done by an authenticated user rather than a service account. As Fig. 2 illustrates, the pod that does the actual work is given security context constraint ‘restricted’ and security context ‘non-privileged’. For the most part, the pod’s service account has read access to the etcd object store and write access to its persistent volume.

How to get started, and why

To set up your own backup service, enter:

$ git clone https://github.com/gerald1248/openshift-backup
$ make -C openshift-backup

If you’d rather not wait until tomorrow, set the permanent pod’s name in the variable POD as before and enter:

$ oc exec ${POD} -- openshift-backup
Exporting 'rc' resources to myproject/rcs.json
Exporting 'rolebindings' resources to myproject/rolebindings.json
Skipped: list empty
Exporting 'serviceaccounts' resources to myproject/serviceaccounts.json
...

Please check that the output has been written to /openshift-backup as intended. You can use the script project_import.sh (found next to project_export.sh in the openshift/openshift-ansible-contrib repository) to restore one project at a time. However, in most cases it will be preferable to use this backup as an analytical tool, and restore individual objects as required.

It’s worth considering the sheer number of objects the object store holds for a typical project. Each of them could have been edited manually or patched programmatically. An object could also lack certain properties that are present in the version stored in Git: Kubernetes is prone to dropping incorrectly indented properties at object creation time.
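One way to hunt for such differences is to compare two exports of the same resource type, for instance from consecutive nights. The sketch below normalises both files with jq so that key order and whitespace do not register as changes. The helper name diff_exports and the file names in the usage line are hypothetical.

```shell
# Sketch only: diff_exports is a hypothetical helper; file names are assumptions.
# Both files are normalised with `jq --sort-keys` before being compared, so
# only genuine changes in content show up in the diff.
# Returns 0 if the exports match and 1 if they differ.
diff_exports() {
  old_file=$1
  new_file=$2
  old_tmp=$(mktemp)
  new_tmp=$(mktemp)
  jq --sort-keys '.' "$old_file" > "$old_tmp"
  jq --sort-keys '.' "$new_file" > "$new_tmp"
  if diff -u "$old_tmp" "$new_tmp"; then
    status=0
  else
    status=1
  fi
  rm -f "$old_tmp" "$new_tmp"
  return "$status"
}

# Usage: compare exports unpacked from two nightly archives.
#   diff_exports 20180912/myproject/pods.json 20180913/myproject/pods.json
```

The same helper can be pointed at an export and a JSON definition checked out from Git, provided both have been pruned of housekeeping fields in the same way.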

In short, there is ample scope for ‘unknown impurities’. Given how few computing resources are required, and how little space a week’s worth of project backups takes up, I would suggest that there is every reason to have a laboratory analysis to hand when the vials in the fridge run out.

Blog author

Gerald Schmidt
