VOOZH about

URL: https://thenewstack.io/the-whys-whens-and-wherefores-of-kubernetes-backup/

⇱ The Whys, Whens and Wherefores of Kubernetes Backup - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2022-05-09 09:00:41
The Whys, Whens and Wherefores of Kubernetes Backup
contributed,sponsor-cncf,sponsored,sponsored-post-contributed,
Data / Kubernetes / Storage

The Whys, Whens and Wherefores of Kubernetes Backup

In many ways the requirements for backing up Kubernetes are the same as for other critical IT infrastructure, but in other ways they are very different.
May 9th, 2022 9:00am by Bob Adair
👁 Featued image for: The Whys, Whens and Wherefores of Kubernetes Backup
Feature Image by Alexander Gresbek from Pixabay.
CNCF sponsored this post.
Bob Adair
Bob Adair is the product manager for CloudCasa at Catalogic Software. Bob has 30 years of experience in the IT, storage, and data protection industries. Since beginning his career in IT on Wall Street, he has held senior positions in engineering and product development at companies including Storage Networks, Veritas, Symantec, Savvis, EMC, and Dell.

Kubernetes is among the most misunderstood of new technologies when it comes to data protection. Several things contribute to this confusion, including:

  • Kubernetes’ heritage as a solution used for stateless applications,
  • the tendency for applications or even whole clusters to be deployed automatically using infrastructure as code from repositories,
  • blurred organizational lines of responsibility in clouds, and
  • the tendency for Kubernetes to be managed by development or DevOps teams without a background in traditional IT operations.

The Same, but Different

In many ways, the requirements for backing up Kubernetes are the same as for other critical IT infrastructure, but in other ways, they are very different.

With traditional server infrastructure, it is generally assumed that all servers or VMs that run production applications and all systems critical to development will be backed up. Other systems for test/QA, and staging are often excluded, but may also be backed up for convenience or to minimize possible development schedule disruptions. These backups may be done at the server or VM level, at the storage level, or more likely at both. The process, while not necessarily simple, is well understood.

KubeCon + CloudNativeCon conferences gather adopters and technologists to further the education and advancement of cloud native computing. The vendor-neutral events feature domain experts and key maintainers behind popular projects like Kubernetes, Prometheus, Envoy, CoreDNS, containerd and more.
Learn More
The latest from KubeCon + CloudNativeCon

With cloud infrastructure, configuration data for the infrastructure itself must also be backed up. If cloud infrastructure is deployed automatically using IaC tools, the repositories that contain the IaC files should be backed up. But often this isn’t done, and when it is it must be hoped that the actual configuration hasn’t diverged from the deployed configuration. Storage volumes in the cloud may include different types of replication and snapshot capabilities, but these need to be managed and are generally not a replacement for application-level backups. Running in the cloud doesn’t obviate the need for backups, it just changes the requirements.

Kubernetes adds an additional layer of complexity. A cluster is imposed on the underlying nodes or VMs, whose configuration may change over time. On top of this, containerized applications are deployed which may call for various types of persistent storage volumes and can create custom resources or otherwise modify the cluster state.

Several Methods, Same Outcome

There are multiple approaches to protecting Kubernetes and the applications that run on it, but not surprisingly the best approach is to use a solution that actually understands Kubernetes. You could use a tool that just protects the underlying Persistent Volumes (PVs) at the storage level, or you could back up the underlying nodes, which might be easy enough if they are VMs. But where would that leave you when you want to restore? There is a good chance that you may only want to restore a single namespace, or a single PV, or even a single resource such as a secret. A traditional backup tool that isn’t aware of Kubernetes will be no help with this.

It’s become common for stateful applications to run under Kubernetes, making use of persistent volumes and often even running databases on them. As with databases running on traditional server infrastructure, obtaining consistent backups can require application awareness in the form of “hooks” to quiesce the DB or application before volume snapshots are created. On Kubernetes, these hooks must also be cluster-aware so that they are directed to the proper node and container.

To Back up or Not to Back up

You might think that your Kubernetes cluster doesn’t need backups at all, because it only runs stateless applications, and everything is deployed automatically using CI/CD pipelines and IaC tools from files in a git repo. That may be true. But it may not. Are you sure you can easily rebuild that environment the way it was five minutes ago, or two weeks ago, or nine months ago if called on to do so? Are you sure there hasn’t been any configuration drift since your cluster was created by your deployment tools? And are you sure, even though you may not use PVs, that important application state isn’t being stored as Kubernetes custom resources, in CronJob entries, etc.? Are your secrets and certificates protected? Most importantly, are you sure that what your developers told you last month about the lack of application state and possible configuration drift on the cluster is still true today? If you opt to forgo backups, be sure to check with all stakeholders on an ongoing basis that it is still appropriate. Then ignore what they tell you and periodically test a full rebuild from scratch. It’s better to be safe than sorry.

High Availability vs. Backup

Kubernetes and cloud infrastructure together can provide an excellent high-availability platform for your applications. Infrastructure redundancy and replication across multiple availability zones or regions can provide fault tolerance and application-level resilience. But high availability is no substitute for backups. HA solutions protect against data loss and unavailability due to physical failures such as failures of disks, nodes, power, network connectivity, and, with proper design, even entire sites. But they don’t help protect against logical failures. Since the use of RAID and volume replication became common in the 1990s, physical failures in data centers are seldom the cause of restore requests. The primary cause of data loss and subsequent restores is logical errors: user errors, software errors, operator errors, and security breaches. Using highly available cloud solutions doesn’t relieve you of the responsibility to protect your applications and data with backups.

Happy Outcomes

Think carefully about the whys while deciding how, when, and whether to protect your Kubernetes clusters. A good and properly configured backup solution could make the difference between a good and bad Kubernetes experience.

As a data protection company, at CloudCasa by Catalogic we have heard of many approaches to protecting Kubernetes, and many reasons why customers don’t or didn’t, think they needed to back it up at all. Some of these reasons were valid, and others weren’t. Sometimes these decisions lead to unhappy outcomes that we heard about only afterward when customers came to us seeking to prevent them from happening again.

👁 KubeCon EU 2022

The Cloud Native Computing Foundation (CNCF) hosts critical components of the global technology infrastructure including Kubernetes, OpenTelemetry, and Argo. CNCF is the neutral home for cloud native collaboration, bringing together the industry’s top developers, end users, and vendors.
Learn More
The latest from CNCF
TRENDING STORIES
Bob Adair is the product manager for CloudCasa at Catalogic Software. Bob has 30 years of experience in the IT, storage, and data protection industries. Since beginning his career in IT on Wall Street, he has held senior positions in...
Read more from Bob Adair
CNCF sponsored this post.
SHARE THIS STORY
TRENDING STORIES
KubeCon+CloudNativeCon is a sponsor of The New Stack.
TNS owner Insight Partners is an investor in: Pragma.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.