VOOZH about

URL: https://thenewstack.io/the-basics-of-dataops-and-why-it-matters/

⇱ The Basics of DataOps and Why It Matters - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2020-09-30 15:00:22
The Basics of DataOps and Why It Matters
podcast,sponsor-gitlab,sponsored,sponsored-podcast-day-of-podcasting,the-new-stack-makers,
DevOps

The Basics of DataOps and Why It Matters

Panel discusses how DataOps is defined and why its application in the context of DevOps is relevant in today’s highly complex, distributed environments.
Sep 30th, 2020 3:00pm by Alex Williams and B. Cameron Gain
👁 Featued image for: The Basics of DataOps and Why It Matters
GitLab sponsored this post.

GitLab sponsored this podcast.

The practice of DataOps has emerged as another variant of DevOps. It involves how data is managed. But DataOps remains a work in progress for many organizations.

For this latest episode of The New Stack Makers podcast, Alex Williams, founder and publisher of The New Stack talks with guest speakers Dina Graves Portman, developer relations engineer for Google; Emilie Schario, internal strategy consultant, data, for GitLab; and Nicole Schultz, assistant director of engineering for Northwestern Mutual. They discussed how DataOps is defined and why its application is particularly relevant in today’s highly complex and increasingly distributed environments.


DataOps: The Basics and Why It Matters

Like DevOps, DataOps can be described as workflow-related but helps resolve data-management challenges. “I think of DataOps as creating a workflow or a way of working as a data team that creates efficiencies and allows you to get more done with the resources you have in a better, more stable way,” said Schario.

DataOps also helps to solve many issues IT teams struggle with related to data. Sadly, broken dashboards and incorrect tallies “seem to be the norm everywhere,” Schario explained.

“DataOps is really about catching those problems — it’s about creating a workflow that catches problems before they make it to your end user,” said Schario. “I think people are feeling the pain.”

A motto of Google site reliability engineers is that “hope is not a strategy,” said Portman. “DataOps is taking ‘hope is not a strategy’ and applying that to data and using all of the different tools that software engineers have been using for years.”

Automating data-collection processes are also important for implementing DataOps, Schultz explained.

“By using CI/CD DevOps practices, [IT teams] are automating more so you’re not reliant on one person doing one certain thing, while still ensuring that you have high data quality with the data-collection processes that you’re automating as you’re moving data throughout workflows,” said Schultz.

Version control should play a role in helping to maintain DataOps discipline just as code repositories have served the modern, distributed development process.

“It really starts with version control to solve the ‘single source of truth’ problem because you create your repository where you’re storing your code, which is the place where your code lives,” said Schario.

Working on data-related processes in a “version control environment” means “everyone’s working off a single collaborative code repository,” Schario explained. “That’s the first step in making sure everyone has the same backbone to the work they’re doing.”

DataOps should also help usher in a new era in data and database management, such as supporting the application of new technologies applied across cloud native environments. As a subset of automation, unsupervised machine learning applied to data analysis and management should reveal some amazing applications in the not-so-distant future.

Portman explained how unsupervised machine learning can help manage some systems for which DataOps teams might not necessary have direct control over. If there is a field that stores elements such as views or clicks in a 32-bit signed integer, the fields start overflowing, causing the numbers to look negative. Unsupervised learning can separate the dataset into unlabeled groups. Regular unsupervised learning on datasets could thus bring to light such issues with the data as they develop.

“How do you identify when something goes wrong when there’s an anomaly? I think unsupervised learning would be a really great way of understanding that when our data usually looks like ‘x’ and suddenly there is this part of it that looks like ‘y,’” said Portman. “That would be a very useful thing to do.”

GitLab is the most comprehensive, intelligent DevSecOps platform for software innovation. GitLab enables organizations to increase developer productivity, improve operational efficiency, reduce security and compliance risk, and accelerate digital transformation.
Learn More
The latest from GitLab
TRENDING STORIES
Alex Williams is founder and publisher of The New Stack. He's a longtime technology journalist who did stints at TechCrunch, SiliconAngle and what is now known as ReadWrite. Alex has been a journalist since the late 1980s, starting at the...
Read more from Alex Williams
BC Gain is founder and principal analyst for ReveCom Media. His obsession with computers began when he hacked a Space Invaders console to play all day for 25 cents at the local video arcade in the early 1980s. He then...
Read more from B. Cameron Gain
GitLab sponsored this post.
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.