URL: https://thenewstack.io/uncovering-biases-the-importance-of-data-diversity-in-speech-recognition/

Uncovering Biases: The Importance of Data Diversity in Speech Recognition - The New Stack


2020-09-21 12:00:35
contributed,
Operations

Uncovering Biases: The Importance of Data Diversity in Speech Recognition

Why does bias still persist in ML algorithms? Because ML is only as good as the data it’s given.
Sep 21st, 2020 12:00pm by Scott Stephenson
Dr. Scott Stephenson
Scott Stephenson is a dark matter physicist turned Deep Learning entrepreneur. He earned a Ph.D. in particle physics from the University of Michigan where his research involved building a lab two miles underground to detect dark matter. Scott left his physics research position to co-found Deepgram, a San Francisco-based Speech AI company where he serves as CEO.

Companies use artificial intelligence (AI) and machine learning (ML) models for many purposes, such as reviewing job candidates, monitoring employee productivity, or analyzing voice data to better understand customers’ needs. ML models are typically trained to recognize certain types of patterns in a set of data, which yields an algorithm they can use to reason over and learn from new data. In theory, such a model should produce unbiased outputs based on the data set it’s given. So why does bias still persist in ML algorithms? Because ML is only as good as the data it’s given.

As more companies turn to automatic speech recognition (ASR) tools and deploy AI and ML models across their organizations, it will be imperative to build robust models that are free of bias. Bias-free AI models matter because they work for every person and organization and produce meaningful results that help solve problems rather than create them. This piece covers how to eliminate bias in your models and how to make sure your data set is diverse and as representative of your organization as possible.

Recognizing Bias in Model Training

The first step to eliminating bias in model training is to acknowledge that bias does exist in ML models. One example surfaced earlier this year, when a Stanford study of speech recognition systems from five companies in the voice assistant space uncovered a racial divide in their technologies. The study showed that the systems misidentified 35% of words from Black users but only 19% of words from White users. These systems learn by analyzing vast amounts of data, so if an ML model analyzes only White users’ voice patterns, bias will inevitably occur.
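
Figures like these are usually reported as a word error rate (WER): the word-level edit distance between a reference transcript and the system’s output, divided by the number of reference words. The sketch below shows one way to compute WER separately for each speaker group so that a gap like the one above becomes visible. The group labels, sample sentences, and function names are hypothetical and are not taken from the study.

```python
from collections import defaultdict


def word_errors(reference, hypothesis):
    """Return (word-level edit distance, number of reference words)."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    # Dynamic-programming Levenshtein distance computed over words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1], len(ref)


def wer_by_group(samples):
    """samples: iterable of (group, reference_text, asr_output)."""
    errors = defaultdict(int)
    words = defaultdict(int)
    for group, reference, hypothesis in samples:
        e, n = word_errors(reference, hypothesis)
        errors[group] += e
        words[group] += n
    return {g: errors[g] / words[g] for g in words if words[g] > 0}


# Hypothetical evaluation data: (speaker group, human transcript, ASR output).
samples = [
    ("group_a", "turn on the kitchen lights", "turn on the kitchen lights"),
    ("group_a", "what is the weather today", "what is the whether today"),
    ("group_b", "turn on the kitchen lights", "turn on the chicken light"),
    ("group_b", "call my sister back", "call my sister"),
]

rates = wer_by_group(samples)
for group, rate in sorted(rates.items()):
    print(f"{group}: WER = {rate:.2f}")
```

Pooling errors and word counts per group, rather than averaging per-utterance rates, keeps short utterances from dominating the result.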

Another, more tangible example is gender bias. We encounter it almost every day in voice assistants that are programmed with a female voice by default. Startups in the voice assistant and speech technology space are beginning to change this, but there is a long road ahead to truly genderless AI. Moving forward, AI and ML could help businesses understand the biases that humans have and work to correct them over time. Gender bias is just one example of how bias exists in speech recognition, but it’s representative of an issue most companies face every day. Acknowledging this bias is the first step in working toward a solution.

Have a Diverse Data Set

The next and most important step in reducing bias in ML models is to have a diverse data set. As the examples above show, your data must represent not only different dialects but also different genders in order to reduce bias and improve accuracy.

When developing a representative data set, be aware of the people around you and their individual experiences. If a model is built by white heterosexual males living in coastal states, then the model will be more reflective of their biases, such as the words they use, their cadence of speech, and their dialects and accents. Using a technology that learns from multiple diverse data sets is one way to reduce the gaps and allow all voices to be heard. It’s important to emphasize that neither the ML nor the algorithm is biased upfront, but both can acquire biases if the data set provided does not accurately represent your population. To limit as many of these biases as possible, make sure your data represents a wide range of people with various demographics across the organization, not just the organization’s leaders.
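
One way to check whether a data set “accurately represents your population” is to compare each group’s share of the training speakers with its share of the people the system is meant to serve. The sketch below is a minimal illustration of that comparison. The accent labels, target shares, and the 5-point tolerance are all assumptions chosen for the example.

```python
from collections import Counter


def representation_gaps(sample_groups, target_shares):
    """Return each group's share in the sample minus its target share."""
    counts = Counter(sample_groups)
    total = sum(counts.values())
    gaps = {}
    for group, target in target_shares.items():
        actual = counts.get(group, 0) / total if total else 0.0
        gaps[group] = actual - target
    return gaps


def underrepresented(gaps, tolerance=0.05):
    """List groups whose share falls short of the target by more than tolerance."""
    return sorted(g for g, gap in gaps.items() if gap < -tolerance)


# Hypothetical speaker labels for 100 training recordings.
training_speakers = ["accent_a"] * 70 + ["accent_b"] * 20 + ["accent_c"] * 10

# Hypothetical share of each group among the people the system should serve.
target_shares = {"accent_a": 0.5, "accent_b": 0.3, "accent_c": 0.2}

gaps = representation_gaps(training_speakers, target_shares)
flagged = underrepresented(gaps)
print(flagged)
```

Groups flagged this way are candidates for additional data collection before the next round of training.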

Use a Deep Neural Network (DNN)

The third step toward eliminating bias in ML models is for companies to use a Deep Neural Network (DNN). With typical ML models, there is the “monster” (the model) and the creator of the “monster” (humans). The “monster” learns from the data it is given. With a DNN, humans aren’t doing any kind of hard coding. Companies like Amazon or Google, whose systems are one part heuristics and rules and one part convolutional network, are at risk of inserting their own biases.

On the other hand, if you simply use a DNN, the data you give your model is the only thing that trains it and shapes how it learns over time. This way, the model has nothing to do with the beliefs of the person who built the network, which removes that source of bias. However, to play devil’s advocate, the bias is now transferred to the beliefs of the people labeling the training data (or to how the data is collected). Ultimately, the more robust your data science strategy and the better you train your model with representative data sets, the higher your accuracy rates will be.
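
Because bias can move into the labeling step, it can help to measure how often independent labelers agree on the same audio. The sketch below computes Cohen’s kappa, a standard agreement statistic that corrects raw agreement for chance. This is one possible check and not something the approach above prescribes. The “clear” and “unclear” tags and the two annotators are hypothetical.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labeling the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("Both annotators must label the same non-empty item list.")
    n = len(labels_a)
    # Observed agreement: fraction of items given identical labels.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Expected chance agreement from each annotator's label frequencies.
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used a single identical label throughout
    return (observed - expected) / (1 - expected)


# Hypothetical tags from two annotators rating the same eight clips.
annotator_1 = ["clear", "clear", "unclear", "clear",
               "unclear", "clear", "clear", "unclear"]
annotator_2 = ["clear", "unclear", "unclear", "clear",
               "unclear", "clear", "unclear", "unclear"]

kappa = cohens_kappa(annotator_1, annotator_2)
print(f"kappa = {kappa:.2f}")
```

Computing the same statistic separately for each speaker group can show whether labelers disagree more on some accents or dialects than on others.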

Eliminating bias in ASR is by no means a simple three-step process. Yet it’s critical to acknowledge that bias exists and to understand how it came about, so that the importance of a diverse data set is clear. From there, source audio that is representative of the conversations you want to transcribe, and consider implementing a DNN and making ongoing training part of the speech recognition process. This will improve the accuracy of your speech recognition model and help to eliminate bias, allowing you to better understand and serve the needs of your customers.

Feature image via Pixabay.
