VOOZH about

URL: https://thenewstack.io/can-self-supervised-learning-teach-ai-systems-common-sense/

⇱ Can Self-Supervised Learning Teach AI Systems Common Sense? - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2021-09-16 10:00:48
Can Self-Supervised Learning Teach AI Systems Common Sense?
contributed,
Software Development

Can Self-Supervised Learning Teach AI Systems Common Sense?

Despite recent advancements in AI (especially in the fields of natural language processing (NLP) and computer vision applications), mastering the unique complexities of human language continues to be one of AI’s biggest challenges.
Sep 16th, 2021 10:00am by Pieter Buteneers
👁 Featued image for: Can Self-Supervised Learning Teach AI Systems Common Sense?
Feature image via Pixabay.
Pieter Buteneers
Pieter Buteneers is an industrial and ICT-electronics engineer. He started his career in academia, first as a Ph.D. student and later as a postdoc, where he did research on Machine Learning, Deep Learning, Brain Computer Interfaces and Epilepsy. He won the first prize in the biggest Deep Learning competition of 2015 together with a team of machine learners from Ghent University: the National Data Science Bowl hosted on kaggle.com. In the same year, he gave a TEDx talk on Brain Computer Interfaces. In 2019 he became the CTO of Chatlayer.ai, a platform to build multilingual chatbots who communicate on a human level. In 2020 Chatlayer.ai was acquired by Sinch and now Pieter leads all Machine Learning efforts at Sinch as Director of Engineering in ML & AI.

Imagine having an artificial intelligence (AI) system that is capable of mimicking human language and intelligence. Given AI’s capabilities, it seems simple, right? Not quite. Despite recent advancements in AI (especially in the fields of natural language processing (NLP) and computer vision applications), mastering the unique complexities of human language continues to be one of AI’s biggest challenges.

According to IDC, worldwide revenues for the AI market are forecast to grow 16.4 percent year over year in 2021, as the market is expected to break the $500 billion mark by 2024.

As companies continue to develop and deploy AI solutions to automate processes, solve complex problems and enhance customer experiences, many are realizing its shortcomings — including the amount of data required to train machine learning (ML) algorithms and the flexibility of these algorithms in understanding human language.

The ability for computers to effectively understand all human language would completely transform how we engage with brands and businesses on a global scale. As businesses begin transitioning away from high-frequency, one-way communications and toward two-way conversations, it will be critical for organizations to gain a deeper understanding of human language as they look to improve customer interactions.

Think about it: If AI systems can get a deeper understanding beyond the traditional means of analyzing data, they’ll exceed human performance in language tasks and bring AI one step closer to human-level intelligence. The big question is: Is this level of human performance achievable?

Yes — and the secret lies within self-supervised learning.

Self-Supervised Learning: How Can It Improve AI?

Most of what we learn about the world, especially as babies, is mainly through observation and trial and error. As we learn, we develop common sense and the ability to learn complex tasks such as driving a car.

While ML algorithms can’t directly mimic the way babies learn, self-supervised learning can help systems predict what comes next. If we want AI to act more like humans, then we need vast amounts of high-quality labeled data.

Self-supervised learning allows ML algorithms to train on low-quality unlabeled data. The technique typically involves taking an input dataset and concealing part of it. The self-supervised learning algorithm must then analyze visible data and enable it to predict the remaining hidden data. As a result, this process creates the labels that will allow the system to learn and gives the system the ability to fill in the blanks.

Self-supervised learning eliminates the need for data labeling, opening up a huge opportunity for organizations to better utilize unlabeled data and streamline data processes. It creates a data-efficient AI system that can analyze and process data without the need for human intervention, eliminating the need for full “supervision.”

While self-supervised learning is a relatively new concept to the world of AI, it has already enabled major advancements in NLP. For example, Google introduced the BERT model in 2018, where engineers recycled an architecture typically used for machine translation and made it learn the meaning of a word in relation to its context in a sentence.

Facebook eventually took this a step further and was able to train a BERT-like model on more than 100 languages simultaneously. In 2020, Google pushed the BERT architecture to its limits by training a much larger network on even more data. The language-agnostic mT5 model performs better than humans in labeling sentences and finding the right answers to a question.

But with all these recent advancements, why aren’t we seeing these algorithms everywhere?

What’s Holding Us Back?

First and foremost, training the T5 algorithm is costly. While Google publicly shared these models, they can’t be used for anything specific without fine-tuning them to accomplish the task at hand — ultimately adding more cost. Furthermore, once these models are optimized to accomplish your specific problem, they still require a lot of time and power to compute and execute.

Most deep learning algorithms and workflows remain inefficient. While deep learning has made significant strides in recent years, it requires large amounts of data in order to have useful outputs.

Reducing AI’s data-dependency and moving beyond the limitations of deep learning will require the capabilities of self-supervised learning in order to both be successful and teach AI systems common sense.

Over time, as companies invest in advancing AI systems and fine-tuning their efforts, I expect that new applications will emerge. We could see more complex applications in the coming years, but I also foresee new models emerging to outperform the T5 algorithm.

TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.