VOOZH about

URL: https://thenewstack.io/ai-testing-more-coverage-fewer-bugs-new-risks/

⇱ AI Testing: More Coverage, Fewer Bugs, New Risks - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2024-11-02 04:00:29
AI Testing: More Coverage, Fewer Bugs, New Risks
contributed,
AI / DevOps / Software Testing

AI Testing: More Coverage, Fewer Bugs, New Risks

AI tools are reshaping software testing with faster automation, round-the-clock coverage, and improved accuracy.
Nov 2nd, 2024 4:00am by Tal Barmeir
👁 Featued image for: AI Testing: More Coverage, Fewer Bugs, New Risks
Photo by RoonZ nl on Unsplash.

The advent of generative AI is swiftly ushering in a new era in software testing. With AI test recorders designed to perform equivalent work to a human test automation engineer, GenAI has become so sophisticated that it interprets plain-language instructions to generate test automation code autonomously.

This capability democratizes testing, allowing individuals without coding expertise to interact with testing frameworks directly. It simplifies the entire test automation process by enabling users to effortlessly generate code by recording their interactions with websites — no manual coding required. GenAI-driven testers seamlessly integrate into CI/CD pipelines, autonomously detecting bugs and alerting teams about potential issues.

Transforming QA With AI

From experience, the journey from human automation tester to AI test automation engineer is a transformative process. Traditionally, transitioning to test automation required significant time and resources, including learning to code and understanding automation frameworks.

AI removes these barriers and speeds up development cycles, dramatically reducing time-to-market and improving accuracy, all while decreasing the level of admin tasks for software testers.

Through AI software testing, companies can rapidly decrease time to market. They can interpret test scenarios written in plain language, automatically generate the necessary code for test automation, and execute tests across various platforms and languages. This dramatically reduces the enablement time, allowing QA professionals to focus on strategic tasks instead of coding complexities. This, in turn, addresses the severe shortage in test automation, helping companies achieve often higher quality across digital offerings in no time, for any number of scenarios in parallel in a matter of minutes, creating test automation for multilingual websites and apps as well as for any screen size, including computers, tablets, and mobiles.

The productivity gains from AI in testing are substantial. We now have a vast international bank that we have helped leverage our solution to such an extent it managed to increase test automation coverage across two of its websites (supporting around ten different languages), taking it from a mere forty percent to almost ninety percent in a matter of weeks. I believe this is an amazing achievement, not only because of the end results but also because working in an enterprise environment with its security and integrations can typically take forever.

While traditional test automation might be limited to a single platform or language and the capacity of one person, AI-enhanced testing breaks these limitations. Testers can now create and execute tests on any platform (web, mobile, desktop), in multiple languages, and with the capacity of numerous testers. This amplifies testing capabilities and introduces a new level of flexibility and efficiency.

The AI Advantage: Multilingual and 24/7 Testing

Upskilling QA teams with AI brings the significant advantage of multilingual testing and 24/7 operation. In today’s global market, software products must often cater to diverse users, requiring testing in multiple languages. AI makes this possible without requiring testers to know each language, expanding the reach and usability of software products.

The ability to test around the clock ensures that testing doesn’t become a bottleneck in fast-paced development. AI’s autonomous maintenance capabilities further reduce the time and effort needed to update test cases, ensuring tests remain relevant and practical. A recent report by IDC predicts that by 2028, GenAI-based tools will be able to write 70% of software tests. This will decrease the need for manual testing and improve test coverage, software usability, and code quality.

Navigating the Challenges

The IDC predicts that by “2026, 45% of DevOps teams will use DevSecOps tools leveraging AI to identify security challenges in applications and supply chains.” So, while GenAI offers significant advantages, it also presents operational and security challenges that organizations must address:

Some of these include hallucinations — AI may generate inaccurate or fabricated outputs during testing, leading to incorrect results and potentially overlooking critical issues. Data Privacy — the risk of sensitive data used during testing being mishandled or leaked raises significant privacy concerns. Another challenge is the lack of transparency. The “black box” nature of many AI systems makes it challenging to trace decision-making processes, hindering debugging and trust in the system.

Finally, security vulnerabilities mean systems are susceptible to adversarial attacks that could exploit system weaknesses, potentially compromising the testing process. Another challenge worth mentioning is inconsistent outputs — AI might produce erratic or irrelevant results, affecting test reliability and making it challenging to maintain consistent testing standards.

Strategies for Risk Mitigation

To harness GenAI’s power while mitigating these risks, organizations can implement several strategies, such as Human-in-the-Loop (HITL) Supervision or human oversight. This ensures that AI-generated outputs are rigorously validated for accuracy and reliability. Human supervisors can review and approve AI-generated test cases, ensuring they meet necessary standards before implementation. Another way is restricting AI autonomy, which helps limit the AI’s creative freedom and prevents the system from making unwarranted assumptions or actions.

Setting clear boundaries and guidelines for the AI ensures it operates within acceptable parameters, maintaining a predictable and reliable testing process. A third option requires reasoning for actions to enforce a policy where AI must explain its decisions, promote transparency, and build trust in AI-generated results. By demanding reasoning for each action, developers can gain valuable insights into the AI’s thought process and make informed adjustments. Lastly, secure data management practices help implement robust policies that safeguard sensitive information from being misused during AI training. Encryption, anonymization, and access controls are critical to protect data privacy.

Use GenAI-Driven Testing Practices

Understanding its capabilities and limitations is paramount as GenAI becomes increasingly integrated into software development life cycles. By effectively managing these dynamics, development teams can leverage GenAI’s potential to enhance their testing practices while ensuring the integrity of their software products. With careful consideration of the outlined challenges and mitigation strategies, organizations can harness the full power of GenAI to drive innovation in software testing and deliver high-quality software products.


This article is part of The New Stack’s contributor network. Have insights on the latest challenges and innovations affecting developers? We’d love to hear from you. Become a contributor and share your expertise by filling out this form or emailing Matt Burns at mattburns@thenewstack.io.

TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.