VOOZH about

URL: https://thenewstack.io/claude-small-business-test/

⇱ I buried 20 problems in a fake P&L to see if Claude for Small Business could find them - The New Stack


TNS
SUBSCRIBE
Join our community of software engineering leaders and aspirational developers. Always stay in-the-know by getting the most important news and exclusive content delivered fresh to your inbox to learn more about at-scale software development.
REQUIRED
It seems that you've previously unsubscribed from our newsletter in the past. Click the button below to open the re-subscribe form in a new tab. When you're done, simply close that tab and continue with this form to complete your subscription.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.
Welcome and thank you for joining The New Stack community!
Please answer a few simple questions to help us deliver the news and resources you are interested in.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Great to meet you!
Tell us a bit about your job so we can cover the topics you find most relevant.
REQUIRED
REQUIRED
REQUIRED
REQUIRED
REQUIRED
Welcome!

We’re so glad you’re here. You can expect all the best TNS content to arrive Monday through Friday to keep you on top of the news and at the top of your game.

What’s next?

Check your inbox for a confirmation email where you can adjust your preferences and even join additional groups.

Follow TNS on your favorite social media networks.

Become a TNS follower on LinkedIn.

Check out the latest featured and trending stories while you wait for your first TNS newsletter.

PREV
1 of 2
NEXT
VOXPOP
As a JavaScript developer, what non-React tools do you use most often?
Angular
0%
Astro
0%
Svelte
0%
Vue.js
0%
Other
0%
I only use React
0%
I don't use JavaScript
0%
Thanks for your opinion! Subscribe below to get the final results, published exclusively in our TNS Update newsletter:
NEW! Try Stackie AI
From clobbered drafts to real-time sync
Apr 14th 2026 10:00am, by David Moore
TypeScript 6.0 RC arrives as a bridge to a faster future
Mar 14th 2026 9:00am, by Darryl K. Taft
Mastra empowers web devs to build AI agents in TypeScript
Jan 28th 2026 11:00am, by Loraine Lawson
2026-05-22 09:00:00
I buried 20 problems in a fake P&L to see if Claude for Small Business could find them
AI Agents / AI Strategy / Operations

I buried 20 problems in a fake P&L to see if Claude for Small Business could find them

Testing Claude for Small Business: how Anthropic's new connectors for QuickBooks, Canva, and Gmail handled a financial audit with hidden problems.
May 22nd, 2026 9:00am by Jessica Wachtel
👁 Featued image for: I buried 20 problems in a fake P&L to see if Claude for Small Business could find them

Anthropic launched Claude for Small Business this month, adding native connectors to business tools like QuickBooks, HubSpot, Canva, Google Workspace, and about a dozen others directly inside Claude Cowork. That’s a lot of brands — but does it work?

I wanted to see if Claude could read a semi-complicated financial report, understand and summarize key findings, turn it into a presentable slide deck in Canva, and email a short summary with the deck to my fictional colleagues.

The test

I used Google Sheets, Canva, and my Gmail.

Before I could test Claude for Small Business, I built a fictional seven-month P&L for a small software consultancy. The P&L included nine tabs, twelve clients, twenty line items of expenses, and twenty deliberately planted problems ranging from obvious red flags to subtle but important nuances you’d need an expert to uncover. I built it using a different Claude account since I’m not a financial expert. 

I connected Google Drive to Claude Cowork and gave it the following prompt:

I am sharing a Google Sheet with you called Meridian Dev Studio P&L. It has nine tabs covering seven months of financial data for a small software consultancy. Please do the following:

  • Read all nine tabs carefully before drawing any conclusions
  • Write a plain English executive summary of the company’s financial health
  • Flag every anomaly, risk, or concern you find, no matter how small
  • Identify any clients or cost lines that need immediate attention
  • Note anything that appears inconsistent across tabs
  • Tell me what questions you would ask the CEO if you were their CFO walking into this for the first time

Do not just summarize the numbers. I want analysis, not a restatement of the data.

Here are examples of the “problems” across varying levels of complexity hidden in the financial data:

  • Easy
    • The company lost money every month for 7 months, with a cumulative net loss of $134,885.
    • Gross margin collapsed from 58% in November to 10.6% in March, coinciding exactly with the TerraLoop ramp.
  • Medium
    • January’s headline revenue of $112,080 includes a $24,000 one-time late payment recovery from Client B. Without the late payment, January is actually below October.
    • Recruiting spend ran for three months, then stopped completely, with no new payroll line appearing, so the outcome of the search is unaccounted for in the data.
  • Hard
    • Interest income is exactly $180 every single month for seven consecutive months. Real interest income fluctuates with cash balance, suggesting a manual journal entry input by rote rather than a real bank account.
    • A $4,400 bad debt write-off appears in March against a client that never appears anywhere in the revenue lines, a ghost receivable from an engagement not represented in this P&L.

How did Claude perform?

Claude caught 17 of the 20 problems hidden in the financial data in less than six minutes. It caught all of the easy and medium-level items. From the hard tier, it caught five of the eight. This includes findings that one Claude account expected the other Claude account to miss. Missed items included a ghost receivable behind a bad debt write-off, a churned client with no explanation anywhere in the notes, and a reimbursables discrepancy hidden across two separate tabs. 

Claude caught 17 of the 20 problems hidden in the financial data in less than six minutes.

Seventeen out of twenty isn’t bad, if I were grading the test like we were in school, that’s a solid 85%. But we aren’t in school. And small businesses need to know what their numbers say. The three items Claude missed were the most forensic items on the list. Perfectly flat interest income for seven consecutive months suggests a manual journal entry rather than a real bank account.

A drop in bank fees in April, against stable revenue, could signal growing accounts receivable. Suspiciously clean depreciation math on an equipment purchase doesn’t add up. These problems would prompt an expert to ask why they look so perfect, which is a different cognitive mode that Claude does not yet reliably operate in.

Claude also identified five irregularities I didn’t plant: It found a commission plan paying on bookings rather than gross profit, a designer cost jump with no explanation, conference spend hitting its peak in the same month gross margin collapsed, a client referral dependency previously unflagged, and a typo in the Google Drive file name (oops).

Claude and the creative assets

After completing its analysis, Claude built an 18-slide Canva deck and drafted an email summarizing the financial data, which it attached to the deck. The deck was fine. It looked like a standard design with stock images, which is what I’d expect. It wasn’t presentation-ready. What it lacked in perfection, it more than made up for with speed. Claude built that 18-slide deck in about three minutes. That leaves more than enough time for some light design/ copy polish.

I also want to highlight the personalization Claude did in the email. My Gmail account was set up with my name written as Jessica, but lately I’ve been using Jess in emails. Claude picked up on that and used Jess to sign off. 

The value

Claude for Small Business is a huge step forward for small business owners. Claude did in 20 minutes what I can only assume would take days to a week to complete. But — and this is a big but — it doesn’t replace the need for a human in the loop.

Claude did in 20 minutes what I can only assume would take days to a week to complete.

Claude missed three items hidden in the data. That means those three items didn’t make it into the deck or the summary either. People using these tools still need to either understand complex accounting and finance or hire someone who does. Don’t get me wrong, this is leaps and bounds ahead of anything I’ve ever seen before.  But if it were my company, I wouldn’t want anything missed. 

It’s a great tool, but it still requires human intervention.

TRENDING STORIES
Jessica Wachtel is a developer marketing writer at InfluxData where she creates content that helps make the world of time series data more understandable and accessible. Jessica has a background in software development and technical journalism.
Read more from Jessica Wachtel
SHARE THIS STORY
TRENDING STORIES
TNS owner Insight Partners is an investor in: Real, Canva, Anthropic.
SHARE THIS STORY
TRENDING STORIES
TNS DAILY NEWSLETTER Receive a free roundup of the most recent TNS articles in your inbox each day.
The New Stack does not sell your information or share it with unaffiliated third parties. By continuing, you agree to our Terms of Use and Privacy Policy.