I’ve been using Claude Pro long enough that I don’t even really give much thought to what I gained from the subscription. The five-hour reset on the free tier was getting old, so I upgraded, and that was kind of the end of the internal debate. What I didn’t have a clear answer for at first was what exactly I was getting for it beyond longer sessions - not in the feature-list sense, I know what the features are. More like which of those features I’d actually miss if they went away, and how they made my workflow possible.

I’ve been running Qwen 3.5 9B locally for a minute now and it’s not a side experiment anymore - it’s a daily tool at this point. So I swapped it in for a week and used it the same way I’d normally use Claude, so same workflow, just a different model. Most of the overlap was closer than I expected - right up until one thing completely wasn’t.

What I actually use Claude for

Well, what I thought I used it for

I had a pretty confident answer to this going in: document drops, image analysis, long research sessions, Projects for keeping context loaded across chats, the interactive visuals that came out last month. I use all of it daily and I’d have told you I knew exactly which parts of that I’d miss if they went away.

That turned out to be wrong - not completely, but enough. When you actually swap the tool out and run your real workflow through something else, the thing you thought was load-bearing isn’t always the thing that actually is. Some of what I was reaching for Claude for was more habit than necessity. Some of it Qwen covered better than I expected, or at least well enough that the gap didn’t really matter in practice. The thing I’m actually paying for has a more specific shape than just "a Claude feature".

My local setup

The quick rundown

If you’ve read anything else I’ve written about local LLMs, then you know my setup already so I’ll keep this short. Qwen 3.5 9B, running through LM Studio on an RTX 3070 with 8GB VRAM, sitting at a 60k context window. The reason it can do that on modest hardware is GDN - a hybrid architecture that keeps memory usage mostly flat as context grows instead of climbing with every token the way a standard transformer would. That’s the whole reason I switched to it from gpt-oss 20B a while back.

The other thing worth mentioning is that this isn’t a fresh install. I’ve been running this model for a while now so I have a feel for where it hits walls and how to prompt it properly (though I’m still discovering better local AI habits every day). Point is that I didn’t walk into the comparison blind, and already had an idea of how to handle Qwen.

Where my local LLM held up, and where it didn’t

It went better than I expected, but also worse than I expected

Image analysis was the first real surprise. I use this in Claude constantly - instead of writing a long and tedious prompt, I just show Claude what I’m working with by adding some screenshots to the chat. Qwen handles it just as well - as in, really well. It can read software from interface screenshots even without context, describe scenes accurately, flag design inconsistencies, all of it.

Back-and-forth research and document handling are fine too. The biggest difference here is in how you formulate your prompts. With Claude, I can get messy and it will know what I mean. Local models don’t give you much room to get as messy because they interpret your prompts more literally, so you will need to adopt a prompting habit around clarity. But once you do, it can handle more than you realize. I’m talking day-to-day thinking-partner type of stuff like working through a concept you’re stuck on or having it summarize something long. It understood every angle I was throwing at it and gave me the explanations and comparisons I needed. I still like Claude’s more personable and conversational style, but that’s just preference and not a reflection of Qwen’s capability.

Of course, something where any local LLM will pull ahead is freedom. There’s no rolling window usages watching over your shoulder or additional charges to your card, or concern that your conversation might be retained on an unknown server for years. This makes you less guarded in your workflow.

Then there was the render loop when it came to visualizing and prototyping.

👁 XDA
Quiz
8 Questions · Test Your Knowledge

How much do you know about Claude?
Trivia challenge

Think you know Anthropic's AI assistant? Put your knowledge of Claude to the test.

OriginsCapabilitiesSafetyFeaturesDesign
01 / 8Origins

Which company created Claude?

Correct! Claude was created by Anthropic, an AI safety company founded in 2021. Anthropic was co-founded by Dario Amodei and Daniela Amodei, among others who previously worked at OpenAI.
Not quite. Claude is made by Anthropic, not to be confused with OpenAI, which makes ChatGPT. Anthropic was founded in 2021 with a strong focus on AI safety research.
02 / 8Safety

What is the name of the safety and values framework Anthropic developed to guide Claude's behavior?

Correct! Anthropic developed Constitutional AI (CAI), a technique that trains Claude using a set of principles — a 'constitution' — to guide its responses toward being helpful, harmless, and honest.
Not quite. The framework is called Constitutional AI (CAI). It is a novel training approach pioneered by Anthropic that uses a written set of principles to help the model self-critique and improve its own outputs.
03 / 8Origins

What is the name most commonly associated with inspiring Claude's name?

Correct! Claude Shannon is widely cited as the inspiration behind the name. Shannon founded information theory, which is foundational to all modern computing and digital communication — a fitting namesake for an AI.
Not quite. The name Claude is most commonly associated with Claude Shannon, the mathematician and electrical engineer who founded information theory. His pioneering work laid the groundwork for the digital age.
04 / 8Capabilities

Which of the following best describes Claude's context window capability in its more advanced versions?

Correct! Advanced versions of Claude support context windows of 100,000 tokens or more, allowing it to process entire books, lengthy codebases, or large documents in a single conversation — a standout feature at the time of its release.
Not quite. Claude's advanced versions support context windows of 100,000 tokens or more. This was a significant leap beyond many contemporaries and allows Claude to reason over very large amounts of text in one session.
05 / 8Design

Which of the following principles is NOT part of Anthropic's core goal for Claude?

Correct! Anthropic's guiding principles for Claude are to be Helpful, Harmless, and Honest — often called the 'three H's.' Hierarchical is not part of this framework. The goal is to make AI that is safe and beneficial for everyone.
Not quite. Anthropic's three guiding principles for Claude are Helpful, Harmless, and Honest. 'Hierarchical' is not one of them. These three H's shape how Claude is trained to interact with users responsibly.
06 / 8Features

What was a key distinguishing feature of Claude 2 when it launched compared to many rival models at the time?

Correct! Claude 2 launched with a 100,000-token context window, which was remarkable at the time. This allowed users to feed in entire books or massive codebases for analysis, setting Claude apart from many competing models.
Not quite. The standout feature of Claude 2 was its 100,000-token context window. Claude does not natively generate images, and real-time browsing and built-in voice were not launch features of Claude 2.
07 / 8Safety

Anthropic describes itself primarily as which type of company?

Correct! Anthropic describes itself as an AI safety and research company. Unlike some competitors who lead with products or platforms, Anthropic's founding mission centers on building AI systems that are safe, interpretable, and steerable.
Not quite. Anthropic is primarily an AI safety and research company. Its founding mission is rooted in making AI that is safe and understandable, which is why safety-focused training methods like Constitutional AI are central to its work.
08 / 8Features

Which of the following tasks is Claude specifically designed to handle well?

Correct! Claude excels at long-form writing, summarization, coding assistance, and complex reasoning tasks. Its large context window and nuanced language understanding make it particularly well suited for handling detailed, multi-step text-based work.
Not quite. Claude is designed for text-based tasks like writing, summarization, analysis, and reasoning. It does not render graphics, autonomously execute system commands, or perform live video analysis — it is a large language model at its core.
Challenge Complete

Your Score

/ 8

Thanks for playing!

Qwen can write the HTML and I can open it in my browser; that’s not the wall. The wall is that rendering a visual and interactive version of it locally means either wrestling with Open WebUI to render the code, which doesn’t connect to LM studio cleanly, or rebuilding my whole local setup around Ollama, which is the preferred backend for Open WebUI. Even if setup was smooth, Open WebUI’s artifacts depend on the model responding in exactly the right format to trigger the render panel, and a general-purpose local model like 3.5 isn’t tuned for that the way Claude is.

Claude just does it every time. I drop in a screenshot or a brief, it builds something interactive in the artifacts panel without me having to ask, and I can iterate cleanly from there. For someone primarily using Claude as a design learning tool, that reliability without setup is what I’m paying for.

The subscription makes more sense now

The context window is the other gap worth mentioning, though I saw it coming. I can’t push Qwen to even half of Claude’s context on my hardware. This just means my chats got cut short earlier than I’d hoped. Not a surprise, just a real limit that’s worth being upfront about.

The render workflow is the thing I couldn’t get around. Everything else was close enough to make it a bit harder justifying the subscription on those things alone. But that one design workflow, working reliably every time without setup, is what twenty dollars a month actually buys.