VOOZH about

URL: https://willitrunai.com/models/qwen-3-235b-a22b

โ‡ฑ Qwen 3 235B A22B VRAM Requirements โ€” GPU Compatibility


๐Ÿ‘ Alibaba
Alibaba

Qwen 3 235B A22B

Frontier
๐Ÿ‘ huggingface
HuggingFace
98.3KDownloads784LikesApr 2025Released131K tokensContextApache 2.0License89 StrongQuality

Qwen 3 235B A22B (235B parameters) requires approximately 147.7 GB of VRAM with Q4_K_M quantization. As a Mixture of Experts model with 22B active parameters, it uses less memory than its total parameter count suggests. For the best balance of quality and speed, we recommend hardware with at least 170 GB of VRAM.

Get started

โ€” copy & paste to run locally

Copy-paste commands to run Qwen 3 235B A22B on your machine.

Run

lms load Qwen3-235B-A22B-Instruct-2507 && lms server start

Quick specs

Parameters235B (22B active)
Architecturemoe (MoE)
Context131K tokens
Modalitytext
Min RAM91.7 GB
Rec. RAM143.4 GB (Q4_K_M)
LicenseApache 2.0
FamilyQwen
โœ“ Chatโœ“ Reasoning

About this model

We introduce the updated version of the Qwen3-235B-A22B non-thinking mode, named Qwen3-235B-A22B-Instruct-2507, featuring the following key enhancements:

  • โ€ขSignificant improvements: in general capabilities, including **instruction following, logical reasoning, text comprehension, mathematics, science,...
  • โ€ขSubstantial gains: in long-tail knowledge coverage across multiple languages
  • โ€ขMarkedly better alignment: with user preferences in subjective and open-ended tasks, enabling more helpful responses and higher-quality text...
  • โ€ขEnhanced capabilities: in 256K long-context understanding

Related models

Your hardware

Detecting...

Quick picks

Best budgetS
Mac Studio M3 Ultra 256GB~$6,999 โ€” 11 tok/s
Best overallS
AMD Instinct MI325X 256GB~$20,000 โ€” 89 tok/s

Best hardware

Top picks for Qwen 3 235B A22B

AMD Instinct MI325X 256GBS
256 GB
AMD Instinct MI350X 288GBS
288 GB
NVIDIA B200 180GBS
180 GB
NVIDIA GB200 192GBS
192 GB
B100 192GBS
192 GB

Run this model

Qwen 3 235B A22B on AMD Instinct MI325X 256GBQwen 3 235B A22B on AMD Instinct MI350X 288GBQwen 3 235B A22B on NVIDIA B200 180GB

Quantization options

VRAM estimates by quant level

No hardware detected โ€” fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
91.7 GB
Lowโ€”
Q3_K_S
3
115.2 GB
Lowโ€”
NVFP4
4
131.6 GB
Mediumโ€”
Q4_K_M
4
143.4 GB
Mediumโ€”
Q5_K_M
5
169.2 GB
Highโ€”
Q6_K
6
192.7 GB
Highโ€”
Q8_0
8
251.5 GB
Very Highโ€”
F16
16
481.7 GB
Maximumโ€”

Quality benchmarks

Qwen 3 235B A22B benchmark scores

Benchmark verified

Coding

SWE-bench Verifiedโ€”
HumanEval+โ€”
Aider Polyglot57.3%
LiveCodeBench70.7%

Reasoning

MMLU-Pro83.0%
GPQA Diamond77.5%
MATH-50090.2%
ARC Challengeโ€”

General

Chatbot Arenaโ€”
IFEval88.7%

Source: official ยท 2025-07-14

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights143.4 GB
KV Cache2.9 GB
Runtime0.9 GB
Headroom0.6 GB

Frequently asked questions

FAQ โ€” Qwen 3 235B A22B

See also

Quantization GuideScoring MethodologyVRAM Calculator