๐ Alibaba
Alibaba
Qwen 3.6 35B A3B
Frontier5.6MDownloads2.3KLikesApr 2026Released262K tokensContextApache 2.0License98 ExceptionalQuality
Qwen 3.6 35B A3B (35B parameters) requires approximately 28.5 GB of VRAM with Q4_K_M quantization. As a Mixture of Experts model with 3B active parameters, it uses less memory than its total parameter count suggests. For the best balance of quality and speed, we recommend hardware with at least 33 GB of VRAM.
Get started
โ copy & paste to run locallyCopy-paste commands to run Qwen 3.6 35B A3B on your machine.
Run
docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \
--hf-repo "Qwen/Qwen3.6-35B-A3B" \
--hf-file "Qwen3.6-35B-A3B-Q4_K_M.gguf" \
-c 4096 -ngl 99Quick specs
Parameters35B (3B active)
Architecturemoe (MoE)
Context262K tokens
Modalitytext+vision
Min RAM13.7 GB
Rec. RAM21.3 GB (Q4_K_M)
LicenseApache 2.0
FamilyQwen
โ Visionโ Codeโ Chatโ Reasoning
About this model
- โข35B total params with only 3B active per token
- โข262K native context with preserve-thinking support
- โขMultimodal open-weights model tuned for coding and agent workflows
Related models
Your hardware
Detecting...
Quick picks
Best budgetS
Mac mini M4 64GB~$1,099 โ 11 tok/sBest overallS
NVIDIA A100 40GB~$10,000 โ 126 tok/sBest hardware
Top picks for Qwen 3.6 35B A3B
Run this model
Quantization options
VRAM estimates by quant level
No hardware detected โ fit column shows raw VRAM estimates
| Quant | Bits | VRAM | Quality | Fit |
|---|---|---|---|---|
Q2_K | 2 | 13.7 GB | Low | โ |
Q3_K_S | 3 | 17.2 GB | Low | โ |
NVFP4 | 4 | 19.6 GB | Medium | โ |
Q4_K_M | 4 | 21.3 GB | Medium | โ |
Q5_K_M | 5 | 25.2 GB | High | โ |
Q6_K | 6 | 28.7 GB | High | โ |
Q8_0 | 8 | 37.5 GB | Very High | โ |
F16 | 16 | 71.8 GB | Maximum | โ |
Quality benchmarks
Qwen 3.6 35B A3B benchmark scores
Coding
SWE-bench Verified73.4%
HumanEval+โ
Aider Polyglotโ
LiveCodeBench80.4%
Reasoning
MMLU-Pro85.2%
GPQA Diamond86.0%
MATH-500โ
ARC Challengeโ
Source: official ยท 2026-04-15
Hardware compatibility
Fit estimates across all hardware
Computing compatibility...
Memory breakdown
Reference: RTX 2060 6GB
Weights21.3 GB
KV Cache4.1 GB
Runtime2.4 GB
Headroom0.6 GB
Frequently asked questions
FAQ โ Qwen 3.6 35B A3B
See also
