URL: https://willitrunai.com/models/hf-unsloth--qwen3-5-35b-a3b-gguf

Qwen3.5 35B A3B VRAM Requirements: GPU Compatibility


Unsloth

Qwen3.5 35B A3B

HuggingFace
Limited data available — some specs may be incomplete or estimated.
Downloads: 1.7M
Likes: 694
Context: 0K tokens
License: Unknown
Entry quality: 5

Qwen3.5 35B A3B (35B parameters) requires approximately 27.3 GB of VRAM in total with Q4_K_M quantization: about 21.3 GB for the weights, plus KV cache, runtime overhead, and headroom. For the best balance of quality and speed, we recommend hardware with at least 32 GB of VRAM.

Get started

Copy-paste commands to run Qwen3.5 35B A3B on your machine.

Run

docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \
  --hf-repo "unsloth/Qwen3.5-35B-A3B-GGUF" \
  --hf-file "Qwen3.5-35B-A3B-GGUF-Q4_K_M.gguf" \
  -c 4096 -ngl 99

Quick specs

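Parameters: 35B
Architecture: dense (as listed; the "A3B" suffix in Qwen model names typically denotes a mixture-of-experts model with about 3B active parameters)
Context: 0K tokens (as listed; likely not reported)
Modality: text
Min RAM: 13.7 GB (Q2_K)
Rec. RAM: 21.3 GB (Q4_K_M)
License: Unknown
Family: Qwen
Capabilities: Vision, Chat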

Quick picks

Best budget (grade C): Mac mini M4 64GB, ~$1,099, 7 tok/s
Best overall (grade B): NVIDIA A100 40GB, ~$10,000, 61 tok/s

Best hardware

Top picks for Qwen3.5 35B A3B

Hardware                       Grade   Memory
NVIDIA A100 40GB               B       40 GB
RTX PRO 5000 Blackwell 48GB    C       48 GB
MacBook Pro M4 Max 64GB        C       64 GB
RTX 6000 Ada 48GB              C       48 GB
NVIDIA L40S 48GB               C       48 GB

Run this model

Qwen3.5 35B A3B on NVIDIA A100 40GB
Qwen3.5 35B A3B on RTX PRO 5000 Blackwell 48GB
Qwen3.5 35B A3B on MacBook Pro M4 Max 64GB

Quantization options

VRAM estimates by quant level

Quant     Bits   VRAM      Quality
Q2_K      2      13.7 GB   Low
Q3_K_S    3      17.2 GB   Low
NVFP4     4      19.6 GB   Medium
Q4_K_M    4      21.3 GB   Medium
Q5_K_M    5      25.2 GB   High
Q6_K      6      28.7 GB   High
Q8_0      8      37.5 GB   Very High
F16       16     71.8 GB   Maximum
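
The nominal bit widths in the table are rounded labels, so it can help to work out the effective bits per weight that each VRAM figure implies. The sketch below is only illustrative: it assumes the VRAM column covers weights only, that 1 GB means 10^9 bytes, and that the parameter count is exactly 35B. None of these assumptions is confirmed by the page, and the function name is hypothetical.

```python
# Effective bits per weight implied by the VRAM estimates in the table.
# Assumptions (not confirmed by the page): decimal gigabytes, weights-only
# figures, and exactly 35e9 parameters.
PARAMS = 35e9
BYTES_PER_GB = 1e9

# VRAM estimates copied from the quantization table (GB).
VRAM_GB = {
    "Q2_K": 13.7,
    "Q3_K_S": 17.2,
    "NVFP4": 19.6,
    "Q4_K_M": 21.3,
    "Q5_K_M": 25.2,
    "Q6_K": 28.7,
    "Q8_0": 37.5,
    "F16": 71.8,
}


def effective_bits_per_weight(vram_gb, params=PARAMS):
    """Convert a size in GB into average bits stored per parameter."""
    return vram_gb * BYTES_PER_GB * 8 / params


for quant, gb in VRAM_GB.items():
    bpw = effective_bits_per_weight(gb)
    print(f"{quant:8s} {gb:5.1f} GB  ~{bpw:.2f} bits/weight")
```

Under these assumptions every row comes out above its nominal bit width (Q4_K_M lands near 4.9 bits per weight, for example). That is consistent with block-quantized formats storing scale data alongside the weights, though part of the gap may simply reflect how the estimates were produced.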

Memory breakdown

Reference: RTX 2060 6GB

Weights: 21.3 GB
KV Cache: 4.1 GB
Runtime: 1.2 GB
Headroom: 0.6 GB
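
The breakdown above can be summed and compared against a device's memory to see whether the Q4_K_M build fits. This is a minimal sketch, not the site's calculator: the helper names are hypothetical, and it treats each device's listed capacity as fully available to the model, which is a simplification, particularly for unified-memory Macs.

```python
# Memory components for Q4_K_M, copied from the breakdown above (GB).
BREAKDOWN_GB = {
    "weights": 21.3,
    "kv_cache": 4.1,
    "runtime": 1.2,
    "headroom": 0.6,
}

# Memory capacities of devices named on this page (GB).
HARDWARE_GB = {
    "RTX 2060 6GB": 6,
    "NVIDIA A100 40GB": 40,
    "RTX PRO 5000 Blackwell 48GB": 48,
    "MacBook Pro M4 Max 64GB": 64,
}


def total_required(breakdown):
    """Sum all memory components, rounded to one decimal place."""
    return round(sum(breakdown.values()), 1)


def fits(capacity_gb, breakdown=BREAKDOWN_GB):
    """True if the device capacity covers the summed requirement."""
    return capacity_gb >= total_required(breakdown)


required = total_required(BREAKDOWN_GB)
print(f"Total required: {required} GB")
for name, capacity in HARDWARE_GB.items():
    verdict = "fits" if fits(capacity) else "does not fit"
    print(f"{name}: {verdict} ({capacity - required:+.1f} GB)")
```

The components sum to 27.2 GB, which is within rounding of the roughly 27.3 GB quoted at the top of the page. The reference RTX 2060 6GB falls far short of that total, while the 40 GB and larger devices clear it.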
