URL: https://willitrunai.com/models/qwen-2.5-math-72b

Qwen 2.5 Math 72B VRAM Requirements: GPU Compatibility


๐Ÿ‘ Alibaba
Alibaba

Qwen 2.5 Math 72B

Frontier
๐Ÿ‘ huggingface
HuggingFace
Downloads: 1.1K · Likes: 31 · Released: Sep 2024 · Context: 4K tokens · License: Apache 2.0 · Quality: 34 (Basic)

Qwen 2.5 Math 72B (72B parameters) requires approximately 50.3 GB of VRAM with Q4_K_M quantization at the 4K-token context: 43.9 GB of weights plus about 6.4 GB for KV cache, runtime, and headroom. For the best balance of quality and speed, we recommend hardware with at least 58 GB of VRAM.

Get started

Copy and paste to run locally

Copy-paste commands to run Qwen 2.5 Math 72B on your machine.

Run

docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \
  --hf-repo "Qwen/Qwen2.5-Math-72B-Instruct" \
  --hf-file "Qwen2.5-Math-72B-Instruct-Q4_K_M.gguf" \
  -c 4096 -ngl 99

Quick specs

Parameters: 72B
Architecture: dense
Context: 4K tokens
Modality: text
Min RAM: 28.1 GB
Rec. RAM: 43.9 GB (Q4_K_M)
License: Apache 2.0
Family: Qwen
✓ Reasoning

About this model

> [!Warning]
> 🚨 Qwen2.5-Math mainly supports solving English and Chinese math problems through CoT and TIR. We do not recommend using this series of models for other tasks.

Quick picks

Best budget (grade B): MacBook Pro M4 Max 96GB, ~$2,499, 15 tok/s
Best overall (grade B): NVIDIA H100 80GB, ~$40,000, 70 tok/s

Best hardware

Top picks for Qwen 2.5 Math 72B

NVIDIA H100 80GB: 80 GB (grade B)
NVIDIA H800 80GB: 80 GB (grade B)
NVIDIA GH200 96GB: 96 GB (grade B)
NVIDIA H20 96GB: 96 GB (grade B)
NVIDIA A100 80GB: 80 GB (grade B)

Run this model

Qwen 2.5 Math 72B on NVIDIA H100 80GB
Qwen 2.5 Math 72B on NVIDIA H800 80GB
Qwen 2.5 Math 72B on NVIDIA GH200 96GB

Quantization options

VRAM estimates by quant level

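No hardware detected; the Fit column shows raw VRAM estimates.

| Quant  | Bits | VRAM     | Quality   | Fit |
|--------|------|----------|-----------|-----|
| Q2_K   | 2    | 28.1 GB  | Low       | -   |
| Q3_K_S | 3    | 35.3 GB  | Low       | -   |
| NVFP4  | 4    | 40.3 GB  | Medium    | -   |
| Q4_K_M | 4    | 43.9 GB  | Medium    | -   |
| Q5_K_M | 5    | 51.8 GB  | High      | -   |
| Q6_K   | 6    | 59.0 GB  | High      | -   |
| Q8_0   | 8    | 77.0 GB  | Very High | -   |
| F16    | 16   | 147.6 GB | Maximum   | -   |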
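To make the table easier to act on, here is a minimal Python sketch that picks the largest quantization that fits in a given amount of VRAM. The per-quant sizes are copied from the table. The helper name `largest_quant_that_fits` is hypothetical. The fixed 6.4 GB overhead is an assumption: it is the KV cache, runtime, and headroom figures from this page's Memory breakdown, which are given for Q4_K_M at 4K context and are reused here for every quant.

```python
# Per-quant sizes in GB, copied from the quantization table on this page.
QUANT_WEIGHTS_GB = {
    "Q2_K": 28.1,
    "Q3_K_S": 35.3,
    "NVFP4": 40.3,
    "Q4_K_M": 43.9,
    "Q5_K_M": 51.8,
    "Q6_K": 59.0,
    "Q8_0": 77.0,
    "F16": 147.6,
}

# Assumption: KV cache (4.9) + runtime (0.9) + headroom (0.6) from the page's
# memory breakdown, treated as constant across quant levels.
OVERHEAD_GB = 4.9 + 0.9 + 0.6


def largest_quant_that_fits(vram_gb, overhead_gb=OVERHEAD_GB):
    """Return the name of the largest quant whose size plus overhead fits, or None."""
    fitting = {
        name: size
        for name, size in QUANT_WEIGHTS_GB.items()
        if size + overhead_gb <= vram_gb
    }
    if not fitting:
        return None
    # Larger size is used as a proxy for higher quality, following the table order.
    return max(fitting, key=fitting.get)


if __name__ == "__main__":
    for vram in (24, 80, 96):
        print(vram, "GB ->", largest_quant_that_fits(vram))
```

Under these assumptions an 80 GB card lands on Q6_K rather than Q8_0, because Q8_0 leaves no room for the overhead. Longer contexts would raise the KV cache share and could shift the answer.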

Quality benchmarks

Qwen 2.5 Math 72B benchmark scores

Benchmark verified

Reasoning

MMLU-Pro: 42.4%
GPQA Diamond: 10.9%
MATH-500: 87.8%
ARC Challenge: n/a

General

Chatbot Arena: n/a
IFEval: 40.0%

Source: official · 2024-09-19

Memory breakdown

Reference: RTX 2060 6GB

Weights: 43.9 GB
KV Cache: 4.9 GB
Runtime: 0.9 GB
Headroom: 0.6 GB
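The four components above add up to the roughly 50.3 GB quoted for Q4_K_M. The short Python sketch below checks that sum and compares it against a device's VRAM. The `fits` helper is a hypothetical name used for illustration, and the comparison ignores anything else that may occupy the GPU.

```python
# Memory breakdown for Q4_K_M in GB, copied from this page.
breakdown_gb = {
    "weights": 43.9,
    "kv_cache": 4.9,
    "runtime": 0.9,
    "headroom": 0.6,
}

# Round to one decimal place to match the page's precision.
total_gb = round(sum(breakdown_gb.values()), 1)


def fits(vram_gb):
    """Return True if the full breakdown fits in the given VRAM (in GB)."""
    return total_gb <= vram_gb


if __name__ == "__main__":
    print("Total:", total_gb, "GB")
    print("Fits on 6 GB reference card:", fits(6))
    print("Fits on 80 GB card:", fits(80))
```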

See also

Quantization Guide · Scoring Methodology · VRAM Calculator