Qwen 2.5 Math 72B

Frontier

1.1KDownloads31LikesSep 2024Released4K tokensContextApache 2.0License34 BasicQuality

Qwen 2.5 Math 72B (72B parameters) requires approximately 50.3 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 58 GB of VRAM.

Get started

— copy & paste to run locally

Copy-paste commands to run Qwen 2.5 Math 72B on your machine.

Run

docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \
 --hf-repo "Qwen/Qwen2.5-Math-72B-Instruct" \
 --hf-file "Qwen2.5-Math-72B-Instruct-Q4_K_M.gguf" \
 -c 4096 -ngl 99

Quick specs

Parameters72B

Architecturedense

Context4K tokens

Modalitytext

Min RAM28.1 GB

Rec. RAM43.9 GB (Q4_K_M)

LicenseApache 2.0

FamilyQwen

✓ Reasoning

About this model

> [!Warning] > > > 🚨 Qwen2.5-Math mainly supports solving English and Chinese math problems through CoT and TIR. We do not recommend using this series of models for other tasks. > >

Related models

Your hardware

Detecting...

Quick picks

Best budgetB

MacBook Pro M4 Max 96GB~$2,499 — 15 tok/s

👁 NVIDIA

Best overallB

NVIDIA H100 80GB~$40,000 — 70 tok/s

Best hardware

Top picks for Qwen 2.5 Math 72B

👁 NVIDIA

NVIDIA H100 80GBB

80 GB

👁 NVIDIA

NVIDIA H800 80GBB

80 GB

👁 NVIDIA

NVIDIA GH200 96GBB

96 GB

👁 NVIDIA

NVIDIA H20 96GBB

96 GB

👁 NVIDIA

NVIDIA A100 80GBB

80 GB

Run this model

Qwen 2.5 Math 72B on NVIDIA H100 80GB Qwen 2.5 Math 72B on NVIDIA H800 80GB Qwen 2.5 Math 72B on NVIDIA GH200 96GB

Quantization options

VRAM estimates by quant level

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	28.1 GB	Low	—
Q3_K_S	3	35.3 GB	Low	—
NVFP4	4	40.3 GB	Medium	—
Q4_K_M	4	43.9 GB	Medium	—
Q5_K_M	5	51.8 GB	High	—
Q6_K	6	59.0 GB	High	—
Q8_0	8	77.0 GB	Very High	—
F16	16	147.6 GB	Maximum	—

Quality benchmarks

Qwen 2.5 Math 72B benchmark scores

Benchmark verified

Reasoning

MMLU-Pro42.4%

GPQA Diamond10.9%

MATH-50087.8%

ARC Challenge—

General

Chatbot Arena—

IFEval40.0%

Source: official · 2024-09-19

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights43.9 GB

KV Cache4.9 GB

Runtime0.9 GB

Headroom0.6 GB

Frequently asked questions

URL: https://willitrunai.com/models/qwen-2.5-math-72b

⇱ Qwen 2.5 Math 72B VRAM Requirements — GPU Compatibility

Qwen 2.5 Math 72B

Top picks for Qwen 2.5 Math 72B

VRAM estimates by quant level

Qwen 2.5 Math 72B benchmark scores

Fit estimates across all hardware

Reference: RTX 2060 6GB

FAQ — Qwen 2.5 Math 72B