VOOZH about

URL: https://willitrunai.com/models/qwen-2.5-coder-7b

โ‡ฑ Qwen 2.5 Coder 7B VRAM Requirements โ€” GPU Compatibility


๐Ÿ‘ Alibaba
Alibaba

Qwen 2.5 Coder 7B

Current
๐Ÿ‘ huggingface
HuggingFace๐Ÿ‘ ollama
Ollama
2.0MDownloads743LikesSep 2024Released131K tokensContextApache 2.0License48 BasicQuality

Qwen 2.5 Coder 7B (7B parameters) requires approximately 6.6 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 8 GB of VRAM.

Get started

โ€” copy & paste to run locally

Copy-paste commands to run Qwen 2.5 Coder 7B on your machine.

Run

ollama run qwen2.5-coder:7b

Quick specs

Parameters7B
Architecturedense
Context131K tokens
Modalitytext
Min RAM2.7 GB
Rec. RAM4.3 GB (Q4_K_M)
LicenseApache 2.0
FamilyQwen
โœ“ Code

About this model

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). As of now, Qwen2.5-Coder has covered six mainstream model sizes, 0.5, 1.5, 3, 7, 14, 32 billion parameters, to meet the needs of different developers. Qwen2.5-Coder brings the following improvements upon CodeQwen1.5:

  • โ€ขSignificantly improvements in code generation, code reasoning and code fixing. Base on the strong Qwen2.5, we scale up the training...
  • โ€ขA more comprehensive foundation for real-world applications such as Code Agents. Not only enhancing coding capabilities but also maintaining...
  • โ€ขLong-context Support: up to 128K tokens

Related models

Your hardware

Detecting...

Quick picks

๐Ÿ‘ Intel
Best budgetA
Intel Arc A580 8GB~$179 โ€” 64 tok/s
๐Ÿ‘ NVIDIA
Best overallA
RTX 3080 10GB~$699 โ€” 98 tok/s

Best hardware

Top picks for Qwen 2.5 Coder 7B

RTX 3080 10GBA
10 GB
RTX 2080 Ti 11GBA
11 GB
GTX 1080 Ti 11GBA
11 GB
RTX 3080 Ti 12GBA
12 GB
RTX 4070 12GBA
12 GB

Run this model

Qwen 2.5 Coder 7B on RTX 3080 10GBQwen 2.5 Coder 7B on RTX 2080 Ti 11GBQwen 2.5 Coder 7B on GTX 1080 Ti 11GB

Quantization options

VRAM estimates by quant level

No hardware detected โ€” fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
2.7 GB
Lowโ€”
Q3_K_S
3
3.4 GB
Lowโ€”
NVFP4
4
3.9 GB
Mediumโ€”
Q4_K_M
4
4.3 GB
Mediumโ€”
Q5_K_M
5
5.0 GB
Highโ€”
Q6_K
6
5.7 GB
Highโ€”
Q8_0
8
7.5 GB
Very Highโ€”
F16
16
14.3 GB
Maximumโ€”

Quality benchmarks

Qwen 2.5 Coder 7B benchmark scores

Benchmark verified

Coding

SWE-bench Verified22.0%
HumanEval+84.1%
Aider Polyglotโ€”
LiveCodeBench37.6%

Reasoning

MMLU-Pro45.6%
GPQA Diamond35.6%
MATH-50066.8%
ARC Challengeโ€”

General

Chatbot Arenaโ€”
IFEval58.6%

Source: official ยท 2024-11-12

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights4.3 GB
KV Cache0.9 GB
Runtime0.9 GB
Headroom0.6 GB

Frequently asked questions

FAQ โ€” Qwen 2.5 Coder 7B

See also

Quantization GuideScoring MethodologyVRAM Calculator