VOOZH about

URL: https://willitrunai.com/models/phi-4-mini-4b

โ‡ฑ Phi 4 Mini 4B VRAM Requirements โ€” GPU Compatibility


๐Ÿ‘ Microsoft
Microsoft

Phi 4 Mini 4B

Frontier
๐Ÿ‘ huggingface
HuggingFace๐Ÿ‘ ollama
Ollama
707.1KDownloads783LikesFeb 2025Released128K tokensContextMITLicense50 BasicQuality

Phi 4 Mini 4B (4B parameters) requires approximately 5.7 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 7 GB of VRAM.

Get started

โ€” copy & paste to run locally

Copy-paste commands to run Phi 4 Mini 4B on your machine.

Run

ollama run phi4-mini

Quick specs

Parameters4B
Architecturedense
Context128K tokens
Modalitytext
Min RAM1.6 GB
Rec. RAM2.4 GB (Q4_K_M)
LicenseMIT
FamilyPhi
โœ“ Chatโœ“ Reasoning

About this model

Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites - with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.

  • โ€ขMemory/compute constrained environments
  • โ€ขLatency bound scenarios
  • โ€ขStrong reasoning (especially math and logic)

Related models

Your hardware

Detecting...

Quick picks

๐Ÿ‘ Intel
Best budgetA
Intel Arc A380 6GB~$139 โ€” 40 tok/s
๐Ÿ‘ NVIDIA
Best overallA
RTX 5060 8GB~$299 โ€” 76 tok/s

Best hardware

Top picks for Phi 4 Mini 4B

RTX 5060 8GBA
8 GB
RTX 5060 Ti 8GBA
8 GB
RTX 5050 8GBA
8 GB
RTX 4060 8GBA
8 GB
RTX 4070 Laptop 8GBA
8 GB

Run this model

Phi 4 Mini 4B on RTX 5060 8GBPhi 4 Mini 4B on RTX 5060 Ti 8GBPhi 4 Mini 4B on RTX 5050 8GB

Quantization options

VRAM estimates by quant level

No hardware detected โ€” fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
1.6 GB
Lowโ€”
Q3_K_S
3
2.0 GB
Lowโ€”
NVFP4
4
2.2 GB
Mediumโ€”
Q4_K_M
4
2.4 GB
Mediumโ€”
Q5_K_M
5
2.9 GB
Highโ€”
Q6_K
6
3.3 GB
Highโ€”
Q8_0
8
4.3 GB
Very Highโ€”
F16
16
8.2 GB
Maximumโ€”

Quality benchmarks

Phi 4 Mini 4B benchmark scores

Benchmark verified

Coding

SWE-bench Verifiedโ€”
HumanEval+68.3%
Aider Polyglotโ€”
LiveCodeBench30.5%

Reasoning

MMLU-Pro52.8%
GPQA Diamond25.2%
MATH-50064.0%
ARC Challenge83.7%

General

Chatbot Arenaโ€”
IFEval73.8%

Source: official ยท 2025-02-27

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights2.4 GB
KV Cache1.5 GB
Runtime1.2 GB
Headroom0.6 GB

Frequently asked questions

FAQ โ€” Phi 4 Mini 4B

See also

Quantization GuideScoring MethodologyVRAM Calculator