VOOZH about

URL: https://willitrunai.com/models/phi-4-reasoning-plus-14b

โ‡ฑ Phi-4-reasoning-plus 14B VRAM Requirements โ€” GPU Compatibility


๐Ÿ‘ Microsoft
Microsoft

Phi-4-reasoning-plus 14B

Frontier
๐Ÿ‘ huggingface
HuggingFace๐Ÿ‘ ollama
Ollama
19.4KDownloads344LikesApr 2025Released33K tokensContextMITLicense92 ExceptionalQuality

Phi-4-reasoning-plus 14B (14.699999809265137B parameters) requires approximately 13.8 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 16 GB of VRAM.

Get started

โ€” copy & paste to run locally

Copy-paste commands to run Phi-4-reasoning-plus 14B on your machine.

Run

ollama run phi4-reasoning

Quick specs

Parameters14.7B
Architecturedense
Context33K tokens
Modalitytext
Min RAM5.7 GB
Rec. RAM9 GB (Q4_K_M)
LicenseMIT
FamilyPhi
โœ“ Chatโœ“ Reasoning

About this model

> [!IMPORTANT] > To fully take advantage of the model's capabilities, inference must use `temperature=0.8`, `top_k=50`, `top_p=0.95`, and `do_sample=True`. For more complex queries, set `max_new_tokens=32768` to allow for longer chain-of-thought (CoT).

  • โ€ขAIME 2025, 2024, 2023, and 2022:: Math olympiad questions
  • โ€ขGPQA-Diamond:: Complex, graduate-level science questions
  • โ€ขOmniMath:: Collection of over 4000 olympiad-level math problems with human annotation
  • โ€ขLiveCodeBench:: Code generation benchmark gathered from competitive coding contests
  • โ€ข3SAT (3-literal Satisfiability Problem) and TSP (Traveling Salesman Problem):: Algorithmic problem solving

Related models

Your hardware

Detecting...

Quick picks

Best budgetS
RX 7600 XT 16GB~$329 โ€” 20 tok/s
๐Ÿ‘ NVIDIA
Best overallS
RTX 4090 24GB~$1,599 โ€” 92 tok/s

Best hardware

Top picks for Phi-4-reasoning-plus 14B

RTX 4090 24GBS
24 GB
RTX 5090 Laptop 24GBS
24 GB
NVIDIA A30 24GBS
24 GB
RTX 3090 Ti 24GBS
24 GB
RTX A4500 20GBS
20 GB

Run this model

Phi-4-reasoning-plus 14B on RTX 4090 24GBPhi-4-reasoning-plus 14B on RTX 5090 Laptop 24GBPhi-4-reasoning-plus 14B on NVIDIA A30 24GB

Quantization options

VRAM estimates by quant level

No hardware detected โ€” fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
5.7 GB
Lowโ€”
Q3_K_S
3
7.2 GB
Lowโ€”
NVFP4
4
8.2 GB
Mediumโ€”
Q4_K_M
4
9.0 GB
Mediumโ€”
Q5_K_M
5
10.6 GB
Highโ€”
Q6_K
6
12.1 GB
Highโ€”
Q8_0
8
15.7 GB
Very Highโ€”
F16
16
30.1 GB
Maximumโ€”

Quality benchmarks

Phi-4-reasoning-plus 14B benchmark scores

Benchmark verified

Coding

SWE-bench Verifiedโ€”
HumanEval+92.3%
Aider Polyglotโ€”
LiveCodeBench53.1%

Reasoning

MMLU-Pro76.0%
GPQA Diamond68.9%
MATH-50095.4%
ARC Challengeโ€”

General

Chatbot Arenaโ€”
IFEval84.9%

Source: official ยท 2025-04-30

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights9.0 GB
KV Cache3.1 GB
Runtime1.2 GB
Headroom0.6 GB

Frequently asked questions

FAQ โ€” Phi-4-reasoning-plus 14B

See also

Quantization GuideScoring MethodologyVRAM Calculator