VOOZH about

URL: https://willitrunai.com/models/gpt-oss-120b

โ‡ฑ GPT-OSS 120B VRAM Requirements โ€” GPU Compatibility


๐Ÿ‘ OpenAI
OpenAI

GPT-OSS 120B

Frontier
๐Ÿ‘ huggingface
HuggingFace๐Ÿ‘ ollama
Ollama
Jun 2025Released131K tokensContextMITLicense94 ExceptionalQuality

GPT-OSS 120B (117B parameters) requires approximately 77.8 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 90 GB of VRAM.

Get started

โ€” copy & paste to run locally

Copy-paste commands to run GPT-OSS 120B on your machine.

Run

ollama run gpt-oss:120b

Quick specs

Parameters117B
Architecturedense
Context131K tokens
Modalitytext
Min RAM45.6 GB
Rec. RAM71.4 GB (Q4_K_M)
LicenseMIT
FamilyGPT-OSS
โœ“ Codeโœ“ Chatโœ“ Reasoning

About this model

GPT-OSS 120B is OpenAI's large open-source model, offering strong reasoning and coding capabilities.

Related models

Your hardware

Detecting...

Quick picks

Best budgetS
Mac Studio M3 Ultra 256GB~$6,999 โ€” 9 tok/s
Best overallS
AMD Instinct MI300A 128GB~$12,000 โ€” 57 tok/s

Best hardware

Top picks for GPT-OSS 120B

AMD Instinct MI300A 128GBS
128 GB
NVIDIA H200 141GBS
141 GB
NVIDIA H200 PCIe 141GBS
141 GB
Gaudi 3 128GBS
128 GB
AMD Instinct MI250X 128GBS
128 GB

Run this model

GPT-OSS 120B on AMD Instinct MI300A 128GBGPT-OSS 120B on NVIDIA H200 141GBGPT-OSS 120B on NVIDIA H200 PCIe 141GB

Quantization options

VRAM estimates by quant level

No hardware detected โ€” fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
45.6 GB
Lowโ€”
Q3_K_S
3
57.3 GB
Lowโ€”
NVFP4
4
65.5 GB
Mediumโ€”
Q4_K_M
4
71.4 GB
Mediumโ€”
Q5_K_M
5
84.2 GB
Highโ€”
Q6_K
6
95.9 GB
Highโ€”
Q8_0
8
125.2 GB
Very Highโ€”
F16
16
239.8 GB
Maximumโ€”

Quality benchmarks

GPT-OSS 120B benchmark scores

Benchmark verified

Coding

SWE-bench Verified62.4%
HumanEval+โ€”
Aider Polyglotโ€”
LiveCodeBench81.9%

Reasoning

MMLU-Pro90.0%
GPQA Diamond80.1%
MATH-500โ€”
ARC Challengeโ€”

Source: official ยท 2025-08-15

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights71.4 GB
KV Cache4.9 GB
Runtime0.9 GB
Headroom0.6 GB

Frequently asked questions

FAQ โ€” GPT-OSS 120B

See also

Quantization GuideScoring MethodologyVRAM Calculator