VOOZH about

URL: https://willitrunai.com/models/gemma-4-31b

โ‡ฑ Gemma 4 31B VRAM Requirements โ€” GPU Compatibility


๐Ÿ‘ Google
Google

Gemma 4 31B

Frontier
๐Ÿ‘ huggingface
HuggingFace๐Ÿ‘ ollama
Ollama
Apr 2026Released256K tokensContextApache-2.0License86 StrongQuality

Gemma 4 31B (30.700000762939453B parameters) requires approximately 35.2 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 41 GB of VRAM.

Get started

โ€” copy & paste to run locally

Copy-paste commands to run Gemma 4 31B on your machine.

Run

ollama run gemma4:31b

Quick specs

Parameters30.7B
Architecturedense
Context256K tokens
Modalitytext
Min RAM12 GB
Rec. RAM18.7 GB (Q4_K_M)
LicenseApache-2.0
FamilyGemma
โœ“ Codeโœ“ Chatโœ“ Reasoning

About this model

Gemma 4 31B is the largest and most capable open Gemma model. Dense architecture with 30.7B parameters. 256K context window. Achieves 2150 Codeforces ELO and 89.2% AIME 2026. Apache 2.0 licensed.

  • โ€ขHighest quality open Gemma model
  • โ€ข256K context window
  • โ€ข2150 Codeforces ELO
  • โ€ขApache 2.0 license
  • โ€ข89.2% AIME 2026

Related models

Your hardware

Detecting...

Quick picks

Best budgetA
Mac mini M4 64GB~$1,099 โ€” 7 tok/s
Best overallS
AMD Instinct MI210 64GB~$10,000 โ€” 63 tok/s

Best hardware

Top picks for Gemma 4 31B

AMD Instinct MI210 64GBS
64 GB
RTX PRO 5000 Blackwell 48GBS
48 GB
NVIDIA A100 80GBS
80 GB
NVIDIA H100 80GBS
80 GB
NVIDIA H800 80GBS
80 GB

Run this model

Gemma 4 31B on AMD Instinct MI210 64GBGemma 4 31B on RTX PRO 5000 Blackwell 48GBGemma 4 31B on NVIDIA A100 80GB

Quantization options

VRAM estimates by quant level

No hardware detected โ€” fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
12.0 GB
Lowโ€”
Q3_K_S
3
15.0 GB
Lowโ€”
NVFP4
4
17.2 GB
Mediumโ€”
Q4_K_M
4
18.7 GB
Mediumโ€”
Q5_K_M
5
22.1 GB
Highโ€”
Q6_K
6
25.2 GB
Highโ€”
Q8_0
8
32.8 GB
Very Highโ€”
F16
16
62.9 GB
Maximumโ€”

Quality benchmarks

Gemma 4 31B benchmark scores

Benchmark verified

Coding

SWE-bench Verifiedโ€”
HumanEval+โ€”
Aider Polyglotโ€”
LiveCodeBench80.0%

Reasoning

MMLU-Pro85.2%
GPQA Diamond84.3%
MATH-500โ€”
ARC Challengeโ€”

Source: official ยท 2026-04-02

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights18.7 GB
KV Cache14.6 GB
Runtime1.2 GB
Headroom0.6 GB

Frequently asked questions

FAQ โ€” Gemma 4 31B

See also

Quantization GuideScoring MethodologyVRAM Calculator