๐ Google
Google
Gemma 4 26B A4B
FrontierGemma 4 26B A4B (25.200000762939453B parameters) requires approximately 20.8 GB of VRAM with Q4_K_M quantization. As a Mixture of Experts model with 3.799999952316284B active parameters, it uses less memory than its total parameter count suggests. For the best balance of quality and speed, we recommend hardware with at least 24 GB of VRAM.
Get started
โ copy & paste to run locallyCopy-paste commands to run Gemma 4 26B A4B on your machine.
Run
ollama run gemma4:26bQuick specs
About this model
- โขMoE: 128 experts, 8 active per token
- โข256K context window
- โข#3 open model on Arena
- โขApache 2.0 license
- โข89% AIME 2026
Related models
Your hardware
Detecting...
Quick picks
Best hardware
Top picks for Gemma 4 26B A4B
Run this model
Quantization options
VRAM estimates by quant level
No hardware detected โ fit column shows raw VRAM estimates
| Quant | Bits | VRAM | Quality | Fit |
|---|---|---|---|---|
Q2_K | 2 | 9.8 GB | Low | โ |
Q3_K_S | 3 | 12.3 GB | Low | โ |
NVFP4 | 4 | 14.1 GB | Medium | โ |
Q4_K_M | 4 | 15.4 GB | Medium | โ |
Q5_K_M | 5 | 18.1 GB | High | โ |
Q6_K | 6 | 20.7 GB | High | โ |
Q8_0 | 8 | 27.0 GB | Very High | โ |
F16 | 16 | 51.7 GB | Maximum | โ |
Quality benchmarks
Gemma 4 26B A4B benchmark scores
Coding
Reasoning
Source: official ยท 2026-04-02
Hardware compatibility
Fit estimates across all hardware
Computing compatibility...
Memory breakdown
Reference: RTX 2060 6GB
Frequently asked questions
FAQ โ Gemma 4 26B A4B
See also
