๐ Meta
Meta
Llama 4 Scout 17B 16E
FrontierLlama 4 Scout 17B 16E (109B parameters) requires approximately 71.2 GB of VRAM with Q4_K_M quantization. As a Mixture of Experts model with 17B active parameters, it uses less memory than its total parameter count suggests. For the best balance of quality and speed, we recommend hardware with at least 82 GB of VRAM.
Get started
โ copy & paste to run locallyCopy-paste commands to run Llama 4 Scout 17B 16E on your machine.
Run
lms load Llama-4-Scout-17B-16E-Instruct && lms server startQuick specs
About this model
Related models
Your hardware
Detecting...
Quick picks
Best hardware
Top picks for Llama 4 Scout 17B 16E
Run this model
Quantization options
VRAM estimates by quant level
No hardware detected โ fit column shows raw VRAM estimates
| Quant | Bits | VRAM | Quality | Fit |
|---|---|---|---|---|
Q2_K | 2 | 42.5 GB | Low | โ |
Q3_K_S | 3 | 53.4 GB | Low | โ |
NVFP4 | 4 | 61.0 GB | Medium | โ |
Q4_K_M | 4 | 66.5 GB | Medium | โ |
Q5_K_M | 5 | 78.5 GB | High | โ |
Q6_K | 6 | 89.4 GB | High | โ |
Q8_0 | 8 | 116.6 GB | Very High | โ |
F16 | 16 | 223.5 GB | Maximum | โ |
Quality benchmarks
Llama 4 Scout 17B 16E benchmark scores
Coding
Reasoning
Source: official ยท 2025-04-05
Hardware compatibility
Fit estimates across all hardware
Computing compatibility...
Memory breakdown
Reference: RTX 2060 6GB
Frequently asked questions
FAQ โ Llama 4 Scout 17B 16E
See also
