VOOZH about

URL: https://willitrunai.com/models/nemotron-nano-8b

โ‡ฑ Nemotron Nano 8B VRAM Requirements โ€” GPU Compatibility


๐Ÿ‘ NVIDIA
NVIDIA

Nemotron Nano 8B

๐Ÿ‘ huggingface
HuggingFace
20.1KDownloads221LikesMar 2025Released131K tokensContextNVIDIA Open ModelLicense82 StrongQuality

Nemotron Nano 8B (8B parameters) requires approximately 8.6 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 10 GB of VRAM.

Get started

โ€” copy & paste to run locally

Copy-paste commands to run Nemotron Nano 8B on your machine.

Run

lms load Llama-3.1-Nemotron-Nano-8B-v1 && lms server start

Quick specs

Parameters8B
Architecturedense
Context131K tokens
Modalitytext
Min RAM3.1 GB
Rec. RAM4.9 GB (Q4_K_M)
LicenseNVIDIA Open Model
FamilyNemotron
โœ“ Chatโœ“ Reasoning

About this model

Nemotron Nano 8B is NVIDIA's reasoning model derived from Llama 3.1 8B Instruct, post-trained for switchable reasoning with on/off modes. Achieves 95.4% on MATH-500 and 54.1% on GPQA Diamond with reasoning enabled. Fits on a single RTX GPU for local deployment.

  • โ€ขSwitchable reasoning: toggle detailed thinking on/off via system prompt
  • โ€ข95.4% on MATH-500 with reasoning on, 36.6% with reasoning off
  • โ€ขDerived from Llama 3.1 8B with multi-phase post-training
  • โ€ขFits on a single RTX GPU for local inference

Related models

Your hardware

Detecting...

Quick picks

๐Ÿ‘ Intel
Best budgetS
Intel Arc B570 10GB~$219 โ€” 45 tok/s
๐Ÿ‘ NVIDIA
Best overallS
RTX 3080 Ti 12GB~$1,199 โ€” 112 tok/s

Best hardware

Top picks for Nemotron Nano 8B

RTX 3080 Ti 12GBS
12 GB
RTX 5070 12GBS
12 GB
RTX 3080 12GBS
12 GB
RTX 2080 Ti 11GBS
11 GB
RTX 4070 Super 12GBS
12 GB

Run this model

Nemotron Nano 8B on RTX 3080 Ti 12GBNemotron Nano 8B on RTX 5070 12GBNemotron Nano 8B on RTX 3080 12GB

Quantization options

VRAM estimates by quant level

No hardware detected โ€” fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
3.1 GB
Lowโ€”
Q3_K_S
3
3.9 GB
Lowโ€”
NVFP4
4
4.5 GB
Mediumโ€”
Q4_K_M
4
4.9 GB
Mediumโ€”
Q5_K_M
5
5.8 GB
Highโ€”
Q6_K
6
6.6 GB
Highโ€”
Q8_0
8
8.6 GB
Very Highโ€”
F16
16
16.4 GB
Maximumโ€”

Quality benchmarks

Nemotron Nano 8B benchmark scores

Benchmark verified

Reasoning

MMLU-Proโ€”
GPQA Diamond54.1%
MATH-50095.4%
ARC Challengeโ€”

General

Chatbot Arenaโ€”
IFEval74.7%

Source: official ยท 2025-03-18

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights4.9 GB
KV Cache2.0 GB
Runtime1.2 GB
Headroom0.6 GB

Frequently asked questions

FAQ โ€” Nemotron Nano 8B

See also

Quantization GuideScoring MethodologyVRAM Calculator