Nemotron 70B

Current

👁 huggingface
HuggingFace 👁 ollama
Ollama

103Downloads568LikesOct 2024Released131K tokensContextNVIDIA Open ModelLicense52 GoodQuality

Nemotron 70B (70B parameters) requires approximately 49.1 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 57 GB of VRAM.

Get started

— copy & paste to run locally

Copy-paste commands to run Nemotron 70B on your machine.

Run

ollama run nemotron

Quick specs

Parameters70B

Architecturedense

Context131K tokens

Modalitytext

Min RAM27.3 GB

Rec. RAM42.7 GB (Q4_K_M)

LicenseNVIDIA Open Model

FamilyNemotron

✓ Chat✓ Reasoning

About this model

Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA to improve the helpfulness of LLM generated responses to user queries.

•Please sign up to get free and immediate access to NVIDIA NeMo Framework container. If you don’t have an NVIDIA NGC account, you will be...
•If you don’t have an NVIDIA NGC API key, sign into NVIDIA NGC, selecting organization/team: ea-bignlp/ga-participants and click Generate API key....
•On your machine, docker login to nvcr.io using

Related models

Your hardware

Detecting...

Quick picks

Best budgetA

MacBook Pro M4 Max 96GB~$2,499 — 17 tok/s

👁 NVIDIA

Best overallA

NVIDIA H100 80GB~$40,000 — 72 tok/s

Best hardware

Top picks for Nemotron 70B

👁 NVIDIA

NVIDIA H100 80GBA

80 GB

👁 NVIDIA

NVIDIA H800 80GBA

80 GB

👁 NVIDIA

NVIDIA GH200 96GBA

96 GB

👁 NVIDIA

NVIDIA H20 96GBA

96 GB

👁 NVIDIA

NVIDIA A100 80GBA

80 GB

Run this model

Nemotron 70B on NVIDIA H100 80GB Nemotron 70B on NVIDIA H800 80GB Nemotron 70B on NVIDIA GH200 96GB

Quantization options

VRAM estimates by quant level

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	27.3 GB	Low	—
Q3_K_S	3	34.3 GB	Low	—
NVFP4	4	39.2 GB	Medium	—
Q4_K_M	4	42.7 GB	Medium	—
Q5_K_M	5	50.4 GB	High	—
Q6_K	6	57.4 GB	High	—
Q8_0	8	74.9 GB	Very High	—
F16	16	143.5 GB	Maximum	—

Quality benchmarks

Nemotron 70B benchmark scores

Benchmark verified

Reasoning

MMLU-Pro85.2%

GPQA Diamond1.1%

MATH-50042.7%

ARC Challenge—

General

Chatbot Arena—

IFEval73.8%

Source: community · 2024-10-16

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights42.7 GB

KV Cache4.9 GB

Runtime0.9 GB

Headroom0.6 GB

Frequently asked questions

URL: https://willitrunai.com/models/nemotron-70b

⇱ Nemotron 70B VRAM Requirements — GPU Compatibility

Nemotron 70B

Top picks for Nemotron 70B

VRAM estimates by quant level

Nemotron 70B benchmark scores

Fit estimates across all hardware

Reference: RTX 2060 6GB

FAQ — Nemotron 70B