Llama 3.2 1B

Legacy

8.4MDownloads1.5KLikesSep 2024Released128K tokensContextCommunityLicense14 EntryQuality

Llama 3.2 1B (1B parameters) requires approximately 2.9 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 4 GB of VRAM.

Get started

— copy & paste to run locally

Copy-paste commands to run Llama 3.2 1B on your machine.

Run

ollama run llama3.2:1b

Quick specs

Parameters1B

Architecturedense

Context128K tokens

Modalitytext

Min RAM0.4 GB

Rec. RAM0.6 GB (Q4_K_M)

LicenseCommunity

FamilyLlama

✓ Chat

About this model

Llama 3.2 1B is Meta's smallest text model designed for on-device inference. Optimized for multilingual text generation, summarization, and instruction following on resource-constrained hardware.

Related models

Your hardware

Detecting...

Quick picks

👁 Intel

Best budgetC

Intel Arc A380 6GB~$139 — 14 tok/s

👁 NVIDIA

Best overallC

GTX 1650 4GB~$149 — 14 tok/s

Best hardware

Top picks for Llama 3.2 1B

👁 NVIDIA

GTX 1650 4GBC

4 GB

👁 NVIDIA

RTX 3050 Ti Laptop 4GBC

4 GB

👁 Intel

Intel Arc A370M 4GBC

4 GB

👁 NVIDIA

RTX 2060 6GBC

6 GB

👁 NVIDIA

RTX 4050 Laptop 6GBC

6 GB

Run this model

Llama 3.2 1B on GTX 1650 4GB Llama 3.2 1B on RTX 3050 Ti Laptop 4GB Llama 3.2 1B on Intel Arc A370M 4GB

Quantization options

VRAM estimates by quant level

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	0.4 GB	Low	—
Q3_K_S	3	0.5 GB	Low	—
NVFP4	4	0.6 GB	Medium	—
Q4_K_M	4	0.6 GB	Medium	—
Q5_K_M	5	0.7 GB	High	—
Q6_K	6	0.8 GB	High	—
Q8_0	8	1.1 GB	Very High	—
F16	16	2.1 GB	Maximum	—

Quality benchmarks

Llama 3.2 1B benchmark scores

Benchmark verified

Reasoning

MMLU-Pro7.6%

GPQA Diamond27.2%

MATH-50030.6%

ARC Challenge59.4%

General

Chatbot Arena—

IFEval59.5%

Source: official · 2024-09-25

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights0.6 GB

KV Cache0.5 GB

Runtime1.2 GB

Headroom0.6 GB

Frequently asked questions

URL: https://willitrunai.com/models/llama-3.2-1b

⇱ Llama 3.2 1B VRAM Requirements — GPU Compatibility

Llama 3.2 1B

Top picks for Llama 3.2 1B

VRAM estimates by quant level

Llama 3.2 1B benchmark scores

Fit estimates across all hardware

Reference: RTX 2060 6GB

FAQ — Llama 3.2 1B