Llama 3.2 11B Vision

Legacy

47.0KDownloads1.6KLikesSep 2024Released16K tokensContextCommunityLicense36 BasicQuality

Llama 3.2 11B Vision (11B parameters) requires approximately 10.5 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 13 GB of VRAM.

Get started

— copy & paste to run locally

Copy-paste commands to run Llama 3.2 11B Vision on your machine.

Run

ollama run llama3.2-vision:11b

Quick specs

Parameters11B

Architecturevision

Context16K tokens

Modalitytext+vision

Min RAM4.3 GB

Rec. RAM6.7 GB (Q4_K_M)

LicenseCommunity

FamilyLlama Vision

✓ Vision✓ Chat

About this model

Llama 3.2 11B Vision is Meta's multimodal model that processes both text and images. Supports visual question answering, image captioning, and document understanding alongside standard text generation.

Your hardware

Detecting...

Quick picks

👁 Intel

Best budgetB

Intel Arc B580 12GB~$249 — 35 tok/s

👁 NVIDIA

Best overallA

RTX 4080 Super 16GB~$999 — 98 tok/s

Best hardware

Top picks for Llama 3.2 11B Vision

👁 NVIDIA

RTX 4080 Super 16GBA

16 GB

👁 NVIDIA

RTX 5070 Ti 16GBA

16 GB

👁 NVIDIA

RTX 5080 16GBA

16 GB

👁 NVIDIA

RTX 5080 Laptop 16GBA

16 GB

👁 NVIDIA

RTX 4070 Ti Super 16GBA

16 GB

Run this model

Llama 3.2 11B Vision on RTX 4080 Super 16GB Llama 3.2 11B Vision on RTX 5070 Ti 16GB Llama 3.2 11B Vision on RTX 5080 16GB

Quantization options

VRAM estimates by quant level

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	4.3 GB	Low	—
Q3_K_S	3	5.4 GB	Low	—
NVFP4	4	6.2 GB	Medium	—
Q4_K_M	4	6.7 GB	Medium	—
Q5_K_M	5	7.9 GB	High	—
Q6_K	6	9.0 GB	High	—
Q8_0	8	11.8 GB	Very High	—
F16	16	22.5 GB	Maximum	—

Quality benchmarks