URL: https://willitrunai.com/models/llava-1.5-7b

LLaVA 1.5 7B VRAM Requirements: GPU Compatibility


LLaVA 1.5 7B

Status: Legacy
Available on: HuggingFace, Ollama
Downloads: 240.6K
Likes: 556
Released: Oct 2023
Context: 4K tokens
License: Apache 2.0
Quality: 44 (Basic)

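LLaVA 1.5 7B (7B parameters) requires approximately 13.9 GB of VRAM in total with Q4_K_M quantization: 4.3 GB for the weights, with the remainder taken by the KV cache, runtime overhead, and headroom. For the best balance of quality and speed, we recommend hardware with at least 16 GB of VRAM.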

Get started

Copy-paste commands to run LLaVA 1.5 7B on your machine.

Run

ollama run llava
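
Once the model has been pulled with the command above, it can also be queried programmatically. The following is a minimal sketch, assuming a local Ollama server listening on its default port (11434) and using the `/api/generate` endpoint with a base64-encoded image. The helper names `build_request` and `describe_image` are illustrative, not part of Ollama.

```python
import base64
import json
import urllib.request

# Assumption: a local Ollama server on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(image_bytes: bytes, prompt: str, model: str = "llava") -> dict:
    """Build the JSON payload for Ollama's /api/generate endpoint."""
    return {
        "model": model,
        "prompt": prompt,
        # Images are sent as base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,  # ask for a single JSON object, not a stream
    }


def describe_image(path: str, prompt: str = "Describe this image.") -> str:
    """Send one image to the local server and return the model's reply."""
    with open(path, "rb") as f:
        payload = build_request(f.read(), prompt)
    req = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

Calling `describe_image("photo.jpg")` (a hypothetical file name) would return the model's description as a string, provided the server is running.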

Quick specs

Parameters: 7B
Architecture: dense
Context: 4K tokens
Modality: text+vision
Min RAM: 2.7 GB
Rec. RAM: 4.3 GB (Q4_K_M)
License: Apache 2.0
Family: LLaVA
Capabilities: Vision, Chat

About this model

Model type: LLaVA is an open-source chatbot trained by fine-tuning LLaMA/Vicuna on GPT-generated multimodal instruction-following data. It is an auto-regressive language model, based on the transformer architecture.

Training dataset:

  • 558K filtered image-text pairs from LAION/CC/SBU, captioned by BLIP
  • 158K GPT-generated multimodal instruction-following data
  • 450K academic-task-oriented VQA data mixture
  • 40K ShareGPT data

Quick picks

Best budget (grade B): RX 7600 XT 16GB, ~$329, 39 tok/s
Best overall (grade A): RX 7900 XT 20GB, ~$899, 98 tok/s

Best hardware

Top picks for LLaVA 1.5 7B

  • RX 7900 XT 20GB (grade A), 20 GB
  • RTX A4500 20GB (grade A), 20 GB
  • RTX 3090 24GB (grade A), 24 GB
  • RTX 3090 Ti 24GB (grade A), 24 GB
  • RTX 4090 24GB (grade A), 24 GB

Quantization options

VRAM estimates by quant level

Quant     Bits   VRAM      Quality
Q2_K      2      2.7 GB    Low
Q3_K_S    3      3.4 GB    Low
NVFP4     4      3.9 GB    Medium
Q4_K_M    4      4.3 GB    Medium
Q5_K_M    5      5.0 GB    High
Q6_K      6      5.7 GB    High
Q8_0      8      7.5 GB    Very High
F16       16     14.3 GB   Maximum
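
The per-quant estimates above can be turned into a simple lookup. The sketch below picks the largest quantization whose estimate fits a given VRAM budget. It assumes the listed figures cover weights only (the Q4_K_M value matches the Weights line in the memory breakdown), so the optional `overhead_gb` parameter, which is an illustrative addition, lets you reserve room for KV cache and runtime.

```python
# VRAM estimates in GB, copied from the quantization table.
QUANT_VRAM_GB = {
    "Q2_K": 2.7,
    "Q3_K_S": 3.4,
    "NVFP4": 3.9,
    "Q4_K_M": 4.3,
    "Q5_K_M": 5.0,
    "Q6_K": 5.7,
    "Q8_0": 7.5,
    "F16": 14.3,
}


def largest_quant_that_fits(vram_gb, overhead_gb=0.0):
    """Return the biggest quant whose estimate fits, or None if none do.

    overhead_gb is a hypothetical reserve for KV cache and runtime;
    it is subtracted from the card's VRAM before comparing.
    """
    budget = vram_gb - overhead_gb
    fitting = {q: gb for q, gb in QUANT_VRAM_GB.items() if gb <= budget}
    if not fitting:
        return None
    # A larger estimate means more bits per weight, hence higher quality.
    return max(fitting, key=fitting.get)
```

For example, `largest_quant_that_fits(6)` selects Q6_K, while reserving 1.2 GB of overhead on the same card drops the choice to Q4_K_M.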

Memory breakdown

Reference: RTX 2060 6GB

Weights: 4.3 GB
KV Cache: 7.8 GB
Runtime: 1.2 GB
Headroom: 0.6 GB
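
As a quick arithmetic check, the four components can be summed and compared against a card's capacity. This is a minimal sketch using only the figures listed above; the function names are illustrative.

```python
# Memory components in GB, copied from the breakdown (Q4_K_M weights).
BREAKDOWN_GB = {
    "weights": 4.3,
    "kv_cache": 7.8,
    "runtime": 1.2,
    "headroom": 0.6,
}


def total_vram_gb(breakdown):
    """Sum the components, rounded to one decimal to avoid float noise."""
    return round(sum(breakdown.values()), 1)


def fits(breakdown, card_vram_gb):
    """True if the summed estimate fits within the card's VRAM."""
    return total_vram_gb(breakdown) <= card_vram_gb
```

With these figures the total is 13.9 GB, which fits within the recommended 16 GB but not on a 6 GB card.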
