VOOZH about

URL: https://willitrunai.com/models/ministral-3-3b

⇱ Ministral 3 3B VRAM Requirements — GPU Compatibility


👁 Mistral
Mistral

Ministral 3 3B

Frontier
👁 huggingface
HuggingFace
1.2MDownloads252LikesOct 2025Released262K tokensContextApache 2.0License54 GoodQuality

Ministral 3 3B (3B parameters) requires approximately 5.6 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 7 GB of VRAM.

Get started

— copy & paste to run locally

Copy-paste commands to run Ministral 3 3B on your machine.

Run

docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \ --hf-repo "mistralai/Ministral-3-3B-Instruct-2512" \ --hf-file "Ministral-3-3B-Instruct-2512-Q4_K_M.gguf" \ -c 4096 -ngl 99

Quick specs

Parameters3B
Architecturemultimodal
Context262K tokens
Modalitytext+vision
Min RAM1.2 GB
Rec. RAM1.8 GB (Q4_K_M)
LicenseApache 2.0
FamilyMinistral
✓ Vision✓ Chat

About this model

The smallest model in the Ministral 3 family, Ministral 3 3B is a powerful, efficient tiny language model with vision capabilities.

  • 3.4B Language Model
  • 0.4B Vision Encoder
  • Vision: Enables the model to analyze images and provide insights based on visual content, in addition to text
  • Multilingual: Supports dozens of languages, including English, French, Spanish, German, Italian, Portuguese, Dutch, Chinese, Japanese, Korean, Arabic
  • System Prompt: Maintains strong adherence and support for system prompts

Related models

Your hardware

Detecting...

Quick picks

👁 Intel
Best budgetA
Intel Arc A380 6GB~$139 — 42 tok/s
👁 NVIDIA
Best overallA
RTX 3050 8GB~$249 — 42 tok/s

Best hardware

Top picks for Ministral 3 3B

RTX 3050 8GBA
8 GB
RTX 3060 Ti 8GBA
8 GB
RTX 3070 8GBA
8 GB
RTX 4060 8GBA
8 GB
RTX 4070 Laptop 8GBA
8 GB

Run this model

Ministral 3 3B on RTX 3050 8GBMinistral 3 3B on RTX 3060 Ti 8GBMinistral 3 3B on RTX 3070 8GB

Quantization options

VRAM estimates by quant level

No hardware detected — fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
1.2 GB
Low
Q3_K_S
3
1.5 GB
Low
NVFP4
4
1.7 GB
Medium
Q4_K_M
4
1.8 GB
Medium
Q5_K_M
5
2.2 GB
High
Q6_K
6
2.5 GB
High
Q8_0
8
3.2 GB
Very High
F16
16
6.1 GB
Maximum

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights1.8 GB
KV Cache0.7 GB
Runtime2.4 GB
Headroom0.6 GB

Frequently asked questions

FAQ — Ministral 3 3B

See also

Quantization GuideScoring MethodologyVRAM Calculator