Voozh

BGE M3 · 0.568B

F16 · 3.6 GB VRAM · 8.0 tok/s · Tight fit

A · Great

mxbai Embed Large · 0.335B

F16 · 3.5 GB VRAM · 4.7 tok/s · Tight fit

A · Great

Snowflake Arctic Embed L · 0.335B

F16 · 3.5 GB VRAM · 4.7 tok/s · Tight fit

A · Great

Nomic Embed Text v1.5 · 0.137B

F16 · 2.1 GB VRAM · 2.0 tok/s · Runs great

A · Great

BGE Large EN v1.5 · 0.335B

F16 · 3.5 GB VRAM · 4.7 tok/s · Tight fit

A · Great

All MiniLM L6 v2 · 0.023B

F16 · 1.6 GB VRAM · 2.0 tok/s · Runs great

B · Good

Ministral 3 3B · 3B

Q4_K_M · 3.9 GB VRAM · 42.0 tok/s · Needs offload

A · Great

Qwen 3.5 2B · 2B

Q4_K_M · 4.2 GB VRAM · 28.0 tok/s · Needs offload

B · Good

Qwen 3 1.7B · 1.7B

Q4_K_M · 4.0 GB VRAM · 23.8 tok/s · Needs offload

B · Good

Qwen 2.5 Coder 1.5B · 1.5B

Q4_K_M · 2.6 GB VRAM · 21.0 tok/s · Runs great

B · Good

TinyLlama 1.1B · 1.1B

Q4_K_M · 2.3 GB VRAM · 15.4 tok/s · Runs great

B · Good

DeepSeek R1 1.5B · 1.5B

Q4_K_M · 2.6 GB VRAM · 21.0 tok/s · Runs great

B · Good

Qwen 2.5 Coder 0.5B · 0.5B

Q4_K_M · 1.8 GB VRAM · 7.0 tok/s · Runs great

C · Usable

Qwen 2.5 1.5B · 1.5B

Q4_K_M · 2.6 GB VRAM · 21.0 tok/s · Runs great

B · Good

Gemma 3 1B · 1B

Q4_K_M · 2.3 GB VRAM · 14.0 tok/s · Runs great

B · Good

Qwen 3 0.6B · 0.6B

Q4_K_M · 2.5 GB VRAM · 8.4 tok/s · Runs great

C · Usable

Gemma 2 2B · 2B

Q4_K_M · 4.1 GB VRAM · 28.0 tok/s · Needs offload

C · Usable

Llama 3.2 1B · 1B

Q4_K_M · 2.4 GB VRAM · 14.0 tok/s · Runs great

C · Usable

Qwen 3.5 0.6B · 0.6B

Q4_K_M · 2.5 GB VRAM · 8.4 tok/s · Runs great

C · Usable

Qwen 2.5 0.5B · 0.5B

Q4_K_M · 1.8 GB VRAM · 7.0 tok/s · Runs great

C · Usable

gemma 2b · 2B

Q4_K_M · 2.8 GB VRAM · 28.0 tok/s · Runs great

C · Usable

gemma 2 2b it · 2B

Q6_K · 3.2 GB VRAM · 28.0 tok/s · Runs great

C · Usable

Llama 3.2 3B Instruct · 3B

Q5_K_M · 3.8 GB VRAM · 42.0 tok/s · Needs offload

C · Usable

Qwen3.5 4B · 4B

Q4_K_M · 4.2 GB VRAM · 56.0 tok/s · Needs offload

C · Usable

Llama 3.2 1B Instruct Q8 0 · 1B

Q6_K · 2.2 GB VRAM · 14.0 tok/s · Runs great

C · Usable

Qwen2.5 3B Instruct · 3B

Q4_K_M · 3.5 GB VRAM · 42.0 tok/s · Tight fit

C · Usable

Qwen2.5 1.5B Instruct · 1.5B

Q4_K_M · 2.4 GB VRAM · 21.0 tok/s · Runs great

C · Usable

SmolVLM 500M Instruct · 0.5B

Q6_K · 1.8 GB VRAM · 7.0 tok/s · Runs great

C · Usable

TinyLlama 1.1B Chat v1.0 · 1.1B

Q4_K_M · 2.1 GB VRAM · 15.4 tok/s · Runs great

C · Usable

Gemmasutra Mini 2B v1 · 2B

Q4_K_M · 2.8 GB VRAM · 28.0 tok/s · Runs great

C · Usable

gemma 3 4b it · 4B

Q4_K_M · 4.2 GB VRAM · 56.0 tok/s · Needs offload

C · Usable

embeddinggemma 300M · 0.3B

Q6_K · 1.6 GB VRAM · 4.2 tok/s · Runs great

C · Usable

DeepSeek R1 Distill Qwen 1.5B · 1.5B

Q4_K_M · 2.4 GB VRAM · 21.0 tok/s · Runs great

C · Usable

gemma 3 4b it · 4B

Q4_K_M · 4.2 GB VRAM · 56.0 tok/s · Needs offload

C · Usable

Yi Coder 1.5B Chat · 1.5B

Q4_K_M · 2.4 GB VRAM · 21.0 tok/s · Runs great

C · Usable

Llama 3.2 1B Instruct · 1B

Q4_K_M · 2.0 GB VRAM · 14.0 tok/s · Runs great

C · Usable

gemma 2 2b it · 2B

Q4_K_M · 2.8 GB VRAM · 28.0 tok/s · Runs great

C · Usable

Llama 3.2 3B Instruct · 3B

Q4_K_M · 3.5 GB VRAM · 42.0 tok/s · Tight fit

C · Usable

gemma 3 1b it · 1B

Q4_K_M · 2.0 GB VRAM · 14.0 tok/s · Runs great

C · Usable

Ministral 3 3B Instruct 2512 · 3B

Q4_K_M · 3.5 GB VRAM · 42.0 tok/s · Tight fit

C · Usable

TinyLlama 1.1B Chat v0.3 · 1.1B

Q4_K_M · 2.1 GB VRAM · 15.4 tok/s · Runs great

C · Usable

TinyLlama 1.1B Chat v0.6 · 1.1B

Q4_K_M · 2.1 GB VRAM · 15.4 tok/s · Runs great

C · Usable

HELVETE 3B · 3B

Q4_K_M · 3.5 GB VRAM · 42.0 tok/s · Tight fit

C · Usable

granite embedding 107m multilingual · 0.107B

Q4_K_M · 1.5 GB VRAM · 2.0 tok/s · Runs great

D · Poor

Hermes 3 Llama 3.2 3B · 3B

Q4_K_M · 3.5 GB VRAM · 42.0 tok/s · Tight fit

C · Usable

stablelm zephyr 3b · 3B

Q4_K_M · 3.5 GB VRAM · 42.0 tok/s · Tight fit

C · Usable

EXAONE 4.0 1.2B · 1.2B

Q4_K_M · 2.2 GB VRAM · 16.8 tok/s · Runs great

C · Usable

StarCoder2 3B · 3B

Q4_K_M · 3.5 GB VRAM · 42.0 tok/s · Tight fit

C · Usable

EXAONE 3.5 2.4B Instruct · 2.4B

Q4_K_M · 3.0 GB VRAM · 33.6 tok/s · Runs great

C · Usable

Yi Coder 1.5B · 1.5B

Q4_K_M · 2.4 GB VRAM · 21.0 tok/s · Runs great

C · Usable

Falcon H1 Tiny 90M Instruct · 0.09B

Q4_K_M · 1.5 GB VRAM · 2.0 tok/s · Runs great

D · Poor

AI21 Jamba Reasoning 3B · 3B

Q4_K_M · 3.5 GB VRAM · 42.0 tok/s · Tight fit

C · Usable

Falcon3 1B Instruct abliterated · 1B

Q4_K_M · 2.0 GB VRAM · 14.0 tok/s · Runs great

C · Usable

stablelm 3b 4e1t · 3B

Q4_K_M · 3.5 GB VRAM · 42.0 tok/s · Tight fit

C · Usable

ai21labs AI21 Jamba Reasoning 3B · 3B

Q4_K_M · 3.5 GB VRAM · 42.0 tok/s · Tight fit

C · Usable

stablelm 2 zephyr 1.6b · 1.6B

Q4_K_M · 2.5 GB VRAM · 22.4 tok/s · Runs great

C · Usable

Falcon H1 1.5B Instruct · 1.5B

Q4_K_M · 2.4 GB VRAM · 21.0 tok/s · Runs great

C · Usable

logos16v2 stablelm2 1.6b i1 · 1.6B

Q4_K_M · 2.5 GB VRAM · 22.4 tok/s · Runs great

C · Usable

ai21labs AI21 Jamba2 3B · 3B

Q4_K_M · 3.5 GB VRAM · 42.0 tok/s · Tight fit

C · Usable

TinyLlama 1.1B Chat v1.0 imatrix · 1.1B

Q4_K_M · 2.1 GB VRAM · 15.4 tok/s · Runs great

C · Usable

HelpingAI 3B hindi i1 · 3B

Q4_K_M · 3.5 GB VRAM · 42.0 tok/s · Tight fit

C · Usable

AI21 Jamba2 3B · 3B

Q4_K_M · 3.5 GB VRAM · 42.0 tok/s · Tight fit

C · Usable

HelpingAI 3B hindi · 3B

Q4_K_M · 3.5 GB VRAM · 42.0 tok/s · Tight fit

C · Usable

AI21 Jamba2 3B i1 · 3B

Q4_K_M · 3.5 GB VRAM · 42.0 tok/s · Tight fit

C · Usable

StarCoder2 3B · 3B

Q4_K_M · 3.6 GB VRAM · 42.0 tok/s · Tight fit

C · Usable

URL: https://willitrunai.com/browse/best-for/4gb

Best AI Models for 4GB VRAM: Local LLMs | WillItRunAI