VOOZH about

URL: https://willitrunai.com/browse/best-for/4gb

⇱ Best AI Models for 4GB VRAM — Local LLMs | WillItRunAI


1
BGE M30.5680000185966492B
F163.6 GB VRAM8.0 tok/sTight fit
AGreat
2
mxbai Embed Large0.33500000834465027B
F163.5 GB VRAM4.7 tok/sTight fit
AGreat
3
Snowflake Arctic Embed L0.33500000834465027B
F163.5 GB VRAM4.7 tok/sTight fit
AGreat
4
Nomic Embed Text v1.50.13699999451637268B
F162.1 GB VRAM2.0 tok/sRuns great
AGreat
5
BGE Large EN v1.50.33500000834465027B
F163.5 GB VRAM4.7 tok/sTight fit
AGreat
6
All MiniLM L6 v20.023000000044703484B
F161.6 GB VRAM2.0 tok/sRuns great
BGood
7
Q4_K_M3.9 GB VRAM42.0 tok/sNeeds offload
AGreat
8
Q4_K_M4.2 GB VRAM28.0 tok/sNeeds offload
BGood
9
Qwen 3 1.7B1.7000000476837158B
Q4_K_M4.0 GB VRAM23.8 tok/sNeeds offload
BGood
10
Q4_K_M2.6 GB VRAM21.0 tok/sRuns great
BGood
11
TinyLlama 1.1B1.100000023841858B
Q4_K_M2.3 GB VRAM15.4 tok/sRuns great
BGood
12
Q4_K_M2.6 GB VRAM21.0 tok/sRuns great
BGood
13
Q4_K_M1.8 GB VRAM7.0 tok/sRuns great
CUsable
14
Q4_K_M2.6 GB VRAM21.0 tok/sRuns great
BGood
15
Q4_K_M2.3 GB VRAM14.0 tok/sRuns great
BGood
16
Qwen 3 0.6B0.6000000238418579B
Q4_K_M2.5 GB VRAM8.4 tok/sRuns great
CUsable
17
Q4_K_M4.1 GB VRAM28.0 tok/sNeeds offload
CUsable
18
Q4_K_M2.4 GB VRAM14.0 tok/sRuns great
CUsable
19
Qwen 3.5 0.6B0.6000000238418579B
Q4_K_M2.5 GB VRAM8.4 tok/sRuns great
CUsable
20
Q4_K_M1.8 GB VRAM7.0 tok/sRuns great
CUsable
21
Q4_K_M2.8 GB VRAM28.0 tok/sRuns great
CUsable
22
Q6_K3.2 GB VRAM28.0 tok/sRuns great
CUsable
23
Q5_K_M3.8 GB VRAM42.0 tok/sNeeds offload
CUsable
24
Q4_K_M4.2 GB VRAM56.0 tok/sNeeds offload
CUsable
25
Q6_K2.2 GB VRAM14.0 tok/sRuns great
CUsable
26
Q4_K_M3.5 GB VRAM42.0 tok/sTight fit
CUsable
27
Q4_K_M2.4 GB VRAM21.0 tok/sRuns great
CUsable
28
Q6_K1.8 GB VRAM7.0 tok/sRuns great
CUsable
29
TinyLlama 1.1B Chat v1.01.100000023841858B
Q4_K_M2.1 GB VRAM15.4 tok/sRuns great
CUsable
30
Q4_K_M2.8 GB VRAM28.0 tok/sRuns great
CUsable
31
Q4_K_M4.2 GB VRAM56.0 tok/sNeeds offload
CUsable
32
embeddinggemma 300M0.30000001192092896B
Q6_K1.6 GB VRAM4.2 tok/sRuns great
CUsable
33
Q4_K_M2.4 GB VRAM21.0 tok/sRuns great
CUsable
34
Q4_K_M4.2 GB VRAM56.0 tok/sNeeds offload
CUsable
35
Q4_K_M2.4 GB VRAM21.0 tok/sRuns great
CUsable
36
Q4_K_M2.0 GB VRAM14.0 tok/sRuns great
CUsable
37
Q4_K_M2.8 GB VRAM28.0 tok/sRuns great
CUsable
38
Q4_K_M3.5 GB VRAM42.0 tok/sTight fit
CUsable
39
Q4_K_M2.0 GB VRAM14.0 tok/sRuns great
CUsable
40
Q4_K_M3.5 GB VRAM42.0 tok/sTight fit
CUsable
41
TinyLlama 1.1B Chat v0.31.100000023841858B
Q4_K_M2.1 GB VRAM15.4 tok/sRuns great
CUsable
42
TinyLlama 1.1B Chat v0.61.100000023841858B
Q4_K_M2.1 GB VRAM15.4 tok/sRuns great
CUsable
43
Q4_K_M3.5 GB VRAM42.0 tok/sTight fit
CUsable
44
Q4_K_M1.5 GB VRAM2.0 tok/sRuns great
DPoor
45
Q4_K_M3.5 GB VRAM42.0 tok/sTight fit
CUsable
46
Q4_K_M3.5 GB VRAM42.0 tok/sTight fit
CUsable
47
EXAONE 4.0 1.2B1.2000000476837158B
Q4_K_M2.2 GB VRAM16.8 tok/sRuns great
CUsable
48
Q4_K_M3.5 GB VRAM42.0 tok/sTight fit
CUsable
49
EXAONE 3.5 2.4B Instruct2.4000000953674316B
Q4_K_M3.0 GB VRAM33.6 tok/sRuns great
CUsable
50
Q4_K_M2.4 GB VRAM21.0 tok/sRuns great
CUsable
51
Falcon H1 Tiny 90M Instruct0.09000000357627869B
Q4_K_M1.5 GB VRAM2.0 tok/sRuns great
DPoor
52
Q4_K_M3.5 GB VRAM42.0 tok/sTight fit
CUsable
53
Q4_K_M2.0 GB VRAM14.0 tok/sRuns great
CUsable
54
Q4_K_M3.5 GB VRAM42.0 tok/sTight fit
CUsable
55
Q4_K_M3.5 GB VRAM42.0 tok/sTight fit
CUsable
56
stablelm 2 zephyr 1.6b1.600000023841858B
Q4_K_M2.5 GB VRAM22.4 tok/sRuns great
CUsable
57
Q4_K_M2.4 GB VRAM21.0 tok/sRuns great
CUsable
58
logos16v2 stablelm2 1.6b i11.600000023841858B
Q4_K_M2.5 GB VRAM22.4 tok/sRuns great
CUsable
59
Q4_K_M3.5 GB VRAM42.0 tok/sTight fit
CUsable
60
Q4_K_M2.1 GB VRAM15.4 tok/sRuns great
CUsable
61
Q4_K_M3.5 GB VRAM42.0 tok/sTight fit
CUsable
62
Q4_K_M3.5 GB VRAM42.0 tok/sTight fit
CUsable
63
Q4_K_M3.5 GB VRAM42.0 tok/sTight fit
CUsable
64
Q4_K_M3.5 GB VRAM42.0 tok/sTight fit
CUsable
65
Q4_K_M3.6 GB VRAM42.0 tok/sTight fit
CUsable