VOOZH about

URL: https://willitrunai.com/models/all-minilm-l6-v2

⇱ All MiniLM L6 v2 VRAM Requirements — GPU Compatibility


👁 Sentence Transformers
Sentence Transformers

All MiniLM L6 v2

Current
👁 huggingface
HuggingFace👁 ollama
Ollama
207.7MDownloads4.6KLikesAug 2021Released0K tokensContextApache 2.0License64 GoodQuality

All MiniLM L6 v2 (0.023000000044703484B parameters) requires approximately 2.1 GB of VRAM with F16 quantization. For the best balance of quality and speed, we recommend hardware with at least 3 GB of VRAM.

Get started

— copy & paste to run locally

Copy-paste commands to run All MiniLM L6 v2 on your machine.

Run

ollama run all-minilm

Quick specs

Parameters0.02B
Architecturedense
Context0K tokens
Modalityembedding
Min RAM0 GB
Rec. RAM0 GB (F16)
LicenseApache 2.0
FamilyMiniLM
✓ RAG

About this model

This is a sentence-transformers model: It maps sentences & paragraphs to a 384 dimensional dense vector space and can be used for tasks like clustering or semantic search.

Your hardware

Detecting...

Quick picks

👁 Intel
Best budgetB
Intel Arc A380 6GB~$139 — 2 tok/s
👁 NVIDIA
Best overallB
GTX 1650 4GB~$149 — 2 tok/s

Best hardware

Top picks for All MiniLM L6 v2

Intel Arc A370M 4GBB
4 GB
GTX 1650 4GBB
4 GB
RTX 3050 Ti Laptop 4GBB
4 GB
RTX 2060 6GBB
6 GB
RTX 4050 Laptop 6GBB
6 GB

Run this model

All MiniLM L6 v2 on Intel Arc A370M 4GBAll MiniLM L6 v2 on GTX 1650 4GBAll MiniLM L6 v2 on RTX 3050 Ti Laptop 4GB

Quantization options

VRAM estimates by quant level

No hardware detected — fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
0.0 GB
Low
Q3_K_S
3
0.0 GB
Low
NVFP4
4
0.0 GB
Medium
Q4_K_M
4
0.0 GB
Medium
Q5_K_M
5
0.0 GB
High
Q6_K
6
0.0 GB
High
Q8_0
8
0.0 GB
Very High
F16
16
0.0 GB
Maximum

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights0.0 GB
KV Cache0.3 GB
Runtime1.2 GB
Headroom0.6 GB

Frequently asked questions

FAQ — All MiniLM L6 v2

See also

Quantization GuideScoring MethodologyVRAM Calculator