👁 Sentence Transformers
Sentence Transformers

All MiniLM L6 v2

Current

👁 huggingface
HuggingFace 👁 ollama
Ollama

207.7MDownloads4.6KLikesAug 2021Released0K tokensContextApache 2.0License64 GoodQuality

All MiniLM L6 v2 (0.023000000044703484B parameters) requires approximately 2.1 GB of VRAM with F16 quantization. For the best balance of quality and speed, we recommend hardware with at least 3 GB of VRAM.

Get started

— copy & paste to run locally

Copy-paste commands to run All MiniLM L6 v2 on your machine.

Run

ollama run all-minilm

Quick specs

Parameters0.02B

Architecturedense

Context0K tokens

Modalityembedding

Min RAM0 GB

Rec. RAM0 GB (F16)

LicenseApache 2.0

FamilyMiniLM

✓ RAG

About this model

This is a sentence-transformers model: It maps sentences & paragraphs to a 384 dimensional dense vector space and can be used for tasks like clustering or semantic search.

Your hardware

Detecting...

Quick picks

👁 Intel

Best budgetB

Intel Arc A380 6GB~$139 — 2 tok/s

👁 NVIDIA

Best overallB

GTX 1650 4GB~$149 — 2 tok/s

Best hardware

Top picks for All MiniLM L6 v2

👁 Intel

Intel Arc A370M 4GBB

4 GB

👁 NVIDIA

GTX 1650 4GBB

4 GB

👁 NVIDIA

RTX 3050 Ti Laptop 4GBB

4 GB

👁 NVIDIA

RTX 2060 6GBB

6 GB

👁 NVIDIA

RTX 4050 Laptop 6GBB

6 GB

Run this model

All MiniLM L6 v2 on Intel Arc A370M 4GB All MiniLM L6 v2 on GTX 1650 4GB All MiniLM L6 v2 on RTX 3050 Ti Laptop 4GB

Quantization options

VRAM estimates by quant level

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	Quality	Fit
Q2_K	2	Low	—
Q3_K_S	3	Low	—
NVFP4	4	Medium	—
Q4_K_M	4	Medium	—
Q5_K_M	5	High	—
Q6_K	6	High	—
Q8_0	8	Very High	—
F16	16	Maximum	—

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights0.0 GB

KV Cache0.3 GB

Runtime1.2 GB

Headroom0.6 GB

Frequently asked questions

URL: https://willitrunai.com/models/all-minilm-l6-v2

⇱ All MiniLM L6 v2 VRAM Requirements — GPU Compatibility

All MiniLM L6 v2

Top picks for All MiniLM L6 v2

VRAM estimates by quant level

Fit estimates across all hardware

Reference: RTX 2060 6GB

FAQ — All MiniLM L6 v2