👁 Sentence Transformers
Sentence Transformers
All MiniLM L6 v2
Current207.7MDownloads4.6KLikesAug 2021Released0K tokensContextApache 2.0License64 GoodQuality
All MiniLM L6 v2 (0.023000000044703484B parameters) requires approximately 2.1 GB of VRAM with F16 quantization. For the best balance of quality and speed, we recommend hardware with at least 3 GB of VRAM.
Get started
— copy & paste to run locallyCopy-paste commands to run All MiniLM L6 v2 on your machine.
Run
ollama run all-minilmQuick specs
Parameters0.02B
Architecturedense
Context0K tokens
Modalityembedding
Min RAM0 GB
Rec. RAM0 GB (F16)
LicenseApache 2.0
FamilyMiniLM
✓ RAG
About this model
Your hardware
Detecting...
Quick picks
Best hardware
Top picks for All MiniLM L6 v2
Run this model
Quantization options
VRAM estimates by quant level
No hardware detected — fit column shows raw VRAM estimates
| Quant | Bits | VRAM | Quality | Fit |
|---|---|---|---|---|
Q2_K | 2 | 0.0 GB | Low | — |
Q3_K_S | 3 | 0.0 GB | Low | — |
NVFP4 | 4 | 0.0 GB | Medium | — |
Q4_K_M | 4 | 0.0 GB | Medium | — |
Q5_K_M | 5 | 0.0 GB | High | — |
Q6_K | 6 | 0.0 GB | High | — |
Q8_0 | 8 | 0.0 GB | Very High | — |
F16 | 16 | 0.0 GB | Maximum | — |
Hardware compatibility
Fit estimates across all hardware
Computing compatibility...
Memory breakdown
Reference: RTX 2060 6GB
Weights0.0 GB
KV Cache0.3 GB
Runtime1.2 GB
Headroom0.6 GB
Frequently asked questions
FAQ — All MiniLM L6 v2
See also
