URL: https://willitrunai.com/models/stablelm-2-12b

StableLM 2 12B VRAM Requirements: GPU Compatibility


๐Ÿ‘ Stability AI
Stability AI

StableLM 2 12B

Legacy
๐Ÿ‘ huggingface
HuggingFace
Downloads: 265
Likes: 89
Released: Apr 2024
Context: 4K tokens
License: Stability AI Community
Quality: 4 (Entry)

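StableLM 2 12B (12B parameters) requires approximately 22.3 GB of VRAM in total with Q5_K_M quantization: about 8.6 GB for the weights, 12.2 GB for the KV cache, 0.9 GB of runtime overhead, and 0.6 GB of headroom. For the best balance of quality and speed, we recommend hardware with at least 26 GB of VRAM.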

Get started

Copy-paste commands to run StableLM 2 12B on your machine.

Run

docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \
  --hf-repo "stabilityai/stablelm-2-12b-chat" \
  --hf-file "stablelm-2-12b-chat-Q5_K_M.gguf" \
  -c 4096 -ngl 99

Quick specs

Parameters: 12B
Architecture: dense
Context: 4K tokens
Modality: text
Min RAM: 4.7 GB
Rec. RAM: 8.6 GB (Q5_K_M)
License: Stability AI Community
Family: StableLM
✓ Chat

About this model

`Stable LM 2 12B Chat` is a 12 billion parameter instruction tuned language model trained on a mix of publicly available datasets and synthetic datasets, utilizing Direct Preference Optimization (DPO).

  • Developed by: Stability AI
  • Model type: StableLM 2 12B Chat model is an auto-regressive language model based on the transformer decoder architecture
  • Language(s): English
  • Paper: Stable LM 2 Chat Technical Report
  • Library: Alignment Handbook

Quick picks

Best budget (grade C): Mac mini M4 64GB, ~$1,099, 8 tok/s
Best overall (grade B): RTX 5090 32GB, ~$1,999, 103 tok/s

Best hardware

Top picks for StableLM 2 12B

  • RTX 5090 32GB (grade B)
  • RTX PRO 4500 Blackwell 32GB (grade B)
  • AMD Instinct MI100 32GB (grade B)
  • NVIDIA A100 40GB (grade B)
  • NVIDIA V100 32GB (grade B)

Run this model

  • StableLM 2 12B on RTX 5090 32GB
  • StableLM 2 12B on RTX PRO 4500 Blackwell 32GB
  • StableLM 2 12B on AMD Instinct MI100 32GB

Quantization options

VRAM estimates by quant level

Quant    Bits  VRAM     Quality
Q2_K     2     4.7 GB   Low
Q3_K_S   3     5.9 GB   Low
NVFP4    4     6.7 GB   Medium
Q4_K_M   4     7.3 GB   Medium
Q5_K_M   5     8.6 GB   High
Q6_K     6     9.8 GB   High
Q8_0     8     12.8 GB  Very High
F16      16    24.6 GB  Maximum
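
To make the quant-level estimates easier to apply, here is a minimal Python sketch that picks the largest quantization that fits a given amount of VRAM. The sizes are copied from the estimates in this section. The function name `largest_quant_that_fits` and the `overhead_gb` parameter are hypothetical, introduced only for illustration. The sketch assumes that the listed sizes cover the weights alone (the Q5_K_M figure matches the Weights entry in the page's memory breakdown) and that any KV cache or runtime overhead is passed in separately.

```python
# Per-quant size estimates in GB, copied from this section,
# ordered from smallest to largest.
QUANTS = [
    ("Q2_K", 4.7),
    ("Q3_K_S", 5.9),
    ("NVFP4", 6.7),
    ("Q4_K_M", 7.3),
    ("Q5_K_M", 8.6),
    ("Q6_K", 9.8),
    ("Q8_0", 12.8),
    ("F16", 24.6),
]


def largest_quant_that_fits(vram_gb, overhead_gb=0.0):
    """Return the largest quant whose size plus overhead fits in vram_gb.

    overhead_gb is a hypothetical knob for KV cache, runtime, and headroom.
    Returns None when even the smallest quant does not fit.
    """
    best = None
    for name, size_gb in QUANTS:
        if size_gb + overhead_gb <= vram_gb:
            best = name  # the list is sorted, so later matches are larger
    return best


if __name__ == "__main__":
    # Weights only on an 8 GB card.
    print(largest_quant_that_fits(8.0))
    # 32 GB card with 13.7 GB of overhead
    # (12.2 KV cache + 0.9 runtime + 0.6 headroom from the memory breakdown).
    print(largest_quant_that_fits(32.0, overhead_gb=13.7))
```

Treating the overhead as a single fixed number is a simplification: in practice the KV cache grows with context length, so the same card may fit a larger quant at a shorter context.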

Quality benchmarks

StableLM 2 12B benchmark scores

Benchmark verified

Reasoning

MMLU-Pro: 19.3%
GPQA Diamond: 2.2%
MATH-500: 5.4%
ARC Challenge: 65.0%

General

Chatbot Arena: n/a
IFEval: 40.8%

Source: official, 2024-02-01

Memory breakdown

Reference: RTX 2060 6GB

Weights: 8.6 GB
KV Cache: 12.2 GB
Runtime: 0.9 GB
Headroom: 0.6 GB
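
The four components add up to the 22.3 GB figure quoted at the top of the page. The short Python sketch below reproduces that arithmetic and checks it against a card's memory. The `fits` helper is a hypothetical name used for illustration, and the comparison is a simple threshold that assumes all four components must reside in VRAM at once.

```python
# Component sizes in GB, copied from the memory breakdown above.
breakdown_gb = {
    "weights": 8.6,
    "kv_cache": 12.2,
    "runtime": 0.9,
    "headroom": 0.6,
}

# Round to one decimal to match the precision of the listed figures.
total_gb = round(sum(breakdown_gb.values()), 1)


def fits(vram_gb):
    """Return True if the full breakdown fits in vram_gb of memory."""
    return vram_gb >= total_gb


if __name__ == "__main__":
    print(f"Estimated total: {total_gb} GB")
    for name, size in breakdown_gb.items():
        print(f"  {name}: {size} GB ({size / total_gb:.0%} of total)")
    # The reference card listed above has 6 GB of VRAM.
    print("Fits on a 6 GB card:", fits(6.0))
    print("Fits on a 32 GB card:", fits(32.0))
```

By this estimate the 6 GB reference card cannot hold the full Q5_K_M configuration, while the 32 GB cards in the top picks can.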

See also

  • Quantization Guide
  • Scoring Methodology
  • VRAM Calculator