VOOZH about

URL: https://willitrunai.com/models/kimi-linear-48b-a3b

โ‡ฑ Kimi Linear 48B A3B VRAM Requirements โ€” GPU Compatibility


๐Ÿ‘ Moonshot AI
Moonshot AI

Kimi Linear 48B A3B

Current
๐Ÿ‘ huggingface
HuggingFace
71.1KDownloads564LikesOct 2025Released1.0M tokensContextMITLicense76 StrongQuality

Kimi Linear 48B A3B (48B parameters) requires approximately 33.2 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 39 GB of VRAM.

Get started

โ€” copy & paste to run locally

Copy-paste commands to run Kimi Linear 48B A3B on your machine.

Run

docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \ --hf-repo "moonshotai/Kimi-Linear-48B-A3B-Instruct" \ --hf-file "Kimi-Linear-48B-A3B-Instruct-Q4_K_M.gguf" \ -c 4096 -ngl 99

Quick specs

Parameters48B
Architecturelinear
Context1.0M tokens
Modalitytext
Min RAM18.7 GB
Rec. RAM29.3 GB (Q4_K_M)
LicenseMIT
FamilyKimi Linear
โœ“ Codeโœ“ Chatโœ“ Reasoning

About this model

Kimi Linear is Moonshot AI's long-context efficient architecture release, using Kimi Delta Attention to cut KV-cache pressure and improve decoding throughput at very long sequence lengths.

  • โ€ข48B total params with 3B activated and 1M context
  • โ€ขKimi Delta Attention for lower KV-cache usage
  • โ€ขDesigned for efficient long-context inference and high decode throughput

Your hardware

Detecting...

Quick picks

Best budgetA
Mac mini M4 64GB~$1,099 โ€” 5 tok/s
๐Ÿ‘ NVIDIA
Best overallS
RTX PRO 5000 Blackwell 48GB~$4,999 โ€” 31 tok/s

Best hardware

Top picks for Kimi Linear 48B A3B

RTX PRO 5000 Blackwell 48GBS
48 GB
RTX 6000 Ada 48GBA
48 GB
NVIDIA H100 80GBA
80 GB
AMD Instinct MI210 64GBA
64 GB
NVIDIA H800 80GBA
80 GB

Run this model

Kimi Linear 48B A3B on RTX PRO 5000 Blackwell 48GBKimi Linear 48B A3B on RTX 6000 Ada 48GBKimi Linear 48B A3B on NVIDIA H100 80GB

Quantization options

VRAM estimates by quant level

No hardware detected โ€” fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
18.7 GB
Lowโ€”
Q3_K_S
3
23.5 GB
Lowโ€”
NVFP4
4
26.9 GB
Mediumโ€”
Q4_K_M
4
29.3 GB
Mediumโ€”
Q5_K_M
5
34.6 GB
Highโ€”
Q6_K
6
39.4 GB
Highโ€”
Q8_0
8
51.4 GB
Very Highโ€”
F16
16
98.4 GB
Maximumโ€”

Quality benchmarks

Kimi Linear 48B A3B benchmark scores

Benchmark verified

Reasoning

MMLU-Pro51.0%
GPQA Diamondโ€”
MATH-500โ€”
ARC Challengeโ€”

Source: official ยท 2025-10-30

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights29.3 GB
KV Cache0.9 GB
Runtime2.4 GB
Headroom0.6 GB

Frequently asked questions

FAQ โ€” Kimi Linear 48B A3B

See also

Quantization GuideScoring MethodologyVRAM Calculator