VOOZH about

URL: https://willitrunai.com/models/internlm-20b

โ‡ฑ InternLM 20B VRAM Requirements โ€” GPU Compatibility


๐Ÿ‘ InternLM
InternLM

InternLM 20B

Legacy
๐Ÿ‘ huggingface
HuggingFace
7.0KDownloads94LikesJul 2024Released8K tokensContextInternLMLicense22 EntryQuality

InternLM 20B (20B parameters) requires approximately 34.2 GB of VRAM with Q5_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 40 GB of VRAM.

Get started

โ€” copy & paste to run locally

Copy-paste commands to run InternLM 20B on your machine.

Run

docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \ --hf-repo "internlm/internlm2_5-20b-chat" \ --hf-file "internlm2_5-20b-chat-Q5_K_M.gguf" \ -c 4096 -ngl 99

Quick specs

Parameters20B
Architecturedense
Context8K tokens
Modalitytext
Min RAM7.8 GB
Rec. RAM14.4 GB (Q5_K_M)
LicenseInternLM
FamilyInternLM
โœ“ Codeโœ“ Chat

About this model

InternLM2.5 has open-sourced a 20 billion parameter base model and a chat model tailored for practical scenarios. The model has the following characteristics:

  • โ€ขOutstanding reasoning capability: State-of-the-art performance on Math reasoning, surpassing models like Llama3 and Gemma2-27B
  • โ€ขStronger tool use: InternLM2.5 supports gathering information from more than 100 web pages, corresponding implementation has be released in...

Related models

Your hardware

Detecting...

Quick picks

Best budgetC
Mac mini M4 64GB~$1,099 โ€” 8 tok/s
๐Ÿ‘ NVIDIA
Best overallB
RTX PRO 5000 Blackwell 48GB~$4,999 โ€” 80 tok/s

Best hardware

Top picks for InternLM 20B

RTX PRO 5000 Blackwell 48GBB
48 GB
RTX 6000 Ada 48GBB
48 GB
AMD Instinct MI210 64GBB
64 GB
RTX A6000 48GBB
48 GB
NVIDIA L40S 48GBB
48 GB

Run this model

InternLM 20B on RTX PRO 5000 Blackwell 48GBInternLM 20B on RTX 6000 Ada 48GBInternLM 20B on AMD Instinct MI210 64GB

Quantization options

VRAM estimates by quant level

No hardware detected โ€” fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
7.8 GB
Lowโ€”
Q3_K_S
3
9.8 GB
Lowโ€”
NVFP4
4
11.2 GB
Mediumโ€”
Q4_K_M
4
12.2 GB
Mediumโ€”
Q5_K_M
5
14.4 GB
Highโ€”
Q6_K
6
16.4 GB
Highโ€”
Q8_0
8
21.4 GB
Very Highโ€”
F16
16
41.0 GB
Maximumโ€”

Quality benchmarks

InternLM 20B benchmark scores

Benchmark verified

Reasoning

MMLU-Pro33.3%
GPQA Diamond9.5%
MATH-50040.8%
ARC Challengeโ€”

General

Chatbot Arenaโ€”
IFEval70.1%

Source: community ยท 2025-01-01

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights14.4 GB
KV Cache18.3 GB
Runtime0.9 GB
Headroom0.6 GB

Frequently asked questions

FAQ โ€” InternLM 20B

See also

Quantization GuideScoring MethodologyVRAM Calculator