VOOZH about

URL: https://willitrunai.com/models/cogvlm2-19b

⇱ CogVLM2 19B VRAM Requirements β€” GPU Compatibility


πŸ‘ Tsinghua/Zhipu
Tsinghua/Zhipu

CogVLM2 19B

Current
πŸ‘ huggingface
HuggingFace
6.0KDownloads220LikesMay 2024Released8K tokensContextApache 2.0License78 StrongQuality

CogVLM2 19B (19B parameters) requires approximately 15.5 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 18 GB of VRAM.

Get started

β€” copy & paste to run locally

Copy-paste commands to run CogVLM2 19B on your machine.

Run

docker run --rm -it ghcr.io/ggerganov/llama.cpp:full \ --hf-repo "THUDM/cogvlm2-llama3-chat-19B" \ --hf-file "cogvlm2-llama3-chat-19B-Q4_K_M.gguf" \ -c 4096 -ngl 99

Quick specs

Parameters19B
Architecturedense
Context8K tokens
Modalitytext+vision
Min RAM7.4 GB
Rec. RAM11.6 GB (Q4_K_M)
LicenseApache 2.0
FamilyCogVLM
βœ“ Visionβœ“ Chat

About this model

πŸ‘‹ Wechat Β· πŸ’‘Online Demo Β· 🎈Github Page Β· πŸ“‘ Paper

  • β€’Significant improvements in many benchmarks such as TextVQA, DocVQA
  • β€’Support 8K content length
  • β€’Support image resolution up to **1344 * 1344**
  • β€’Provide an open source model version that supports both Chinese and English

Your hardware

Detecting...

Quick picks

πŸ‘ Intel
Best budgetS
Intel Arc Pro B60 24GB~$599 β€” 23 tok/s
πŸ‘ NVIDIA
Best overallS
NVIDIA A30 24GB~$5,500 β€” 68 tok/s

Best hardware

Top picks for CogVLM2 19B

RTX 5090 Laptop 24GBS
24 GB
NVIDIA A30 24GBS
24 GB
RX 7900 XTX 24GBS
24 GB
RTX 3090 Ti 24GBS
24 GB
RTX 4090 24GBS
24 GB

Run this model

CogVLM2 19B on RTX 5090 Laptop 24GBCogVLM2 19B on NVIDIA A30 24GBCogVLM2 19B on RX 7900 XTX 24GB

Quantization options

VRAM estimates by quant level

No hardware detected β€” fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
7.4 GB
Lowβ€”
Q3_K_S
3
9.3 GB
Lowβ€”
NVFP4
4
10.6 GB
Mediumβ€”
Q4_K_M
4
11.6 GB
Mediumβ€”
Q5_K_M
5
13.7 GB
Highβ€”
Q6_K
6
15.6 GB
Highβ€”
Q8_0
8
20.3 GB
Very Highβ€”
F16
16
38.9 GB
Maximumβ€”

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights11.6 GB
KV Cache2.4 GB
Runtime0.9 GB
Headroom0.6 GB

Frequently asked questions

FAQ β€” CogVLM2 19B

See also

Quantization GuideScoring MethodologyVRAM Calculator