DeepSeek Coder V2 16B

Current

1.1MDownloads615LikesJun 2024Released131K tokensContextDeepSeekLicense68 GoodQuality

DeepSeek Coder V2 16B (16B parameters) requires approximately 14.9 GB of VRAM with Q4_K_M quantization. As a Mixture of Experts model with 2.4000000953674316B active parameters, it uses less memory than its total parameter count suggests. For the best balance of quality and speed, we recommend hardware with at least 18 GB of VRAM.

Get started

— copy & paste to run locally

Copy-paste commands to run DeepSeek Coder V2 16B on your machine.

Run

lms load DeepSeek-Coder-V2-Lite-Instruct && lms server start

Quick specs

Parameters16B (2.4B active)

Architecturemoe (MoE)

Context131K tokens

Modalitytext

Min RAM6.2 GB

Rec. RAM9.8 GB (Q4_K_M)

LicenseDeepSeek

FamilyDeepSeek

✓ Chat

About this model

We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks. Specifically, DeepSeek-Coder-V2 is further pre-trained from an intermediate checkpoint of DeepSeek-V2 with additional 6 trillion tokens. Through this continued pre-training, DeepSeek-Coder-V2 substantially enhances the coding and mathematical reasoning capabilities of DeepSeek-V2, while maintaining comparable performance in general language tasks.

Related models

Your hardware

Detecting...

Quick picks

👁 Intel

Best budgetA

Intel Arc Pro B60 24GB~$599 — 60 tok/s

Best overallS

RX 7900 XT 20GB~$899 — 117 tok/s

Best hardware

Top picks for DeepSeek Coder V2 16B

RX 7900 XT 20GBS

20 GB

👁 NVIDIA

RTX A4500 20GBS

20 GB

👁 NVIDIA

RTX 3090 24GBS

24 GB

👁 NVIDIA

RTX 3090 Ti 24GBS

24 GB

👁 NVIDIA

RTX 4090 24GBS

24 GB

Run this model

DeepSeek Coder V2 16B on RX 7900 XT 20GB DeepSeek Coder V2 16B on RTX A4500 20GB DeepSeek Coder V2 16B on RTX 3090 24GB

Quantization options

VRAM estimates by quant level

No hardware detected — fit column shows raw VRAM estimates

Quant	Bits	VRAM	Quality	Fit
Q2_K	2	6.2 GB	Low	—
Q3_K_S	3	7.8 GB	Low	—
NVFP4	4	9.0 GB	Medium	—
Q4_K_M	4	9.8 GB	Medium	—
Q5_K_M	5	11.5 GB	High	—
Q6_K	6	13.1 GB	High	—
Q8_0	8	17.1 GB	Very High	—
F16	16	32.8 GB	Maximum	—

Quality benchmarks