VOOZH about

URL: https://willitrunai.com/models/mpt-7b-instruct

โ‡ฑ MPT-7B-Instruct VRAM Requirements โ€” GPU Compatibility


๐Ÿ‘ MosaicML
MosaicML

MPT-7B-Instruct

Legacy
๐Ÿ‘ huggingface
HuggingFace
May 2023Released8K tokensContextApache 2.0License40 BasicQuality

MPT-7B-Instruct (7B parameters) requires approximately 13.9 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 16 GB of VRAM.

Get started

โ€” copy & paste to run locally

Copy-paste commands to run MPT-7B-Instruct on your machine.

Run

lms load mpt-7b-instruct && lms server start

Quick specs

Parameters7B
Architecturedense
Context8K tokens
Modalitytext
Min RAM2.7 GB
Rec. RAM4.3 GB (Q4_K_M)
LicenseApache 2.0
FamilyMPT
โœ“ Chatโœ“ Reasoning

About this model

MPT-7B Instruct is MosaicML's instruction-tuned model with a commercially permissive license. Supports 65K context with ALiBi positional encoding for efficient long-document processing.

Related models

Your hardware

Detecting...

Quick picks

Best budgetB
RX 7600 XT 16GB~$329 โ€” 39 tok/s
Best overallA
RX 7900 XT 20GB~$899 โ€” 98 tok/s

Best hardware

Top picks for MPT-7B-Instruct

RX 7900 XT 20GBA
20 GB
RTX A4500 20GBA
20 GB
RTX 3090 24GBA
24 GB
RTX 3090 Ti 24GBA
24 GB
RTX 4090 24GBA
24 GB

Run this model

MPT-7B-Instruct on RX 7900 XT 20GBMPT-7B-Instruct on RTX A4500 20GBMPT-7B-Instruct on RTX 3090 24GB

Quantization options

VRAM estimates by quant level

No hardware detected โ€” fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
2.7 GB
Lowโ€”
Q3_K_S
3
3.4 GB
Lowโ€”
NVFP4
4
3.9 GB
Mediumโ€”
Q4_K_M
4
4.3 GB
Mediumโ€”
Q5_K_M
5
5.0 GB
Highโ€”
Q6_K
6
5.7 GB
Highโ€”
Q8_0
8
7.5 GB
Very Highโ€”
F16
16
14.3 GB
Maximumโ€”

Quality benchmarks

MPT-7B-Instruct benchmark scores

Benchmark verified

Reasoning

MMLU-Proโ€”
GPQA Diamondโ€”
MATH-500โ€”
ARC Challenge46.5%

Source: community ยท 2023-05-05

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights4.3 GB
KV Cache7.8 GB
Runtime1.2 GB
Headroom0.6 GB

Frequently asked questions

FAQ โ€” MPT-7B-Instruct

See also

Quantization GuideScoring MethodologyVRAM Calculator