URL: https://willitrunai.com/models/mixtral-8x7b

Mixtral 8x7B VRAM Requirements: GPU Compatibility


Mistral

Mixtral 8x7B

Current
HuggingFace
Ollama

Downloads: 524.9K
Likes: 4.7K
Released: Dec 2023
Context: 33K tokens
License: Apache 2.0
Quality: 38 (Basic)

Mixtral 8x7B (47B parameters) requires approximately 32.4 GB of VRAM with Q4_K_M quantization: about 28.7 GB for the weights plus KV cache and runtime overhead. As a Mixture of Experts model, it activates only about 13B parameters per token, so it runs faster than a dense 47B model, but all 47B parameters must still be held in memory. For the best balance of quality and speed, we recommend hardware with at least 38 GB of VRAM.
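
The relationship between the 47B total and 13B active figures can be worked out with a little arithmetic. The sketch below assumes, as is not stated on this page, that each token is routed through 2 of 8 expert blocks while the remaining weights are shared; the function name is illustrative only.

```python
# Assumption (not stated on this page): each token passes through
# 2 of 8 expert blocks; attention and embedding weights are shared.
NUM_EXPERTS = 8
ACTIVE_EXPERTS = 2


def split_moe_params(total_b: float, active_b: float) -> tuple[float, float]:
    """Solve total = shared + 8 * expert and active = shared + 2 * expert.

    Returns (shared, per_expert) in billions of parameters.
    """
    per_expert = (total_b - active_b) / (NUM_EXPERTS - ACTIVE_EXPERTS)
    shared = active_b - ACTIVE_EXPERTS * per_expert
    return shared, per_expert


shared_b, expert_b = split_moe_params(total_b=47.0, active_b=13.0)
print(f"shared ~{shared_b:.1f}B, per expert ~{expert_b:.1f}B")
```

Under these assumptions the rounded page figures imply roughly 1.7B shared parameters and 5.7B per expert. Memory use follows the total count, while per-token compute follows the active count.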

Get started

Copy and paste to run locally

Copy-paste commands to run Mixtral 8x7B on your machine.

Run

ollama run mixtral
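
Once the model has been pulled, it can also be queried from a script. The following is a minimal sketch that assumes an Ollama server is listening on its default local port (11434) and uses the non-streaming generate endpoint; the helper names `build_request` and `generate` are hypothetical.

```python
import json
import urllib.request

# Assumption: Ollama is serving on its default local address.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt: str, model: str = "mixtral") -> urllib.request.Request:
    """Build a non-streaming generate request for the local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str = "mixtral") -> str:
    """Send the prompt and return the generated text from the JSON reply."""
    with urllib.request.urlopen(build_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]
```

With the server running, `generate("Explain mixture-of-experts models in one sentence.")` should return the model's reply as a string. Setting `stream` to false keeps the example to a single JSON response rather than a stream of chunks.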

Quick specs

Parameters: 47B (13B active)
Architecture: MoE (Mixture of Experts)
Context: 33K tokens
Modality: text
Min RAM: 18.3 GB
Rec. RAM: 28.7 GB (Q4_K_M)
License: Apache 2.0
Family: Mixtral
Capabilities: Chat, Reasoning

About this model

from mistral_common.tokens.tokenizers.mistral import MistralTokenizer
from mistral_common.protocol.instruct.messages import UserMessage
from mistral_common.protocol.instruct.request import ChatCompletionRequest

Quick picks

Best budget (grade B): Mac mini M4 64GB, ~$1,099, 6 tok/s
Best overall (grade A): RTX PRO 5000 Blackwell 48GB, ~$4,999, 81 tok/s

Best hardware

Top picks for Mixtral 8x7B

RTX PRO 5000 Blackwell 48GB: grade A, 48 GB VRAM
RTX 6000 Ada 48GB: grade A, 48 GB VRAM
NVIDIA L40S 48GB: grade A, 48 GB VRAM
NVIDIA L40 48GB: grade A, 48 GB VRAM
NVIDIA L20 48GB: grade A, 48 GB VRAM

Run this model

Mixtral 8x7B on RTX PRO 5000 Blackwell 48GB
Mixtral 8x7B on RTX 6000 Ada 48GB
Mixtral 8x7B on NVIDIA L40S 48GB

Quantization options

VRAM estimates by quant level

Quant    Bits  VRAM     Quality
Q2_K     2     18.3 GB  Low
Q3_K_S   3     23.0 GB  Low
NVFP4    4     26.3 GB  Medium
Q4_K_M   4     28.7 GB  Medium
Q5_K_M   5     33.8 GB  High
Q6_K     6     38.5 GB  High
Q8_0     8     50.3 GB  Very High
F16      16    96.4 GB  Maximum
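
To turn these estimates into a fit decision for a particular card, one option is to pick the largest quant whose footprint stays within the available VRAM. The sketch below copies the sizes from the table above and assumes they cover weights only, adding a fixed 3.8 GB for KV cache, runtime, and headroom (the figures listed in the memory breakdown). Holding that overhead constant across quant levels is an assumption, and `largest_quant_that_fits` is a hypothetical helper.

```python
from typing import Optional

# Sizes in GB, copied from the quantization table above.
QUANT_WEIGHTS_GB = {
    "Q2_K": 18.3, "Q3_K_S": 23.0, "NVFP4": 26.3, "Q4_K_M": 28.7,
    "Q5_K_M": 33.8, "Q6_K": 38.5, "Q8_0": 50.3, "F16": 96.4,
}

# Assumption: KV cache + runtime + headroom (2.0 + 1.2 + 0.6 GB) is the
# same for every quant level.
OVERHEAD_GB = 3.8


def largest_quant_that_fits(vram_gb: float,
                            overhead_gb: float = OVERHEAD_GB) -> Optional[str]:
    """Return the biggest quant that fits in vram_gb, or None if none do."""
    fitting = [
        (size, name)
        for name, size in QUANT_WEIGHTS_GB.items()
        if size + overhead_gb <= vram_gb
    ]
    # max() compares on size first, so the largest fitting quant wins.
    return max(fitting)[1] if fitting else None


for vram in (6, 24, 48):
    print(f"{vram} GB -> {largest_quant_that_fits(vram)}")
```

Under these assumptions a 48 GB card can hold Q6_K, a 24 GB card is limited to Q2_K, and a 6 GB card cannot hold any listed quant entirely in VRAM.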

Quality benchmarks

Mixtral 8x7B benchmark scores

Benchmark verified

Coding

SWE-bench Verified: n/a
HumanEval+: 64.0%
Aider Polyglot: n/a
LiveCodeBench: n/a

Reasoning

MMLU-Pro: 54.0%
GPQA Diamond: 7.0%
MATH-500: 47.0%
ARC Challenge: n/a

General

Chatbot Arena: n/a
IFEval: 62.0%

Source: official, 2023-12-11

Memory breakdown

Reference: RTX 2060 6GB

Weights: 28.7 GB
KV Cache: 2.0 GB
Runtime: 1.2 GB
Headroom: 0.6 GB
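
The four components can be summed to check them against a device's memory. This is a small sketch using only the numbers listed in the breakdown; the function names are illustrative, and the 6 GB figure is the reference card named above.

```python
# Component sizes in GB, as listed in the memory breakdown (Q4_K_M).
BREAKDOWN_GB = {"weights": 28.7, "kv_cache": 2.0, "runtime": 1.2, "headroom": 0.6}


def total_required_gb(breakdown: dict) -> float:
    """Sum all components, rounded to one decimal place."""
    return round(sum(breakdown.values()), 1)


def shortfall_gb(breakdown: dict, vram_gb: float) -> float:
    """GB missing on the device; 0.0 when everything fits."""
    return max(0.0, round(total_required_gb(breakdown) - vram_gb, 1))


total = total_required_gb(BREAKDOWN_GB)
print(f"total: {total} GB")
print(f"shortfall on a 6 GB card: {shortfall_gb(BREAKDOWN_GB, 6.0)} GB")
print(f"shortfall on a 48 GB card: {shortfall_gb(BREAKDOWN_GB, 48.0)} GB")
```

The components add up to 32.5 GB, which is within rounding of the roughly 32.4 GB quoted for Q4_K_M. That total is far beyond the 6 GB reference card and comfortably inside a 48 GB one.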

See also

Quantization Guide
Scoring Methodology
VRAM Calculator