Mradermacher
Dolphin Mistral GLM 4.7 Flash 24B Venice Edition Thinking Uncensored i1
Limited data available — some specs may be incomplete or estimated.
0K tokensContextUnknownLicense4 EntryQuality
Dolphin Mistral GLM 4.7 Flash 24B Venice Edition Thinking Uncensored i1 (24B parameters) requires approximately 19.3 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 23 GB of VRAM.
Quick specs
Parameters24B
Architecturedense
Context0K tokens
Modalitytext
Min RAM9.4 GB
Rec. RAM14.6 GB (Q4_K_M)
LicenseUnknown
FamilyMistral
✓ Reasoning
Related models
Your hardware
Detecting...
Quick picks
Best hardware
Top picks for Dolphin Mistral GLM 4.7 Flash 24B Venice Edition Thinking Uncensored i1
Run this model
Quantization options
VRAM estimates by quant level
No hardware detected — fit column shows raw VRAM estimates
| Quant | Bits | VRAM | Quality | Fit |
|---|---|---|---|---|
Q2_K | 2 | 9.4 GB | Low | — |
Q3_K_S | 3 | 11.8 GB | Low | — |
NVFP4 | 4 | 13.4 GB | Medium | — |
Q4_K_M | 4 | 14.6 GB | Medium | — |
Q5_K_M | 5 | 17.3 GB | High | — |
Q6_K | 6 | 19.7 GB | High | — |
Q8_0 | 8 | 25.7 GB | Very High | — |
F16 | 16 | 49.2 GB | Maximum | — |
Hardware compatibility
Fit estimates across all hardware
Computing compatibility...
Memory breakdown
Reference: RTX 2060 6GB
Weights14.6 GB
KV Cache2.8 GB
Runtime1.2 GB
Headroom0.6 GB
Frequently asked questions
FAQ — Dolphin Mistral GLM 4.7 Flash 24B Venice Edition Thinking Uncensored i1
See also
