Qwen3-VL-8B-Heretic-Stable-GGUF
Qwen3-VL-8B-Heretic-Stable is a stability-focused abliterated evolution built on top of prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX, originally derived from Qwen/Qwen3-VL-8B-Instruct. This model applies advanced abliteration and refusal-suppression training strategies while emphasizing improved output consistency, multimodal reasoning stability, and reliable instruction adherence across complex visual and textual tasks.
[Base: Qwen/Qwen3-VL-8B-Instruct]
└───► [Intermediate: Qwen3-VL-8B-Instruct-Unredacted-MAX]
└───► [Current: prithivMLmods/Qwen3-VL-8B-Heretic-Stable]
├───► [Format: BF16 Base Weights]
└───► [Quant: prithivMLmods/Qwen3-VL-8B-Heretic-Stable-GGUF]
This model is materialized for research and learning purposes only. The model has reduced internal refusal behaviors, and any content generated by it is used at the user’s own risk. The authors and hosting page disclaim any liability for content generated by this model. Users are responsible for ensuring that the model is used in a safe, ethical, and lawful manner.
Evaluation [Self Reported]
| Metric | Result |
|---|---|
| Refusal Rate (harm_bench) | 0 / 250 |
| Test Setup | 250 random harmful prompts |
| Inference Pipeline | Transformers |
| Inference Type | text-generation |
| Dataset | harm_bench |
Model Files
| File Name | Quant Type | File Size | File Link |
|---|---|---|---|
| Qwen3-VL-8B-Heretic-Stable.BF16.gguf | BF16 | 16.4 GB | Download |
| Qwen3-VL-8B-Heretic-Stable.F16.gguf | F16 | 16.4 GB | Download |
| Qwen3-VL-8B-Heretic-Stable.Q2_K.gguf | Q2_K | 3.28 GB | Download |
| Qwen3-VL-8B-Heretic-Stable.Q3_K_L.gguf | Q3_K_L | 4.43 GB | Download |
| Qwen3-VL-8B-Heretic-Stable.Q3_K_M.gguf | Q3_K_M | 4.12 GB | Download |
| Qwen3-VL-8B-Heretic-Stable.Q3_K_S.gguf | Q3_K_S | 3.77 GB | Download |
| Qwen3-VL-8B-Heretic-Stable.Q4_0.gguf | Q4_0 | 4.77 GB | Download |
| Qwen3-VL-8B-Heretic-Stable.Q4_K_M.gguf | Q4_K_M | 5.03 GB | Download |
| Qwen3-VL-8B-Heretic-Stable.Q4_K_S.gguf | Q4_K_S | 4.8 GB | Download |
| Qwen3-VL-8B-Heretic-Stable.Q5_0.gguf | Q5_0 | 5.72 GB | Download |
| Qwen3-VL-8B-Heretic-Stable.Q5_K_M.gguf | Q5_K_M | 5.85 GB | Download |
| Qwen3-VL-8B-Heretic-Stable.Q5_K_S.gguf | Q5_K_S | 5.72 GB | Download |
| Qwen3-VL-8B-Heretic-Stable.Q6_K.gguf | Q6_K | 6.73 GB | Download |
| Qwen3-VL-8B-Heretic-Stable.Q8_0.gguf | Q8_0 | 8.71 GB | Download |
| Qwen3-VL-8B-Heretic-Stable.mmproj-bf16.gguf | mmproj-bf16 | 1.16 GB | Download |
| Qwen3-VL-8B-Heretic-Stable.mmproj-f16.gguf | mmproj-f16 | 1.16 GB | Download |
| Qwen3-VL-8B-Heretic-Stable.mmproj-q8_0.gguf | mmproj-q8_0 | 752 MB | Download |
Quants Usage
(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):
- Downloads last month
- 2,884
2-bit
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit
Model tree for prithivMLmods/Qwen3-VL-8B-Heretic-Stable-GGUF
Base model
Qwen/Qwen3-VL-8B-Instruct