Support this work → · X · GitHub · REAP paper · Cerebras REAP

Qwen3.6-28B-GGUF

GGUF quantization of the base model.

At a glance


Base model	—
Format	GGUF
Total params	28B
Active / token	3B
Experts / layer	—
Layers	—
Hidden size	—
Context	—
On-disk size	147 GB

Which variant should I pick?

Variant	Format	Link
`Qwen3.6-28B`	BF16	link
`Qwen3.6-28B-GGUF` (this)	GGUF	link
`Qwen3.6-35B-GGUF`	GGUF	link

License & citation

License inherited from the base model.

@misc{lasby2025reap,
 title = {REAP the Experts: Why Pruning Prevails for One-Shot MoE Compression},
 author = {Mike Lasby and Ivan Lazarevich and Nish Sinnadurai and Sean Lie and Yani Ioannou and Vithursan Thangarasa},
 year = {2025}, eprint = {2510.13999}, archivePrefix = {arXiv}
}

Collection including 0xSero/Qwen3.6-28B-GGUF

REAP-pruned & quantized Qwen3.5 / 3.6 / Coder variants. • 15 items • Updated 20 days ago

Paper for 0xSero/Qwen3.6-28B-GGUF

Paper • 2510.13999 • Published Oct 15, 2025 • 20

URL: https://huggingface.co/0xSero/Qwen3.6-28B-GGUF

⇱ 0xSero/Qwen3.6-28B-GGUF · Hugging Face

Qwen3.6-28B-GGUF

At a glance

Which variant should I pick?

License & citation

Sponsors

Collection including 0xSero/Qwen3.6-28B-GGUF

Paper for 0xSero/Qwen3.6-28B-GGUF