qwen3-4b-uzbek-v2-gguf

gguf suite for inspirebek/qwen3-4b-uzbek-v2. cpu / apple silicon / vulkan / rocm via llama.cpp, ollama, lm studio, etc.

files

quant	size	notes
`f16`	8.8 gb	reference fp16
`Q8_0`	4.7 gb	near-lossless
`Q6_K`	3.6 gb	recommended for quality
`Q5_K_M`	3.2 gb	balanced
`Q5_K_S`	3.1 gb	slightly lighter
`Q4_K_M`	2.7 gb	recommended for most users
`Q4_K_S`	2.6 gb	smaller, slight quality loss
`Q3_K_M`	2.2 gb	aggressive
`Q2_K`	1.8 gb	edge / low-ram only

usage

llama.cpp:

llama-cli -m qwen3-4b-uzbek-v2-q4_k_m.gguf -p "Salom! Qalaysan?" -cnv

ollama:

ollama run hf.co/inspirebek/qwen3-4b-uzbek-v2-GGUF:Q4_K_M

quantization

converted from the bf16 merged model via llama.cpp's convert_hf_to_gguf.py → llama-quantize. no calibration data (k-quants are statistics-only).

datasets

stage a — fluency (continued pretraining):

stage b — instruct (sft):

saillab/alpaca_uzbek_taco · CC-BY-NC-4.0
behbudiy/alpaca-cleaned-uz · CC-BY-4.0
UAzimov/uzbek-instruct-llm · Apache-2.0
CohereLabs/aya_collection_language_split · Apache-2.0
med-alex/qa_mt_ru_to_uzn · unspecified
med-alex/qa_mt_tr_to_uzn · unspecified

⚠️ licensing note: saillab/alpaca_uzbek_taco is cc-by-nc-4.0, which restricts commercial use of derivative models. downstream users who need a fully permissive license should retrain without that subset.

sibling formats

Downloads last month: 94

GGUF

Model size

4B params

Architecture

qwen3

Hardware compatibility

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Model tree for inspirebek/qwen3-4b-uzbek-v2-GGUF

Base model

Qwen/Qwen3-4B-Base

Finetuned

Qwen/Qwen3-4B

Adapter

inspirebek/qwen3-4b-uzbek-v2

Quantized

(3)

this model

Datasets used to train inspirebek/qwen3-4b-uzbek-v2-GGUF

Collection including inspirebek/qwen3-4b-uzbek-v2-GGUF

all distribution formats of the uzbek fine-tune of qwen3-4b: merged bf16, lora adapter, bnb nf4, awq, and the gguf suite. • 5 items • Updated Apr 20 • 1

URL: https://huggingface.co/inspirebek/qwen3-4b-uzbek-v2-GGUF

⇱ inspirebek/qwen3-4b-uzbek-v2-GGUF · Hugging Face