VOOZH about

URL: https://huggingface.co/bigatuna/Qwen3.5-9b-Sushi-Coder

⇱ bigatuna/Qwen3.5-9b-Sushi-Coder · Hugging Face


Qwen3.5-9b-Sushi-Coder

Merged Unsloth fine-tune based on unsloth/qwen3.5-9b.

Training lineage

  • Base model: unsloth/qwen3.5-9b
  • Earlier training data used for this model line: open-r1/codeforces-cots
  • Continuation training dataset: nohurry/Opus-4.6-Reasoning-3000x-filtered
  • Continuation method: LoRA continuation from an adapter-only Unsloth Studio output
  • Continuation precision: 16bit LoRA / bf16

Current uploaded files

  • merged safetensor model shards
  • tokenizer files
  • processor and generation config
  • chat template

Built with Unsloth and TRL.

Downloads last month
6
Safetensors
Model size
9B params
Tensor type
F32
·
BF16
·

Model tree for bigatuna/Qwen3.5-9b-Sushi-Coder

Adapters
1 model
Quantizations
2 models

Datasets used to train bigatuna/Qwen3.5-9b-Sushi-Coder