Si QLoRA - no DPO, pero que sabe cocinar eh! • 17 items • Updated • 1
- Downloads last month
- 3
Safetensors
Model size
4B params
Tensor type
F32
·
BF16 ·
U8 ·
Model tree for somosnlp-hackathon-2025/mistral-7b-gastronomia-hispana-dpo-4bit
Base model
mistralai/Mistral-7B-v0.3 Finetuned
mistralai/Mistral-7B-Instruct-v0.3