Models for 16GB+ VRAM • 4 items • Updated
Mistral-Small-3.2-AntiRep-24B:
- Exactly what it says on the tin, Orpo'd Mistral Small 3.2 to remove repetition.
- Trained to reduce infinite repetition, repetition of structure and sentences in multi turn conversation, and repetition within responses.
- Got really annoyed with all of my Mistral Small test models having repetition issues, so I decided to whip this up.
- Produced by doing orpo with Qwen 3 8B at 0 temp + .7 rep pen (<1 increases repetition) as rejected vs V3 03/24 as chosen.
- The LoRA is also available too, if you want to use it to reduce repetition on other MS3.2 tunes.
Enjoy!
- Downloads last month
- 10
Safetensors
Model size
24B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for ConicCat/Mistral-Small-3.2-AntiRep-24B
Base model
mistralai/Mistral-Small-3.1-24B-Base-2503