PAINTED FANTASY VISAGE
Overview
Another experimental release. Mistral Small 3.2 24B upscaled by 18 layers to create a 33.6B model. This model then went through pretraining, SFT & DPO.
Can't guarantee the Mistral 3.2 repetition issues are fixed, but this model seems to be less repetitive than my previous attempt.
This is an uncensored creative model intended to excel at character driven RP / ERP where characters are portrayed creatively and proactively.
SillyTavern Settings
Recommended Roleplay Format
Recommended Samplers
Instruct
Mistral v7 Tekken
Quantizations
Creation Process
Creation process: Upscale > Pretrain > SFT > DPO
All training was qlora (including pretrain).
Pretrained on 177MB of data. Dataset consisteted mostly of Light Novels, NSFW stories, SFW stories and filled out with general corpus text from Huggingface FineWeb-2 dataset.
The model then went through SFT using a dataset of approx 3.6 million tokens, 700 RP conversations, 1000 creative writing / instruct samples and about 100 summaries. The bulk of this data has been made public.
Finally, DPO was used to make the model more consistent.
- Downloads last month
- 8
Model tree for zerofata/MS3.2-PaintedFantasy-Visage-33B
Base model
mistralai/Mistral-Small-3.1-24B-Base-2503