PAINTED FANTASY VISAGE

Mistrall Small 3.2 Upscaled 33B

Overview

Another experimental release. Mistral Small 3.2 24B upscaled by 18 layers to create a 33.6B model. This model then went through pretraining, SFT & DPO.

Can't guarantee the Mistral 3.2 repetition issues are fixed, but this model seems to be less repetitive than my previous attempt.

This is an uncensored creative model intended to excel at character driven RP / ERP where characters are portrayed creatively and proactively.

SillyTavern Settings

Recommended Roleplay Format

> Actions: In plaintext

> Dialogue: "In quotes"

> Thoughts: *In asterisks*

Recommended Samplers

> Temp: 0.6

> MinP: 0.03 - 0.05

> TopP: 0.95 - 1.0

> Dry: 0.8, 1.75, 4

Instruct

Mistral v7 Tekken

Quantizations

GGUF

Static (mradermacher)

iMatrix (mradermacher)

EXL3

3bpw

4bpw

5bpw

6bpw

Creation Process

Creation process: Upscale > Pretrain > SFT > DPO

All training was qlora (including pretrain).

Pretrained on 177MB of data. Dataset consisteted mostly of Light Novels, NSFW stories, SFW stories and filled out with general corpus text from Huggingface FineWeb-2 dataset.

The model then went through SFT using a dataset of approx 3.6 million tokens, 700 RP conversations, 1000 creative writing / instruct samples and about 100 summaries. The bulk of this data has been made public.

Finally, DPO was used to make the model more consistent.

Downloads last month: 8

Safetensors

Model size

34B params

Tensor type

BF16

Model tree for zerofata/MS3.2-PaintedFantasy-Visage-33B

Base model

mistralai/Mistral-Small-3.1-24B-Base-2503

Finetuned

mistralai/Mistral-Small-3.2-24B-Instruct-2506

Finetuned

(65)

this model

Finetunes

1 model

Quantizations

8 models

Datasets used to train zerofata/MS3.2-PaintedFantasy-Visage-33B

Collection including zerofata/MS3.2-PaintedFantasy-Visage-33B

The retirement home. • 7 items • Updated Mar 20

URL: https://huggingface.co/zerofata/MS3.2-PaintedFantasy-Visage-33B

⇱ zerofata/MS3.2-PaintedFantasy-Visage-33B · Hugging Face

PAINTED FANTASY VISAGE

Overview

SillyTavern Settings

Recommended Roleplay Format

Recommended Samplers

Instruct

Quantizations

GGUF

EXL3

Creation Process

Model tree for zerofata/MS3.2-PaintedFantasy-Visage-33B

Datasets used to train zerofata/MS3.2-PaintedFantasy-Visage-33B

Collection including zerofata/MS3.2-PaintedFantasy-Visage-33B