GPT-2 ChatML FP32 (SFT on no_robots)
This is a fine-tuned GPT-2 model (124M parameters) trained on the human-curated SFT dataset HuggingFaceH4/no_robots using ChatML conversational formatting.
Model Details
- Base Model: gpt2
- Dataset: HuggingFaceH4/no_robots
- Conversational Format: ChatML (<|im_start|>/<|im_end|>)
- Training Epochs: 2 epochs
- Eval Perplexity: 14.46
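
Because the model was trained on ChatML-formatted conversations, prompts at inference time should probably follow the same layout. The sketch below shows the commonly used ChatML structure built from the <|im_start|> and <|im_end|> markers listed above. The helper name format_chatml, the newline placement, and the trailing assistant header are assumptions for illustration; this card does not show the exact template used during training, so check the tokenizer's chat template if one ships with the model.

```python
# Minimal sketch of ChatML prompt formatting.
# Assumption: the common ChatML layout (role on the marker line, content on
# the next line, each turn closed by <|im_end|> and a newline). The exact
# training template is not given in this model card.
IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def format_chatml(messages, add_generation_prompt=True):
    """Render a list of {"role", "content"} dicts as one ChatML string.

    format_chatml is a hypothetical helper, not part of any library.
    """
    parts = []
    for message in messages:
        parts.append(
            f"{IM_START}{message['role']}\n{message['content']}{IM_END}\n"
        )
    if add_generation_prompt:
        # Open an assistant turn so the model continues with its reply.
        parts.append(f"{IM_START}assistant\n")
    return "".join(parts)


prompt = format_chatml(
    [{"role": "user", "content": "Name three primary colors."}]
)
print(prompt)
```

The resulting string can be tokenized and passed to the model for generation. Stopping generation at the first <|im_end|> keeps the output to a single assistant turn.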
For GGUF quantized formats (including IQ4_NL and IQ3_XXS), please visit the GGUF repository: JustACluelessKid2/gpt2-chatml-fp32-GGUF.
