VOOZH about

URL: https://huggingface.co/qikp/kite-2.6-13m

โ‡ฑ qikp/kite-2.6-13m ยท Hugging Face


A newer version of this model is available: qikp/kite-4.1-14m

Kite

๐ŸŽ‰ You are looking at Kite 2.6, which is now using a Mistral base!

Kite is a small, trained, 13 million parameter language model, without any special optimizations.

Training

It was trained on 50K rows of this dataset using 12500 steps, 1 epoch, 4 batch size, 5e-4 learning rate, and the pika 2 tokenizer.

Limitations

Due to its size, the model is not suitable for production workloads.

Loss

๐Ÿ‘ Image

Downloads last month
96
Safetensors
Model size
12.7M params
Tensor type
F32
ยท

Model tree for qikp/kite-2.6-13m

Finetunes
1 model

Dataset used to train qikp/kite-2.6-13m