A newer version of this model is available: qikp/kite-4.2-14m

Kite

🎉 You are looking at Kite 4.1, which is more powerful yet hardware-aligned!

Kite is a small, trained language model with 14 million parameters.

Training

It was trained for 1 epoch on a tokenized version of qikp/small-data, a mixture of various datasets, with a batch size of 32, a learning rate of 1.5e-4, and the pika 4 tokenizer.
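As a rough illustration of what these hyperparameters imply, the sketch below collects them in a small config object and estimates the number of optimizer steps. The `TrainingConfig` class and `optimizer_steps` helper are hypothetical names, not part of the Kite training code, and the sequence count is a made-up placeholder because the card does not state the size of qikp/small-data. It also assumes one optimizer step per batch, with no gradient accumulation.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainingConfig:
    """Hyperparameters as stated in the model card (hypothetical container)."""

    dataset: str = "qikp/small-data"
    epochs: int = 1
    batch_size: int = 32
    learning_rate: float = 1.5e-4

    def optimizer_steps(self, num_sequences: int) -> int:
        """Steps to see every sequence once per epoch; the last batch may be partial."""
        return self.epochs * math.ceil(num_sequences / self.batch_size)


config = TrainingConfig()

# 1_000_000 is a placeholder, not the real number of sequences in the dataset.
print(config.optimizer_steps(1_000_000))
```

With a single epoch, the step count is simply the number of batches, so the learning rate schedule (if any was used) would span that many steps.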

Limitations

Due to its size, the model is not suitable for production workloads.


Safetensors

Model size: 14.2M params

Tensor type: F32
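The parameter count and tensor type together give a rough size for the stored weights. The sketch below does that arithmetic; `weight_bytes` is a hypothetical helper, and because 14.2M is a rounded figure the result is only approximate. It counts the weights alone and ignores file metadata and any runtime memory such as activations.

```python
# Bytes per element for a few common tensor types.
BYTES_PER_ELEMENT = {"F32": 4, "F16": 2, "BF16": 2}


def weight_bytes(num_params: int, tensor_type: str = "F32") -> int:
    """Approximate size of the raw weights in bytes."""
    return num_params * BYTES_PER_ELEMENT[tensor_type]


# 14.2M parameters stored as F32, as listed on the model card.
size = weight_bytes(14_200_000, "F32")
print(f"{size / 1e6:.1f} MB")  # 56.8 MB
```

Casting the same weights to a 16-bit type would halve this figure, though the card only lists F32.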

URL: https://huggingface.co/qikp/kite-4.1-14m
