This model is a preview, unfinished, and still in development. It is not representative of any final product and has only been remotely published to prove that I am doing something productive with my life

Koto Large 106B-a6B (Preview)

Koto-Large-106B-Preview is a version of Ling-Flash-Base-2.0 trained on almost a billion tokens of creative writing data.

Thanks to lium.io for the compute! <3

Um. Don't please?

But if you must, our testers found that:

Temp 1.1, min_p 0.01, rep pen 1.02, freq pen -0.04

is the best. somehow

Some of the data used to train this model includes:

Most of The Anarchist Library, a repository for anarchist manifestos and writing (see allura-org/the-anarchist-library)
A random sample of public domain books from Project Gutenberg
Furry (anthro and feral) storytelling and smut
A small subset of known high-quality books and story data

ilya <3

Safetensors

Model size

106B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Base model

Finetuned

(5)

this model

Quantizations