
URL: https://huggingface.co/allura-org/Koto-Large-106B-Preview



This model is a preview, unfinished, and still in development. It is not representative of any final product and has only been remotely published to prove that I am doing something productive with my life.

Koto Large 106B-a6B (Preview)

Koto-Large-106B-Preview is a version of Ling-Flash-Base-2.0 trained on almost a billion tokens of creative writing data.

Thanks to lium.io for the compute! <3

Usage

Um. Don't please?

But if you must, our testers found that:

Temperature 1.1, min_p 0.01, repetition penalty 1.02, frequency penalty -0.04

is the best. somehow
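
For anyone who wants to try those sampler values, here is a minimal sketch of a request body that carries them. It only builds the JSON and does not contact any server. The field names (`temperature`, `min_p`, `repetition_penalty`, `frequency_penalty`) are an assumption based on what vLLM-style OpenAI-compatible completion endpoints commonly accept; `min_p` and `repetition_penalty` are not part of the standard OpenAI API, so check your backend's docs. The function name, the `max_tokens` default, and the choice of a plain completion request rather than a chat request are also illustrative assumptions, not something the card specifies.

```python
import json

# Sampler values reported by the testers in the Usage section.
# Key names are an assumption (vLLM-style); other backends may differ.
SAMPLER_SETTINGS = {
    "temperature": 1.1,
    "min_p": 0.01,
    "repetition_penalty": 1.02,
    "frequency_penalty": -0.04,
}


def build_completion_request(prompt: str, max_tokens: int = 512) -> dict:
    """Return a JSON-serializable body for a text-completion request.

    Hypothetical helper: max_tokens=512 is an arbitrary default,
    not a recommendation from the model card.
    """
    return {
        "model": "allura-org/Koto-Large-106B-Preview",
        "prompt": prompt,
        "max_tokens": max_tokens,
        **SAMPLER_SETTINGS,
    }


if __name__ == "__main__":
    # Print the body so it can be pasted into curl or another HTTP client.
    print(json.dumps(build_completion_request("Once upon a time"), indent=2))
```

Keeping the sampler values in one dictionary makes it easy to swap in different settings later without touching the rest of the request.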

Datasets

Some of the data used to train this model includes:

  • Most of The Anarchist Library, a repository for anarchist manifestos and writing (see allura-org/the-anarchist-library)
  • A random sample of public domain books from Project Gutenberg
  • Furry (anthro and feral) storytelling and smut
  • A small subset of known high-quality books and story data

Acknowledgements

  • thanks again to fish and co from lium for compute
  • thanks to curse for testing, ideas
  • thanks to toasty for some data, ideas
  • thanks to everyone else in allura for moral support

ilya <3

Technical Appendix

Format: Safetensors
Model size: 106B params
Tensor type: BF16