VOOZH about

URL: https://huggingface.co/Delta-Vector/Ohashi-NeMo-12B

โ‡ฑ Delta-Vector/Ohashi-NeMo-12B ยท Hugging Face


๐Ÿ‘ image/png

A finetune of Mistral-Nemo-Instruct-2407 using conversational data, aiming for prose that's best described as 'short' and 'sweet.' The model strictly adheres to one-on-one roleplay and is very dialogue heavy.

Quants

GGUF: https://huggingface.co/Delta-Vector/Ohashi-NeMo-12B-gguf

EXL2 : https://huggingface.co/Delta-Vector/Ohashi-NeMo-12B-EXL2

Prompting

Model has been tuned with the Mistral formatting. A typical input would look like this:

<s>[INST] SYSTEM MESSAGE
USER MESSAGE[/INST] ASSISTANT MESSAGE</s>[INST] USER MESSAGE[/INST]

System Prompting

I would highly recommend using either Euryale's system prompt or the EVA system prompt with the model.



Axolotl config


Credits

Thank you to Lucy Knada, Intervitens, Tav, Trappu, Cgato, Kubernetes Bad and the rest of Anthracite

Training

The training was done for 4 epochs. We used 4 x RTX 3090s GPUs graciously provided by Intervitens for the fine-tuning of the model.

๐Ÿ‘ Built with Axolotl

Safety

๐Ÿ‘ image/png

Downloads last month
18
Safetensors
Model size
12B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Delta-Vector/Ohashi-NeMo-12B

Finetuned
(216)
this model
Merges
10 models
Quantizations
3 models

Datasets used to train Delta-Vector/Ohashi-NeMo-12B

Collection including Delta-Vector/Ohashi-NeMo-12B