URL: https://huggingface.co/anthracite-org/magnum-v4-12b

anthracite-org/magnum-v4-12b · Hugging Face


This is a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet and Opus.

This model is fine-tuned on top of mistralai/Mistral-Nemo-Instruct-2407.

Prompting

A typical input would look like this:

<s>[INST] SYSTEM MESSAGE
USER MESSAGE[/INST] ASSISTANT MESSAGE</s>[INST] USER MESSAGE[/INST]
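
As a minimal sketch, the template above can be assembled programmatically as follows. The helper name `build_prompt` and its arguments are hypothetical and not part of the model card; the sketch simply concatenates strings to reproduce the layout shown. It assumes the system message is joined to the first user message with a newline, as in the example. Note that many tokenizers insert the `<s>` token on their own, so the literal `<s>` may need to be dropped depending on the inference stack.

```python
from typing import List, Optional, Tuple


def build_prompt(
    turns: List[Tuple[str, Optional[str]]],
    system: Optional[str] = None,
) -> str:
    """Render (user, assistant) turns in the [INST] layout shown above.

    Hypothetical helper: an assistant value of None marks the turn the
    model is expected to complete, so no reply or </s> is appended.
    """
    parts = ["<s>"]
    for index, (user, assistant) in enumerate(turns):
        # The system message is prepended to the first user message only.
        if index == 0 and system:
            user = f"{system}\n{user}"
        parts.append(f"[INST] {user}[/INST]")
        if assistant is not None:
            # Completed assistant turns are closed with the </s> token.
            parts.append(f" {assistant}</s>")
    return "".join(parts)


prompt = build_prompt(
    [("USER MESSAGE", "ASSISTANT MESSAGE"), ("USER MESSAGE", None)],
    system="SYSTEM MESSAGE",
)
print(prompt)
```

Ending the prompt on an open `[/INST]` leaves the final assistant turn for the model to generate.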

SillyTavern templates

Below are Instruct and Context templates for use within SillyTavern.



Axolotl config


Credits

We'd like to thank Recursal / Featherless for sponsoring the compute for this training run. Featherless has hosted our Magnum models since the first 72B model, giving thousands of people access to our models and helping us grow.

We would also like to thank all members of Anthracite who made this finetune possible.

Datasets

Training

The model was trained for 2 epochs. We used 8x H100 GPUs, graciously provided by Recursal AI / Featherless AI, for the full-parameter fine-tuning of the model.

Built with Axolotl

Safety

...

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric                 Value
Avg.                   19.95
IFEval (0-Shot)        33.93
BBH (3-Shot)           30.50
MATH Lvl 5 (4-Shot)     9.82
GPQA (0-shot)           6.15
MuSR (0-shot)          10.36
MMLU-PRO (5-shot)      28.93
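
The Avg. value is consistent with an unweighted mean of the six benchmark scores listed. The quick check below assumes that is how the leaderboard computes it, which the card itself does not state. The dictionary simply restates the reported numbers.

```python
# Scores copied from the results listed above.
scores = {
    "IFEval (0-Shot)": 33.93,
    "BBH (3-Shot)": 30.50,
    "MATH Lvl 5 (4-Shot)": 9.82,
    "GPQA (0-shot)": 6.15,
    "MuSR (0-shot)": 10.36,
    "MMLU-PRO (5-shot)": 28.93,
}

# Assumption: the average is a plain arithmetic mean over all six metrics.
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")
```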

Model size: 12B params (Safetensors)
Tensor type: BF16
