This is a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet and Opus.

This model is fine-tuned on top of mistralai/Mistral-Nemo-Instruct-2407.

Prompting

A typical input would look like this:

<s>[INST] SYSTEM MESSAGE
USER MESSAGE[/INST] ASSISTANT MESSAGE</s>[INST] USER MESSAGE[/INST]

SillyTavern templates

Below are Instruct and Context templates for use within SillyTavern.

Axolotl config

Credits

We'd like to thank Recursal / Featherless for sponsoring the compute for this train, Featherless has been hosting our Magnum models since the first 72 B and has given thousands of people access to our models and helped us grow.

We would also like to thank all members of Anthracite who made this finetune possible.

Datasets

Training

The training was done for 2 epochs. We used 8xH100s GPUs graciously provided by Recursal AI / Featherless AI for the full-parameter fine-tuning of the model.

👁 Built with Axolotl

Safety

...

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	19.95
IFEval (0-Shot)	33.93
BBH (3-Shot)	30.50
MATH Lvl 5 (4-Shot)	9.82
GPQA (0-shot)	6.15
MuSR (0-shot)	10.36
MMLU-PRO (5-shot)	28.93

Downloads last month: 536

Safetensors

Model size

12B params

Tensor type

BF16

Model tree for anthracite-org/magnum-v4-12b

Finetunes

2 models

Merges

52 models

Quantizations

9 models

Datasets used to train anthracite-org/magnum-v4-12b

Spaces using anthracite-org/magnum-v4-12b 9

Collection including anthracite-org/magnum-v4-12b

18 items • Updated Oct 20, 2024 • 34

Evaluation results

strict accuracy on IFEval (0-Shot)
Open LLM Leaderboard
33.930
normalized accuracy on BBH (3-Shot)
Open LLM Leaderboard
30.500
exact match on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard
9.820
acc_norm on GPQA (0-shot)
Open LLM Leaderboard
6.150
acc_norm on MuSR (0-shot)
Open LLM Leaderboard
10.360
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard
28.930

URL: https://huggingface.co/anthracite-org/magnum-v4-12b

⇱ anthracite-org/magnum-v4-12b · Hugging Face