Paper • 2403.19522 • Published • 15
merge
This is a merge of pre-trained language models created using mergekit.
Merge Details
Merge Method
This model was merged using the Model Stock merge method using huihui-ai/Dolphin3.0-Llama3.1-8B-abliterated as a base.
Models Merged
The following models were included in the merge:
Configuration
The following YAML configuration was used to produce this model:
merge_method: model_stock
models:
- model: mlabonne/Hermes-3-Llama-3.1-8B-lorablated
parameters:
weight: 1.0
- model: meditsolutions/Llama-3.1-MedIT-SUN-8B
parameters:
weight: 1.0
base_model: huihui-ai/Dolphin3.0-Llama3.1-8B-abliterated
dtype: bfloat16
normalize: true
chat_template: auto
tokenizer:
source: union
Open LLM Leaderboard Evaluation Results
Detailed results can be found here
| Metric | Value |
|---|---|
| Avg. | 23.45 |
| IFEval (0-Shot) | 50.87 |
| BBH (3-Shot) | 31.71 |
| MATH Lvl 5 (4-Shot) | 13.44 |
| GPQA (0-shot) | 5.93 |
| MuSR (0-shot) | 10.21 |
| MMLU-PRO (5-shot) | 28.56 |
- Downloads last month
- 9
Safetensors
Model size
8B params
Tensor type
BF16
·
Model tree for Nexesenex/Llama_3.1_8b_Dolermed_V1.01
Merge model
this model
Paper for Nexesenex/Llama_3.1_8b_Dolermed_V1.01
Evaluation results
- strict accuracy on IFEval (0-Shot)Open LLM Leaderboard50.870
- normalized accuracy on BBH (3-Shot)Open LLM Leaderboard31.710
- exact match on MATH Lvl 5 (4-Shot)Open LLM Leaderboard13.440
- acc_norm on GPQA (0-shot)Open LLM Leaderboard5.930
- acc_norm on MuSR (0-shot)Open LLM Leaderboard10.210
- accuracy on MMLU-PRO (5-shot)test set Open LLM Leaderboard28.560
