🐦 Llama-3-8B-Magpie-Align-SFT-v0.2

Project Web: https://magpie-align.github.io/

Arxiv Technical Report: https://arxiv.org/abs/2406.08464

Codes: https://github.com/magpie-align/magpie

About This Model

This model is a fine-tuned version of meta-llama/Meta-Llama-3-8B on

Compared to v0.1, we enhance its reasoning ability by incorporating a reasoning dataset (150K math, code, and reasoning data). It achieves performance comparable with the official Llama-3-8B-Instruct Model with SFT only! The detailed benchmark performance is as follows:

MT-Bench: 8.350 (1st Turn), 7.700 (Second Turn), 8.025 (Average)
Alpaca Eval 2 (GPT-4-Turbo-1106): 24.89 (LC), 24.63 (WR)
Alpaca Eval 2 (Llama-3-8B-Instruct): 54.70 (LC), 54.73 (WR)
Arena Hard: 19.1

Other Information

License: Please follow Meta Llama 3 Community License.

Conversation Template: Please use Llama 3 official chat template for the best performance.

How to use it? Please check the official Llama 3 repository for detailed instructions. Simply replace the original model_id with Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.2.

Citation

If you find the model, data, or code useful, please cite our paper:

@article{xu2024magpie,
 title={Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing}, 
 author={Zhangchen Xu and Fengqing Jiang and Luyao Niu and Yuntian Deng and Radha Poovendran and Yejin Choi and Bill Yuchen Lin},
 year={2024},
 eprint={2406.08464},
 archivePrefix={arXiv},
 primaryClass={cs.CL}
}

Questions? Please contact Zhangchen by email.

Paper Abstract

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 2e-05
train_batch_size: 1
eval_batch_size: 1
seed: 42
distributed_type: multi-GPU
num_devices: 4
gradient_accumulation_steps: 32
total_train_batch_size: 128
total_eval_batch_size: 4
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: cosine
lr_scheduler_warmup_steps: 79
num_epochs: 2

Training results

Training Loss	Epoch	Step	Validation Loss
0.8241	0.0024	1	0.8068
0.5623	0.2007	85	0.5087
0.4704	0.4014	170	0.4326
0.4478	0.6020	255	0.4079
0.4256	0.8027	340	0.3948
0.4261	1.0034	425	0.3867
0.3662	1.1844	510	0.3850
0.363	1.3851	595	0.3823
0.357	1.5858	680	0.3813
0.3677	1.7865	765	0.3813

Framework versions

Transformers 4.42.3
Pytorch 2.3.1+cu121
Datasets 2.19.1
Tokenizers 0.19.1

Internal name for identification: Llama-3-8B-Magpie-Mix-300KMT-150KR. Please change the model name in the below Axolotl config.

👁 Built with Axolotl

Downloads last month: 9

Safetensors

Model size

8B params

Tensor type

BF16

Model tree for Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.2

Base model

meta-llama/Meta-Llama-3-8B

Finetuned

(599)

this model

Finetunes

1 model

Datasets used to train Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.2

Collection including Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.2

Open-aligned models using Magpie datasets. • 11 items • Updated Jan 13, 2025 • 1

Paper for Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.2

Paper • 2406.08464 • Published Jun 12, 2024 • 72

URL: https://huggingface.co/Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.2

⇱ Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.2 · Hugging Face