🐦 Llama-3-8B-Magpie-Align-SFT-v0.3

Project Web: https://magpie-align.github.io/

Arxiv Technical Report: https://arxiv.org/abs/2406.08464

Codes: https://github.com/magpie-align/magpie

🧐 About This Model

This model is a fine-tuned version of meta-llama/Meta-Llama-3-8B on

Compared to v0.2, we enhance its multi-lingual ability by incorporating a new dataset with 200K Chinese instructions. It achieves performance comparable with the official Llama-3-8B-Instruct Model with SFT only! The detailed benchmark performance is as follows:

MT-Bench: 8.050 (1st Turn), 7.350 (Second Turn), 7.700 (Average)
Alpaca Eval 2 (GPT-4-Turbo-1106): 26.37 (LC), 26.42 (WR)
Alpaca Eval 2 (Llama-3-8B-Instruct): 54.53 (LC), 55.26 (WR)
Arena Hard: 20.6

👀 Other Information

License: Please follow Meta Llama 3 Community License.

Conversation Template: Please use Llama 3 official chat template for the best performance.

How to use it? Please check the official Llama 3 repository for detailed instructions. Simply replace the original model_id with Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3.

📚 Citation

If you find the model, data, or code useful, please cite our paper:

@article{xu2024magpie,
 title={Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing}, 
 author={Zhangchen Xu and Fengqing Jiang and Luyao Niu and Yuntian Deng and Radha Poovendran and Yejin Choi and Bill Yuchen Lin},
 year={2024},
 eprint={2406.08464},
 archivePrefix={arXiv},
 primaryClass={cs.CL}
}

Questions? Please contact Zhangchen by email.

Paper Abstract

🏃‍♂️‍➡️ Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 2e-05
train_batch_size: 1
eval_batch_size: 1
seed: 42
distributed_type: multi-GPU
num_devices: 4
gradient_accumulation_steps: 32
total_train_batch_size: 128
total_eval_batch_size: 4
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: cosine
lr_scheduler_warmup_steps: 98
num_epochs: 2

Training results

Training Loss	Epoch	Step	Validation Loss
0.8616	0.0019	1	0.8870
0.5554	0.2013	106	0.5568
0.5067	0.4027	212	0.5065
0.4728	0.6040	318	0.4865
0.4681	0.8054	424	0.4740
0.4563	1.0067	530	0.4662
0.4115	1.1944	636	0.4642
0.3993	1.3957	742	0.4620
0.4048	1.5971	848	0.4613
0.4167	1.7984	954	0.4611

Framework versions

Transformers 4.42.3
Pytorch 2.3.1+cu121
Datasets 2.19.1
Tokenizers 0.19.1

Internal name for identification: Llama-3-8B-Magpie-Mix-RC. Please change the model name in the below Axolotl config.

👁 Built with Axolotl

Downloads last month: 8,087

Safetensors

Model size

8B params

Tensor type

BF16

Model tree for Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3

Base model

meta-llama/Meta-Llama-3-8B

Finetuned

(599)

this model

Finetunes

1 model

Quantizations

1 model

Datasets used to train Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3

Spaces using Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3 10

Collection including Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3

Open-aligned models using Magpie datasets. • 11 items • Updated Jan 13, 2025 • 1

Paper for Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3

Paper • 2406.08464 • Published Jun 12, 2024 • 72

URL: https://huggingface.co/Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3

⇱ Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3 · Hugging Face