๐ฆ Llama-3-8B-Magpie-Align-SFT-v0.3
Project Web: https://magpie-align.github.io/
Arxiv Technical Report: https://arxiv.org/abs/2406.08464
Codes: https://github.com/magpie-align/magpie
๐ง About This Model
This model is a fine-tuned version of meta-llama/Meta-Llama-3-8B on
- Magpie-Align/Magpie-Pro-MT-300K-v0.1,
- Magpie-Align/Magpie-Reasoning-150K, and
- Magpie-Align/Magpie-Qwen2-Pro-200K-Chinese
Compared to v0.2, we enhance its multi-lingual ability by incorporating a new dataset with 200K Chinese instructions. It achieves performance comparable with the official Llama-3-8B-Instruct Model with SFT only! The detailed benchmark performance is as follows:
- MT-Bench: 8.050 (1st Turn), 7.350 (Second Turn), 7.700 (Average)
- Alpaca Eval 2 (GPT-4-Turbo-1106): 26.37 (LC), 26.42 (WR)
- Alpaca Eval 2 (Llama-3-8B-Instruct): 54.53 (LC), 55.26 (WR)
- Arena Hard: 20.6
๐ Other Information
License: Please follow Meta Llama 3 Community License.
Conversation Template: Please use Llama 3 official chat template for the best performance.
How to use it? Please check the official Llama 3 repository for detailed instructions. Simply replace the original model_id with Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3.
๐ Citation
If you find the model, data, or code useful, please cite our paper:
@article{xu2024magpie,
title={Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing},
author={Zhangchen Xu and Fengqing Jiang and Luyao Niu and Yuntian Deng and Radha Poovendran and Yejin Choi and Bill Yuchen Lin},
year={2024},
eprint={2406.08464},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
Questions? Please contact Zhangchen by email.
Paper Abstract
๐โโ๏ธโโก๏ธ Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 1
- eval_batch_size: 1
- seed: 42
- distributed_type: multi-GPU
- num_devices: 4
- gradient_accumulation_steps: 32
- total_train_batch_size: 128
- total_eval_batch_size: 4
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: cosine
- lr_scheduler_warmup_steps: 98
- num_epochs: 2
Training results
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.8616 | 0.0019 | 1 | 0.8870 |
| 0.5554 | 0.2013 | 106 | 0.5568 |
| 0.5067 | 0.4027 | 212 | 0.5065 |
| 0.4728 | 0.6040 | 318 | 0.4865 |
| 0.4681 | 0.8054 | 424 | 0.4740 |
| 0.4563 | 1.0067 | 530 | 0.4662 |
| 0.4115 | 1.1944 | 636 | 0.4642 |
| 0.3993 | 1.3957 | 742 | 0.4620 |
| 0.4048 | 1.5971 | 848 | 0.4613 |
| 0.4167 | 1.7984 | 954 | 0.4611 |
Framework versions
- Transformers 4.42.3
- Pytorch 2.3.1+cu121
- Datasets 2.19.1
- Tokenizers 0.19.1
Internal name for identification: Llama-3-8B-Magpie-Mix-RC. Please change the model name in the below Axolotl config.
- Downloads last month
- 8,087
Model tree for Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3
Base model
meta-llama/Meta-Llama-3-8B