🐦 Llama-3-8B-Magpie-Align-SFT-v0.2
Project Web: https://magpie-align.github.io/
Arxiv Technical Report: https://arxiv.org/abs/2406.08464
Codes: https://github.com/magpie-align/magpie
About This Model
This model is a fine-tuned version of meta-llama/Meta-Llama-3-8B on
Compared to v0.1, we enhance its reasoning ability by incorporating a reasoning dataset (150K math, code, and reasoning data). It achieves performance comparable with the official Llama-3-8B-Instruct Model with SFT only! The detailed benchmark performance is as follows:
- MT-Bench: 8.350 (1st Turn), 7.700 (Second Turn), 8.025 (Average)
- Alpaca Eval 2 (GPT-4-Turbo-1106): 24.89 (LC), 24.63 (WR)
- Alpaca Eval 2 (Llama-3-8B-Instruct): 54.70 (LC), 54.73 (WR)
- Arena Hard: 19.1
Other Information
License: Please follow Meta Llama 3 Community License.
Conversation Template: Please use Llama 3 official chat template for the best performance.
How to use it? Please check the official Llama 3 repository for detailed instructions. Simply replace the original model_id with Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.2.
Citation
If you find the model, data, or code useful, please cite our paper:
@article{xu2024magpie,
title={Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing},
author={Zhangchen Xu and Fengqing Jiang and Luyao Niu and Yuntian Deng and Radha Poovendran and Yejin Choi and Bill Yuchen Lin},
year={2024},
eprint={2406.08464},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
Questions? Please contact Zhangchen by email.
Paper Abstract
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 1
- eval_batch_size: 1
- seed: 42
- distributed_type: multi-GPU
- num_devices: 4
- gradient_accumulation_steps: 32
- total_train_batch_size: 128
- total_eval_batch_size: 4
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: cosine
- lr_scheduler_warmup_steps: 79
- num_epochs: 2
Training results
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.8241 | 0.0024 | 1 | 0.8068 |
| 0.5623 | 0.2007 | 85 | 0.5087 |
| 0.4704 | 0.4014 | 170 | 0.4326 |
| 0.4478 | 0.6020 | 255 | 0.4079 |
| 0.4256 | 0.8027 | 340 | 0.3948 |
| 0.4261 | 1.0034 | 425 | 0.3867 |
| 0.3662 | 1.1844 | 510 | 0.3850 |
| 0.363 | 1.3851 | 595 | 0.3823 |
| 0.357 | 1.5858 | 680 | 0.3813 |
| 0.3677 | 1.7865 | 765 | 0.3813 |
Framework versions
- Transformers 4.42.3
- Pytorch 2.3.1+cu121
- Datasets 2.19.1
- Tokenizers 0.19.1
Internal name for identification: Llama-3-8B-Magpie-Mix-300KMT-150KR. Please change the model name in the below Axolotl config.
- Downloads last month
- 9
