VOOZH about

URL: https://huggingface.co/Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3

โ‡ฑ Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3 ยท Hugging Face


๐Ÿ‘ Magpie

๐Ÿฆ Llama-3-8B-Magpie-Align-SFT-v0.3

Project Web: https://magpie-align.github.io/

Arxiv Technical Report: https://arxiv.org/abs/2406.08464

Codes: https://github.com/magpie-align/magpie

๐Ÿง About This Model

This model is a fine-tuned version of meta-llama/Meta-Llama-3-8B on

Compared to v0.2, we enhance its multi-lingual ability by incorporating a new dataset with 200K Chinese instructions. It achieves performance comparable with the official Llama-3-8B-Instruct Model with SFT only! The detailed benchmark performance is as follows:

  • MT-Bench: 8.050 (1st Turn), 7.350 (Second Turn), 7.700 (Average)
  • Alpaca Eval 2 (GPT-4-Turbo-1106): 26.37 (LC), 26.42 (WR)
  • Alpaca Eval 2 (Llama-3-8B-Instruct): 54.53 (LC), 55.26 (WR)
  • Arena Hard: 20.6

๐Ÿ‘€ Other Information

License: Please follow Meta Llama 3 Community License.

Conversation Template: Please use Llama 3 official chat template for the best performance.

How to use it? Please check the official Llama 3 repository for detailed instructions. Simply replace the original model_id with Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3.

๐Ÿ“š Citation

If you find the model, data, or code useful, please cite our paper:

@article{xu2024magpie,
 title={Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing}, 
 author={Zhangchen Xu and Fengqing Jiang and Luyao Niu and Yuntian Deng and Radha Poovendran and Yejin Choi and Bill Yuchen Lin},
 year={2024},
 eprint={2406.08464},
 archivePrefix={arXiv},
 primaryClass={cs.CL}
}

Questions? Please contact Zhangchen by email.

Paper Abstract

๐Ÿƒโ€โ™‚๏ธโ€โžก๏ธ Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 1
  • eval_batch_size: 1
  • seed: 42
  • distributed_type: multi-GPU
  • num_devices: 4
  • gradient_accumulation_steps: 32
  • total_train_batch_size: 128
  • total_eval_batch_size: 4
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: cosine
  • lr_scheduler_warmup_steps: 98
  • num_epochs: 2

Training results

Training Loss Epoch Step Validation Loss
0.8616 0.0019 1 0.8870
0.5554 0.2013 106 0.5568
0.5067 0.4027 212 0.5065
0.4728 0.6040 318 0.4865
0.4681 0.8054 424 0.4740
0.4563 1.0067 530 0.4662
0.4115 1.1944 636 0.4642
0.3993 1.3957 742 0.4620
0.4048 1.5971 848 0.4613
0.4167 1.7984 954 0.4611

Framework versions

  • Transformers 4.42.3
  • Pytorch 2.3.1+cu121
  • Datasets 2.19.1
  • Tokenizers 0.19.1

Internal name for identification: Llama-3-8B-Magpie-Mix-RC. Please change the model name in the below Axolotl config.

๐Ÿ‘ Built with Axolotl


Downloads last month
8,087
Safetensors
Model size
8B params
Tensor type
BF16
ยท

Model tree for Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3

Finetuned
(599)
this model
Finetunes
1 model
Quantizations
1 model

Datasets used to train Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3

Spaces using Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3 10

Collection including Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3

Paper for Magpie-Align/Llama-3-8B-Magpie-Align-SFT-v0.3