Llama-Gemma-2-27b-SFT-trial1

概要

google/gemma-2-27bを教師あり学習によりInstruction Tuningしたモデルです。

松尾研大規模言語モデル講座2024のコンペ用の提出モデル作成の一環として作成・公開しています。

This model is built with Llama and Qwen.

使用データセット

ライセンス

本モデルは学習に利用したデータの関係で以下のライセンスの影響を受けます。

META LLAMA 3.1 COMMUNITY LICENSEを継承します。
Gemma Terms of Useを継承します。
Qwen LICENSE AGREEMENTの影響を受けます。ライセンスは継承しませんが、「Built with Qwen」のような文言を記載する必要があります。

学習に関する詳細

本モデルの学習にはaxolotlを使いました。パラメータ等の学習の設定は下記の自動生成された記述をご確認ください。

👁 Built with Axolotl

fft-1

This model is a fine-tuned version of google/gemma-2-27b on the None dataset. It achieves the following results on the evaluation set:

Loss: 0.6122

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 1e-05
train_batch_size: 8
eval_batch_size: 1
seed: 42
distributed_type: multi-GPU
num_devices: 7
gradient_accumulation_steps: 4
total_train_batch_size: 224
total_eval_batch_size: 7
optimizer: Use OptimizerNames.PAGED_ADAMW_8BIT with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: cosine
lr_scheduler_warmup_steps: 10
num_epochs: 2

Training results

Training Loss	Epoch	Step	Validation Loss
0.9427	0.0020	1	0.9940
0.6566	0.2043	100	0.6648
0.6609	0.4086	200	0.6430
0.6457	0.6129	300	0.6306
0.6322	0.8172	400	0.6203
0.5082	1.0204	500	0.6238
0.5348	1.2247	600	0.6212
0.5253	1.4290	700	0.6181
0.5136	1.6333	800	0.6147
0.5125	1.8376	900	0.6122

Framework versions

Transformers 4.46.3
Pytorch 2.3.1+cu121
Datasets 3.1.0
Tokenizers 0.20.3

Downloads last month: 14

Safetensors

Model size

27B params

Tensor type

BF16

Model tree for Aratako/Llama-Gemma-2-27b-SFT-trial1

Base model

google/gemma-2-27b

Finetuned

(27)

this model

Finetunes

1 model

URL: https://huggingface.co/Aratako/Llama-Gemma-2-27b-SFT-trial1

⇱ Aratako/Llama-Gemma-2-27b-SFT-trial1 · Hugging Face

Llama-Gemma-2-27b-SFT-trial1

概要

使用データセット

ライセンス

学習に関する詳細

fft-1

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

Model tree for Aratako/Llama-Gemma-2-27b-SFT-trial1