nemo-kink-lora

This model is a fine-tuned version of mistralai/Mistral-Nemo-Base-2407 on the Alfitaria/synthkink-combined-completions and the Alfitaria/bodinforg-completions datasets. It achieves the following results on the evaluation set:

Loss: 1.2252

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 1e-05
train_batch_size: 2
eval_batch_size: 2
seed: 69
distributed_type: multi-GPU
num_devices: 2
gradient_accumulation_steps: 2
total_train_batch_size: 8
total_eval_batch_size: 4
optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: cosine
lr_scheduler_warmup_steps: 5
training_steps: 181

Training results

Training Loss	Epoch	Step	Validation Loss
1.5045	0.0055	1	1.5520
1.4399	0.1047	19	1.3974
1.3297	0.2094	38	1.3388
1.4392	0.3140	57	1.3054
1.2685	0.4187	76	1.2815
0.9801	0.5234	95	1.2641
1.1412	0.6281	114	1.2507
1.1564	0.7328	133	1.2393
1.1739	0.8375	152	1.2313
1.2154	0.9421	171	1.2252

Framework versions

PEFT 0.15.2
Transformers 4.51.3
Pytorch 2.7.0+cu128
Datasets 3.5.1
Tokenizers 0.21.1

Downloads last month: 3

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ToastyPigeon/nemo-kimi-lora

Base model

mistralai/Mistral-Nemo-Base-2407

Adapter

(10)

this model

Merges

5 models

URL: https://huggingface.co/ToastyPigeon/nemo-kimi-lora

⇱ ToastyPigeon/nemo-kimi-lora · Hugging Face

nemo-kink-lora

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

Model tree for ToastyPigeon/nemo-kimi-lora

Dataset used to train ToastyPigeon/nemo-kimi-lora