VOOZH about

URL: https://huggingface.co/Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8.8

⇱ Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8.8 · Hugging Face


merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the SLERP merge method.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

base_model: Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8
merge_method: slerp
tokenizer_source: base
dtype: bfloat16
parameters:
 t:
 - filter: self_attn
 value: [0.0, 0.5, 0.3, 0.7, 1.0]
 - filter: mlp
 value: [1.0, 0.5, 0.7, 0.3, 0.0]
 - value: 0.5
slices:
 - sources:
 - model: Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8
 layer_range: [0, 24]
 - model: Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v5
 layer_range: [0, 24]
 - sources:
 - model: Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8
 layer_range: [24, 48]
 - model: Lunzima/NQLSG-Qwen2.5-14B-OriginalFusion
 layer_range: [24, 48]
Downloads last month
9
Safetensors
Model size
15B params
Tensor type
BF16
·

Model tree for Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8.8