URL: https://huggingface.co/Naphula-Archives/F5-stage7-12B

Naphula-Archives/F5-stage7-12B · Hugging Face


merge_method: ramplus_tl 
base_model: C:\mergekit-main\merged_model_BloodKrakenMuse
models: 
 - model: C:\mergekit-main\merged_model_BloodKrakenMuse 
 - model: C:\mergekit-main\merged_model_RedKFlux 
parameters: 
 epsilon: 0.001 # Increased from 1e-5 to 1e-3 for denser SFT/DPO task vectors 
 r: 0.25 # Increased from 0.1 into the 0.2-0.3 range for better SFT behavior preservation
 alpha: 0.4 # Increased from 0.2 to 0.4 for enhanced rescaling
dtype: float32
out_dtype: bfloat16
tokenizer:
 source: base
name: Stage7
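
The config above sets `dtype: float32` for the merge and `out_dtype: bfloat16` for the saved weights. The sketch below is one way to see why that ordering can matter. It is not the `ramplus_tl` implementation and does not use mergekit; the helper names (`to_bfloat16`, `to_float32`, `accumulate`) and the numbers are hypothetical, chosen only to show how small updates behave under bfloat16 rounding.

```python
import struct


def to_float32(x: float) -> float:
    """Round a Python float to the nearest float32 value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def to_bfloat16(x: float) -> float:
    """Round a finite float to the nearest bfloat16 value (ties to even).

    bfloat16 keeps the top 16 bits of the float32 bit pattern.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # Bias so that the truncation below rounds to nearest, ties to even.
    rounding_bias = 0x7FFF + ((bits >> 16) & 1)
    bits = (bits + rounding_bias) & 0xFFFF0000
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def accumulate(start: float, delta: float, steps: int, round_fn) -> float:
    """Add `delta` to `start` `steps` times, rounding after every addition."""
    value = round_fn(start)
    step = round_fn(delta)
    for _ in range(steps):
        value = round_fn(value + step)
    return value


# Illustrative numbers only; not taken from the actual model weights.
base_weight = 1.0
small_update = 0.001

# Work in float32, cast to bfloat16 once at the end (dtype / out_dtype split).
fp32_then_cast = to_bfloat16(accumulate(base_weight, small_update, 8, to_float32))

# Round to bfloat16 after every step instead.
bf16_throughout = accumulate(base_weight, small_update, 8, to_bfloat16)

print(fp32_then_cast, bf16_throughout)
```

With these numbers, each update is smaller than half the bfloat16 spacing near 1.0, so `bf16_throughout` never moves off 1.0, while `fp32_then_cast` ends at 1.0078125, the bfloat16 value nearest to the float32 result.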
Format: Safetensors
Model size: 12B params
Tensor type: BF16

Model tree for Naphula-Archives/F5-stage7-12B

Merges
2 models