VOOZH about

URL: https://huggingface.co/allura-forge/Llama-3.3-8B-Instruct

⇱ allura-forge/Llama-3.3-8B-Instruct · Hugging Face


Llama 3.3 8B Instruct

Yes, this is official, and yes, this is, to my knowledge, a real version of Llama 3.3 8B!

I would highly recommend trying both this model, and a version with the Llama 3.3 70B config applied to extend the context length to 128k. I am unsure as to which one of these is closest to real; while the original copy I downloaded came with the 8k context configuration, benchmarks seem to slightly improve on the 128k version.

Benchmarks

Llama 3.1 8B Instruct Llama 3.3 8B Instruct (as downloaded from Facebook) Llama 3.3 8B Instruct w/ Llama 3.3 70B RoPE config to extend to 128k context
IFEval (1 epoch, score avged across all strict/loose instruction/prompt accuracies to follow Llama 3 paper) 78.2 81.95 84.775
GPQA Diamond (3 epochs) 29.3 37.0 37.5

All benchmarks done in OpenBench at 1.0 temp.

Downloads last month
684
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 4 Ask for provider support

Model tree for allura-forge/Llama-3.3-8B-Instruct

Finetunes
12 models
Quantizations
16 models