Llama 3.3 8B Instruct

Yes, this is official, and yes, this is, to my knowledge, a real version of Llama 3.3 8B!

I would highly recommend trying both this model, and a version with the Llama 3.3 70B config applied to extend the context length to 128k. I am unsure as to which one of these is closest to real; while the original copy I downloaded came with the 8k context configuration, benchmarks seem to slightly improve on the 128k version.

Benchmarks

	Llama 3.1 8B Instruct	Llama 3.3 8B Instruct (as downloaded from Facebook)	Llama 3.3 8B Instruct w/ Llama 3.3 70B RoPE config to extend to 128k context
IFEval (1 epoch, score avged across all strict/loose instruction/prompt accuracies to follow Llama 3 paper)	78.2	81.95	84.775
GPQA Diamond (3 epochs)	29.3	37.0	37.5

All benchmarks done in OpenBench at 1.0 temp.

Safetensors

Model size

8B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 4 Ask for provider support

Finetunes

Quantizations