Llama 3.3 8B Instruct
Yes, this is official, and yes, to my knowledge this is a real version of Llama 3.3 8B!
I highly recommend trying both this model and a version with the Llama 3.3 70B RoPE config applied to extend the context length to 128k. I am unsure which of the two is closer to the real thing: the original copy I downloaded came with the 8k context configuration, but benchmarks seem to improve slightly on the 128k version.
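As a rough illustration of what applying the 70B RoPE config could involve, the sketch below patches a `config.json`-style dictionary. The `rope_scaling` values and the `max_position_embeddings` value are assumptions, believed to match the published Llama 3.3 70B `config.json`; check them against that file before relying on them. The function name `apply_128k_rope` and the sample input are hypothetical.

```python
import copy
import json

# Assumed values: believed to match the rope_scaling block in the published
# Llama 3.3 70B config.json. Verify against that file before use.
ASSUMED_ROPE_SCALING = {
    "factor": 8.0,
    "low_freq_factor": 1.0,
    "high_freq_factor": 4.0,
    "original_max_position_embeddings": 8192,
    "rope_type": "llama3",
}
ASSUMED_MAX_POSITIONS = 131072  # 128k tokens


def apply_128k_rope(config: dict) -> dict:
    """Return a copy of a config.json-style dict with the 128k RoPE settings."""
    patched = copy.deepcopy(config)  # leave the caller's dict untouched
    patched["rope_scaling"] = dict(ASSUMED_ROPE_SCALING)
    patched["max_position_embeddings"] = ASSUMED_MAX_POSITIONS
    return patched


if __name__ == "__main__":
    # Hypothetical stand-in for the 8k config that ships with the download.
    original = {"max_position_embeddings": 8192, "rope_scaling": None}
    print(json.dumps(apply_128k_rope(original), indent=2))
```

In practice you would load the model's real `config.json` with `json.load`, pass it through the function, and write the result back before loading the weights. Keeping a copy of the original 8k config makes it easy to compare both variants.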
Benchmarks
| Benchmark | Llama 3.1 8B Instruct | Llama 3.3 8B Instruct (as downloaded from Facebook) | Llama 3.3 8B Instruct w/ Llama 3.3 70B RoPE config to extend to 128k context |
|---|---|---|---|
| IFEval (1 epoch, score averaged across all strict/loose instruction/prompt accuracies, following the Llama 3 paper) | 78.2 | 81.95 | 84.775 |
| GPQA Diamond (3 epochs) | 29.3 | 37.0 | 37.5 |
All benchmarks were run in OpenBench at temperature 1.0.
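The IFEval figure in the table is a single number averaged over four accuracies. A minimal sketch of that averaging, assuming an unweighted mean of the four values, looks like this. The function and argument names are hypothetical and do not correspond to OpenBench output fields.

```python
def ifeval_average(prompt_strict: float, prompt_loose: float,
                   inst_strict: float, inst_loose: float) -> float:
    """Unweighted mean of the four IFEval accuracies.

    Assumption: each accuracy is on the same scale (e.g. 0-100) and all
    four carry equal weight in the reported score.
    """
    scores = [prompt_strict, prompt_loose, inst_strict, inst_loose]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Placeholder inputs for illustration only, not measured results.
    print(ifeval_average(50.0, 75.0, 100.0, 75.0))
```

Reporting one averaged number makes the models easy to compare in a table, at the cost of hiding how the strict and loose criteria differ.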
