URL: https://huggingface.co/open-sci/open-sci-ref-v0.02-1.7b-nemotron-hq-1T-4096-rope_theta-100k


open-sci-ref-v0.02-1.7b-nemotron-hq-1T-4096-rope_theta-100k

1.7B open-sci-ref model trained on Nemotron-CC HQ for 1T tokens (sequence length 4096, RoPE theta = 100000).

The main branch holds the final checkpoint (iteration 238419). Intermediate checkpoints (iterations 58000 to 238000, one every 2000 iterations) are available as branches named iter_XXXXXXX.
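A minimal sketch of selecting a checkpoint by branch, assuming the repository loads with the standard `transformers` Auto classes and that `iter_XXXXXXX` means the iteration number zero-padded to seven digits (the card does not spell out the padding, so check the repository's branch list before relying on it). The helper names `checkpoint_iterations`, `branch_name`, and `load_checkpoint` are hypothetical.

```python
REPO_ID = "open-sci/open-sci-ref-v0.02-1.7b-nemotron-hq-1T-4096-rope_theta-100k"


def checkpoint_iterations():
    """Iterations of the intermediate checkpoints: 58000 to 238000, step 2000."""
    return list(range(58000, 238000 + 1, 2000))


def branch_name(iteration):
    """Branch name for an intermediate checkpoint.

    ASSUMPTION: the seven X's in iter_XXXXXXX stand for a zero-padded
    iteration number; this is not confirmed by the model card.
    """
    return f"iter_{iteration:07d}"


def load_checkpoint(iteration=None):
    """Load the tokenizer and model for one checkpoint.

    iteration=None loads the final checkpoint from the main branch.
    """
    # Imported here so the helpers above work without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    revision = "main" if iteration is None else branch_name(iteration)
    tokenizer = AutoTokenizer.from_pretrained(REPO_ID, revision=revision)
    model = AutoModelForCausalLM.from_pretrained(REPO_ID, revision=revision)
    return tokenizer, model
```

For example, `load_checkpoint(100000)` would request the branch `iter_0100000` under the padding assumption, while `load_checkpoint()` fetches the final checkpoint from main.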

Evaluation

Results for the final checkpoint on the open-sci-0.01 suite, run with lm-eval-harness. Metrics were collected with oellm collect-results.

Task            n-shot  Metric    Score
arc_challenge   10      acc_norm  0.5316
arc_easy        10      acc_norm  0.8190
boolq           10      acc       0.7615
commonsense_qa  10      acc       0.6003
copa            0       acc       0.8600
hellaswag       10      acc_norm  0.7440
lambada_openai  0       acc       0.6136
mmlu            5       acc       0.5212
openbookqa      0       acc_norm  0.4360
piqa            10      acc_norm  0.8041
social_iqa      0       acc       0.4458
winogrande      0       acc       0.6606
average                           0.6498
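The reported average is consistent with an unweighted mean of the twelve task scores. A small sketch that recomputes it from the values in the table (the variable names are illustrative only):

```python
# Scores copied from the evaluation table above.
scores = {
    "arc_challenge": 0.5316,
    "arc_easy": 0.8190,
    "boolq": 0.7615,
    "commonsense_qa": 0.6003,
    "copa": 0.8600,
    "hellaswag": 0.7440,
    "lambada_openai": 0.6136,
    "mmlu": 0.5212,
    "openbookqa": 0.4360,
    "piqa": 0.8041,
    "social_iqa": 0.4458,
    "winogrande": 0.6606,
}

# Unweighted (macro) mean over tasks, rounded to four decimals like the table.
average = round(sum(scores.values()) / len(scores), 4)
print(average)  # 0.6498
```

Note that this mixes acc and acc_norm values and different n-shot settings, so it is a summary figure rather than a single comparable metric.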
Format: Safetensors. Model size: 2B params. Tensor type: BF16.