VOOZH

URL: https://huggingface.co/Felladrin/Smol-Llama-101M-Chat-v1

⇱ Felladrin/Smol-Llama-101M-Chat-v1 · Hugging Face

A Llama Chat Model of 101M Parameters

Base model: BEE-spoke-data/smol_llama-101M-GQA
Datasets:
Availability in other ML formats:
- GGUF: Felladrin/gguf-Smol-Llama-101M-Chat-v1
- ONNX: Felladrin/onnx-Smol-Llama-101M-Chat-v1
- MLC: Felladrin/mlc-q4f16-Smol-Llama-101M-Chat-v1

Recommended Prompt Format

<|im_start|>system
{system_message}<|im_end|>
<|im_start|>user
{user_message}<|im_end|>
<|im_start|>assistant

Recommended Inference Parameters

penalty_alpha: 0.5
top_k: 4
repetition_penalty: 1.105

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	28.73
AI2 Reasoning Challenge (25-Shot)	22.87
HellaSwag (10-Shot)	28.69
MMLU (5-Shot)	24.93
TruthfulQA (0-shot)	45.76
Winogrande (5-shot)	50.04
GSM8k (5-shot)	0.08

Downloads last month: 90

Safetensors

Model size

0.1B params

Tensor type

F32

·

Model tree for Felladrin/Smol-Llama-101M-Chat-v1

Base model

BEE-spoke-data/smol_llama-101M-GQA

Finetuned

(3)

this model

Finetunes

Quantizations

Datasets used to train Felladrin/Smol-Llama-101M-Chat-v1

Space using Felladrin/Smol-Llama-101M-Chat-v1 1

Collection including Felladrin/Smol-Llama-101M-Chat-v1

They may be small, but they're training like giants! • 9 items • Updated Aug 16, 2025 • 20

Evaluation results

normalized accuracy on AI2 Reasoning Challenge (25-Shot)
test set Open LLM Leaderboard
22.870
normalized accuracy on HellaSwag (10-Shot)
validation set Open LLM Leaderboard
28.690
accuracy on MMLU (5-Shot)
test set Open LLM Leaderboard
24.930
mc2 on TruthfulQA (0-shot)
validation set Open LLM Leaderboard
45.760
accuracy on Winogrande (5-shot)
validation set Open LLM Leaderboard
50.040
accuracy on GSM8k (5-shot)
test set Open LLM Leaderboard
0.080