Idea
#1
by HuggingfacebrosLLM - opened
Supra-Edge-100M
Tagline: "The smartest model that fits everywhere."
Goal
A 100M-parameter model in two variants (base and instruct), optimized for:
Phones (Android)
Raspberry Pi
Old laptops
CPUs without GPUs
Offline use
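To make "fits everywhere" concrete, here is a rough sketch of the memory needed for the weights alone. The storage formats listed (fp32, fp16, int8, int4) are my assumptions, not something the proposal fixes, and real quantized files carry some extra overhead for scales and metadata, so treat these as lower bounds.

```python
# Back-of-the-envelope weight memory for a 100M-parameter model.
# Assumption: weights only; activations, KV cache, and runtime overhead are ignored.
PARAMS = 100_000_000

# Bytes per parameter for some common storage formats (illustrative choices).
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_megabytes(params: int, bytes_per_param: float) -> float:
    """Return the approximate weight size in megabytes (1 MB = 1e6 bytes)."""
    return params * bytes_per_param / 1e6


footprint = {
    name: weight_megabytes(PARAMS, size) for name, size in BYTES_PER_PARAM.items()
}

for name, megabytes in footprint.items():
    print(f"{name}: ~{megabytes:.0f} MB")
```

Even in fp16 the weights come to roughly 200 MB, which is why the targets above look plausible on paper.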
Why this is interesting
Going from 50M to 100M parameters is a big step up in reasoning quality, and the model stays extremely lightweight.
Most people build either:
Tiny meme models (1M–10M)
Huge models (7B+)
Very few organizations focus on the 100M sweet spot.
Proposed training data mix:
50B FineWeb-Edu tokens
6B Wikipedia tokens
3B The Stack tokens
59B tokens in total
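As a quick sanity check on the mix, this sketch turns the token counts into sampling proportions and a tokens-per-parameter figure. The dictionary and variable names are just illustrative, and it assumes each source is sampled in proportion to its token count, which the proposal does not actually specify.

```python
# Token budget per source, in billions of tokens (figures from the mix above).
MIX_BILLION_TOKENS = {"FineWeb-Edu": 50, "Wikipedia": 6, "The Stack": 3}
PARAMS = 100_000_000  # target model size

total_billion = sum(MIX_BILLION_TOKENS.values())

# Assumption: sample each source in proportion to its share of the total.
proportions = {
    name: count / total_billion for name, count in MIX_BILLION_TOKENS.items()
}

# Training tokens seen per model parameter over one pass of the mix.
tokens_per_param = total_billion * 1_000_000_000 / PARAMS

print(f"total: {total_billion}B tokens")
for name, share in proportions.items():
    print(f"{name}: {share:.1%}")
print(f"tokens per parameter: {tokens_per_param:.0f}")
```

FineWeb-Edu ends up at about 85% of the mix, with Wikipedia around 10% and The Stack around 5%.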
We are already producing a 124M model.
AxionLab-official changed discussion status to closed
BTW, @HuggingfacebrosLLM, stop using AI to impress us. ☹️
