VOOZH about

URL: https://willitrunai.com/models/smollm3-3b

โ‡ฑ SmolLM3 3B VRAM Requirements โ€” GPU Compatibility


๐Ÿ‘ HuggingFace
HuggingFace

SmolLM3 3B

๐Ÿ‘ huggingface
HuggingFace
649.0KDownloads981LikesJul 2025Released128K tokensContextApache 2.0License21 EntryQuality

SmolLM3 3B (3B parameters) requires approximately 5.6 GB of VRAM with Q4_K_M quantization. For the best balance of quality and speed, we recommend hardware with at least 7 GB of VRAM.

Get started

โ€” copy & paste to run locally

Copy-paste commands to run SmolLM3 3B on your machine.

Run

lms load SmolLM3-3B && lms server start

Quick specs

Parameters3B
Architecturedense
Context128K tokens
Modalitytext
Min RAM1.2 GB
Rec. RAM1.8 GB (Q4_K_M)
LicenseApache 2.0
FamilySmolLM
โœ“ Chatโœ“ Reasoning

About this model

SmolLM3 is a fully open 3B-parameter language model with dual-mode reasoning, 128K context via YARN extrapolation, and native support for 6 languages. Pretrained on 11.2T tokens with a staged curriculum of web, code, math, and reasoning data. Post-trained with 140B reasoning tokens and Anchored Preference Optimization.

  • โ€ขDual-mode reasoning: extended thinking can be toggled on/off
  • โ€ข128K context via YARN extrapolation from 64K training
  • โ€ข6 natively supported languages: English, French, Spanish, German, Italian, Portuguese
  • โ€ขFully open: weights, training details, and public data mixture

Your hardware

Detecting...

Quick picks

๐Ÿ‘ Intel
Best budgetB
Intel Arc A380 6GB~$139 โ€” 42 tok/s
๐Ÿ‘ NVIDIA
Best overallB
RTX 5060 8GB~$299 โ€” 57 tok/s

Best hardware

Top picks for SmolLM3 3B

RTX 5060 8GBB
8 GB
RTX 5060 Ti 8GBB
8 GB
RTX 5050 8GBB
8 GB
RTX 4060 8GBB
8 GB
RTX 4070 Laptop 8GBB
8 GB

Run this model

SmolLM3 3B on RTX 5060 8GBSmolLM3 3B on RTX 5060 Ti 8GBSmolLM3 3B on RTX 5050 8GB

Quantization options

VRAM estimates by quant level

No hardware detected โ€” fit column shows raw VRAM estimates

QuantBitsVRAMQualityFit
Q2_K
2
1.2 GB
Lowโ€”
Q3_K_S
3
1.5 GB
Lowโ€”
NVFP4
4
1.7 GB
Mediumโ€”
Q4_K_M
4
1.8 GB
Mediumโ€”
Q5_K_M
5
2.2 GB
Highโ€”
Q6_K
6
2.5 GB
Highโ€”
Q8_0
8
3.2 GB
Very Highโ€”
F16
16
6.1 GB
Maximumโ€”

Quality benchmarks

SmolLM3 3B benchmark scores

Benchmark verified

Coding

SWE-bench Verifiedโ€”
HumanEval+30.5%
Aider Polyglotโ€”
LiveCodeBenchโ€”

Reasoning

MMLU-Pro32.7%
GPQA Diamond35.7%
MATH-500โ€”
ARC Challengeโ€”

General

Chatbot Arenaโ€”
IFEval76.7%

Source: official ยท 2025-07-02

Hardware compatibility

Fit estimates across all hardware

Open calculator

Computing compatibility...

Memory breakdown

Reference: RTX 2060 6GB

Weights1.8 GB
KV Cache2.0 GB
Runtime1.2 GB
Headroom0.6 GB

Frequently asked questions

FAQ โ€” SmolLM3 3B

See also

Quantization GuideScoring MethodologyVRAM Calculator