Browse AI Models

12 models available

Google Gemma 4 31B

30.7B · 256K ctx · 18.7 GB · frontier · dense · High

Gemma 4 31B is the largest and most capable open Gemma model. It uses a dense architecture with 30.7B parameters and a 256K context window, and scores a Codeforces Elo of 2150 and 89.2% on AIME 2026. Apache 2.0 licensed.

Google Gemma 4 26B A4B

25.2B (3.8B active) · 256K ctx · 15.4 GB · frontier · moe · High

Gemma 4 26B-A4B is Google's MoE model with 25.2B total parameters, 3.8B active per token (128 experts, 8 active). Matches much larger dense models at a fraction of the compute. 256K context. Apache 2.0.
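As a rough illustration of the figures in this entry, the sketch below compares the share of experts routed per token with the share of parameters that are active per token. The helper name `share` is hypothetical; the numbers are taken from the description above.

```python
def share(part: float, whole: float) -> float:
    """Return part / whole as a fraction between 0 and 1."""
    return part / whole

# Figures from the listing: 8 of 128 experts, 3.8B of 25.2B parameters.
expert_share = share(8, 128)
param_share = share(3.8, 25.2)

print(f"experts active per token:    {expert_share:.2%}")
print(f"parameters active per token: {param_share:.2%}")
```

Roughly 6% of the experts are used per token, but about 15% of the parameters are active. The gap is presumably explained by components that run for every token, such as attention and embedding layers; that is an assumption, since the listing does not break the parameter count down.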

Google Gemma 3 27B

27B · 131K ctx · 16.5 GB · current · dense · High

Gemma 3 27B is Google's flagship Gemma 3 model with 128K context and vision support. Delivers top-tier open model performance in reasoning, code, math, and multimodal understanding.
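The badge for this entry reads 131K ctx while the description says 128K. One plausible reading, offered here as an assumption rather than something the page states, is that both refer to the same 131,072-token window, counted once in units of 1,024 and once in units of 1,000. A minimal sketch with hypothetical helper names:

```python
def tokens_from_binary_k(k: int) -> int:
    """Convert a context size quoted in units of 1,024 tokens to a token count."""
    return k * 1024

def decimal_k_label(tokens: int) -> str:
    """Format a token count in units of 1,000 tokens, rounded to the nearest unit."""
    return f"{round(tokens / 1000)}K"

window = tokens_from_binary_k(128)
print(window, decimal_k_label(window))
print(decimal_k_label(tokens_from_binary_k(32)))
```

Under the same assumption, a 32K window would be labelled 33K, which matches the badge on the Gemma 3 1B entry. Not every badge in the listing follows this pattern, so treat it as a reading aid only.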

Google Gemma 3 12B

12B · 131K ctx · 7.3 GB · current · dense · High

Gemma 3 12B is Google's mid-range Gemma 3 model with vision capabilities. Offers strong reasoning, code generation, and image understanding balanced with practical resource requirements.

Google Gemma 4 E4B

8B · 128K ctx · 4.9 GB · frontier · dense · Mid

Gemma 4 E4B is Google's mid-range on-device model with 8B total parameters (4.5B effective). Default Gemma 4 model on Ollama. Supports text and image. Apache 2.0 licensed.

Google Gemma 4 E2B

5.1B · 128K ctx · 3.1 GB · frontier · dense · Mid

Gemma 4 E2B is Google's smallest Gemma 4 model with 5.1B total parameters (2.3B effective via Per-Layer Embeddings). Supports text, image, audio, and video natively. Apache 2.0 licensed. Built on Gemini 3 technology.

Google Gemma 3 4B

4B · 128K ctx · 2.4 GB · current · dense · Mid

Gemma 3 4B is Google's efficient Gemma 3 model supporting vision and text. Ideal for on-device applications requiring multimodal understanding with fast inference speeds.

Google Gemma 2 27B

27B · 8K ctx · 16.5 GB · current · dense · Budget

Gemma 2 27B is Google's largest Gemma 2 model, offering state-of-the-art performance among open models of similar size. Built on Gemini technology with strong reasoning, code, and multilingual capabilities.

Google Gemma 2 9B

9B · 8K ctx · 5.5 GB · current · dense · Budget

Gemma 2 9B is Google's mid-size open model built on Gemini research. Features improved reasoning and safety with a novel architecture optimized for efficient inference on consumer hardware.

Google Gemma 3 1B

1B · 33K ctx · 0.6 GB · current · dense · Legacy

Gemma 3 1B is Google's ultra-compact model from the Gemma 3 family. Optimized for mobile and edge inference with surprisingly capable text generation for its parameter count.

Google Gemma 2 2B

2B · 8K ctx · 1.2 GB · current · dense · Legacy

Gemma 2 2B is Google's lightweight model designed for on-device and edge deployment. Delivers strong text generation and reasoning performance at minimal resource cost.

Google gemma 2b

2B · 0K ctx · 1.2 GB · dense · Legacy
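The size figures in this listing scale almost linearly with parameter count. The sketch below works out the implied bits per weight for several entries, using numbers copied from the cards above. It assumes 1 GB = 10^9 bytes and that the listed size covers weights only; the page does not state which quantization the sizes correspond to, so the result is an inference, not a documented fact.

```python
# (model name, parameters in billions, listed size in GB), copied from the listing
LISTED = [
    ("Gemma 4 31B", 30.7, 18.7),
    ("Gemma 4 26B A4B", 25.2, 15.4),
    ("Gemma 3 27B", 27.0, 16.5),
    ("Gemma 3 12B", 12.0, 7.3),
    ("Gemma 4 E4B", 8.0, 4.9),
    ("Gemma 4 E2B", 5.1, 3.1),
    ("Gemma 3 4B", 4.0, 2.4),
    ("Gemma 2 9B", 9.0, 5.5),
    ("Gemma 2 2B", 2.0, 1.2),
]

def implied_bits_per_weight(params_b: float, size_gb: float) -> float:
    """Bits per parameter, assuming 1 GB = 1e9 bytes and a weights-only size."""
    return size_gb * 8 / params_b

bits = {name: implied_bits_per_weight(p, s) for name, p, s in LISTED}

for name, value in bits.items():
    print(f"{name:<18} {value:.2f} bits/weight")
```

Every entry lands between about 4.8 and 4.9 bits per weight, which is consistent with a 4-to-5-bit quantized download rather than 16-bit weights. Actual memory use at inference time would be higher, since context cache and runtime overhead are not included in this estimate.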

URL: https://willitrunai.com/browse?provider=Google
