URL: https://willitrunai.com/browse?provider=Google

Browse 300+ AI Models for Local Inference | Will It Run AI


Browse AI Models

12 models available

Filtered by: Google

Google Gemma 4 31B
30.7B · 256K ctx · 18.7 GB · frontier
dense · High

Gemma 4 31B is the largest and most capable open Gemma model. Dense architecture with 30.7B parameters and a 256K context window. Achieves a Codeforces Elo of 2150 and 89.2% on AIME 2026. Apache 2.0 licensed.
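
The listed download size can be related to the parameter count with simple arithmetic. The sketch below back-computes the average bits per weight from the two figures on this card. It assumes 1 GB means 10^9 bytes and that the size is weights only; the page does not state which quantization it uses, and the function name is illustrative.

```python
def bits_per_weight(size_gb: float, params_b: float) -> float:
    """Average bits stored per parameter.

    size_gb  : listed model size in GB (assumed 1 GB = 1e9 bytes)
    params_b : parameter count in billions
    """
    # (size_gb * 1e9 bytes * 8 bits) / (params_b * 1e9 params)
    return size_gb * 8 / params_b


# Figures listed for Gemma 4 31B: 18.7 GB, 30.7B parameters.
gemma4_31b_bpw = bits_per_weight(18.7, 30.7)
print(f"Gemma 4 31B: about {gemma4_31b_bpw:.2f} bits per weight")
```

A value between 4 and 5 bits per weight would be consistent with a 4-bit quantized build rather than 16-bit weights, but that reading is an inference from the numbers, not something the listing confirms.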

Google Gemma 4 26B A4B
25.2B (3.8B active) · 256K ctx · 15.4 GB · frontier
moe · High

Gemma 4 26B-A4B is Google's MoE model with 25.2B total parameters, 3.8B active per token (128 experts, 8 active). Matches much larger dense models at a fraction of the compute. 256K context. Apache 2.0.
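
The MoE figures above separate what must be held in memory (all 25.2B parameters) from what is computed per token (3.8B). A rough sketch of that arithmetic, using only the numbers on this card (the function name is illustrative):

```python
def active_fraction(total: float, active: float) -> float:
    """Share of a total that is active per token."""
    return active / total


# Figures listed for Gemma 4 26B A4B.
total_params_b, active_params_b = 25.2, 3.8
num_experts, experts_per_token = 128, 8

param_share = active_fraction(total_params_b, active_params_b)
expert_share = active_fraction(num_experts, experts_per_token)

print(f"active parameters per token: {param_share:.1%}")
print(f"active experts per token:    {expert_share:.2%}")
```

The parameter share comes out higher than the expert share. A plausible reason, not stated on the page, is that non-expert weights such as attention layers and embeddings are used for every token.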

Google Gemma 3 27B
27B · 131K ctx · 16.5 GB · current
dense · High

Gemma 3 27B is Google's flagship Gemma 3 model with a 128K (131,072-token) context window and vision support. Delivers top-tier open model performance in reasoning, code, math, and multimodal understanding.

Google Gemma 3 12B
12B · 131K ctx · 7.3 GB · current
dense · High

Gemma 3 12B is Google's mid-range Gemma 3 model with vision capabilities. Offers strong reasoning, code generation, and image understanding balanced with practical resource requirements.

Google Gemma 4 E4B
8B · 128K ctx · 4.9 GB · frontier
dense · Mid

Gemma 4 E4B is Google's mid-range on-device model with 8B total parameters (4.5B effective). Default Gemma 4 model on Ollama. Supports text and image. Apache 2.0 licensed.

Google Gemma 4 E2B
5.1B · 128K ctx · 3.1 GB · frontier
dense · Mid

Gemma 4 E2B is Google's smallest Gemma 4 model with 5.1B total parameters (2.3B effective via Per-Layer Embeddings). Supports text, image, audio, and video natively. Apache 2.0 licensed. Built on Gemini 3 technology.

Google Gemma 3 4B
4B · 128K ctx · 2.4 GB · current
dense · Mid

Gemma 3 4B is Google's efficient Gemma 3 model supporting vision and text. Ideal for on-device applications requiring multimodal understanding with fast inference speeds.

Google Gemma 2 27B
27B · 8K ctx · 16.5 GB · current
dense · Budget

Gemma 2 27B is Google's largest Gemma 2 model, offering state-of-the-art performance among open models of similar size. Built on Gemini technology with strong reasoning, code, and multilingual capabilities.

Google Gemma 2 9B
9B · 8K ctx · 5.5 GB · current
dense · Budget

Gemma 2 9B is Google's mid-size open model built on Gemini research. Features improved reasoning and safety with a novel architecture optimized for efficient inference on consumer hardware.

Google Gemma 3 1B
1B · 33K ctx · 0.6 GB · current
dense · Legacy

Gemma 3 1B is Google's ultra-compact model from the Gemma 3 family. Optimized for mobile and edge inference with surprisingly capable text generation for its parameter count.

Google Gemma 2 2B
2B · 8K ctx · 1.2 GB · current
dense · Legacy

Gemma 2 2B is Google's lightweight model designed for on-device and edge deployment. Delivers strong text generation and reasoning performance at minimal resource cost.

Google gemma 2b
2B · 0K ctx · 1.2 GB
dense · Legacy
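
To check which of the twelve listed models fit a given GPU, the sizes above can be compared against a VRAM budget. The following is a minimal sketch, not the site's own calculator: the sizes are copied from the listing, while `headroom_gb` (memory reserved for the KV cache, activations, and the runtime) is an assumed placeholder value that in practice depends on context length and the inference engine.

```python
# (model name, listed size in GB), copied from the listing above.
MODELS = [
    ("Gemma 4 31B", 18.7),
    ("Gemma 4 26B A4B", 15.4),
    ("Gemma 3 27B", 16.5),
    ("Gemma 3 12B", 7.3),
    ("Gemma 4 E4B", 4.9),
    ("Gemma 4 E2B", 3.1),
    ("Gemma 3 4B", 2.4),
    ("Gemma 2 27B", 16.5),
    ("Gemma 2 9B", 5.5),
    ("Gemma 3 1B", 0.6),
    ("Gemma 2 2B", 1.2),
    ("gemma 2b", 1.2),
]


def models_that_fit(vram_gb: float, headroom_gb: float = 1.5) -> list[str]:
    """Return the listed models whose size fits in vram_gb minus headroom.

    headroom_gb is an assumption for illustration, not a value from the page.
    """
    budget = vram_gb - headroom_gb
    return [name for name, size_gb in MODELS if size_gb <= budget]


for vram in (4, 8, 24):
    fits = models_that_fit(vram)
    print(f"{vram} GB: {len(fits)} of {len(MODELS)} models fit")
```

Long contexts can need far more than the default headroom, so a model that passes this check may still run out of memory at its full listed context length.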