Gemma 4 E2B

Parameters: 5.1B
Context Length: 128K
Modality: Multimodal
Architecture: Dense
License: Apache 2.0
Release Date: 2 Apr 2026
Knowledge Cutoff: (not listed)

Technical Specifications

Attention

Attention Structure: Grouped-Query Attention
Attention Heads: (not listed)
Key-Value Heads: (not listed)
Attention Head Dimension: 256
Position Embedding: RoPE
RoPE Theta: 10,000
Sliding Window Attention: Yes
Sliding Window Size: 512
Normalization: RMS Normalization
Activation Function: GELU
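To make the attention entries above concrete, the following is a minimal sketch of how a head dimension of 256, a RoPE theta of 10,000, and a sliding window of 512 are commonly combined. It is not taken from the Gemma 4 implementation. The function names are hypothetical, the pairing of adjacent channels for rotation is one of several conventions, and the window is assumed to count the query token itself.

```python
import math

HEAD_DIM = 256          # "Attention Head Dimension" from the spec list
ROPE_THETA = 10_000.0   # "RoPE Theta" from the spec list
WINDOW = 512            # "Sliding Window Size" from the spec list


def rope_inverse_frequencies(head_dim=HEAD_DIM, theta=ROPE_THETA):
    """Standard RoPE schedule: one rotation frequency per pair of channels."""
    return [theta ** (-2.0 * i / head_dim) for i in range(head_dim // 2)]


def rope_rotate(vec, position, inv_freq):
    """Rotate channel pairs (x0, x1), (x2, x3), ... by position * frequency.

    Assumption: adjacent channels are paired. Some implementations pair
    channel i with channel i + head_dim / 2 instead.
    """
    out = []
    for i, freq in enumerate(inv_freq):
        angle = position * freq
        c, s = math.cos(angle), math.sin(angle)
        x, y = vec[2 * i], vec[2 * i + 1]
        out.extend([x * c - y * s, x * s + y * c])
    return out


def can_attend(query_pos, key_pos, window=WINDOW):
    """Causal sliding window: a query sees itself and the previous
    window - 1 tokens (assumed convention)."""
    return 0 <= query_pos - key_pos < window
```

With these definitions, `can_attend(600, 89)` is `True` and `can_attend(600, 88)` is `False`, because the second key lies 512 positions behind the query. Rotating a query and a key by their positions makes their dot product depend only on the distance between them, which is the property RoPE is used for.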

Dimensions

Hidden Dimension Size: 6,144
Number of Layers: (not listed)
FFN Intermediate Size (Dense): 6,144
Multi-Token Prediction Heads: (not listed)

Tokenizer

Vocabulary Size: 262,144
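The specifications name RMS Normalization and GELU without giving their exact variants. As a rough illustration only, the sketch below shows the textbook forms of both. The epsilon value, the plain multiplicative weight, and the exact erf-based GELU (rather than a tanh approximation) are assumptions, and the function names are hypothetical.

```python
import math


def rms_norm(x, weight, eps=1e-6):
    """Scale x so its root-mean-square is about 1, then apply a learned weight.

    Assumption: eps and the plain `* weight` scaling are illustrative;
    the source does not state which variant the model uses.
    """
    mean_sq = sum(v * v for v in x) / len(x)
    scale = 1.0 / math.sqrt(mean_sq + eps)
    return [v * scale * w for v, w in zip(x, weight)]


def gelu(x):
    """Exact GELU: x times the standard normal CDF evaluated at x."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))
```

Unlike layer normalization, RMS normalization does not subtract the mean, so it needs only one statistic per vector.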

[Architecture diagram of Gemma 4 E2B; image not included]

Gemma 4 E2B is a compact model with 2.3B effective parameters (5.1B including Per-Layer Embeddings), designed for mobile and IoT devices. It accepts text, image, and audio input with a 128K-token context window and is intended to run on edge devices, offline and with low latency. It also includes a built-in reasoning mode and native function calling for agentic workflows.

About Gemma 4

Gemma 4 is Google DeepMind's most advanced open model family, built on Gemini 3 research and technology. The family includes both Dense and Mixture-of-Experts (MoE) architectures. The models are multimodal, handling text and images, with audio supported on the smaller variants, and offer context windows of up to 256K tokens. Gemma 4 targets reasoning, coding, and agentic workflows on hardware ranging from mobile devices to enterprise servers, and is released under the Apache 2.0 license.

Evaluation Benchmarks

No evaluation benchmarks are available for Gemma 4 E2B.

Rankings

Overall Rank: (not listed)
Coding Rank: (not listed)

Model Integrity

Total Score: 66 / 100

GPU Requirements

The required VRAM depends on the quantization method chosen for the model weights and on the context size. No VRAM figures or recommended GPUs are listed here.
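As a rough companion to the GPU requirements above, the sketch below estimates memory from the two inputs the page names: weight quantization and context size. It is a back-of-the-envelope calculation, not the page's calculator. The bytes-per-parameter table and the function names are assumptions, the result ignores activations and runtime overhead, and the KV-cache formula treats every layer as full attention, which overestimates layers that use the 512-token sliding window. The layer count and number of key-value heads are not listed in the specifications, so they are left as required arguments.

```python
# Assumed storage cost per weight for common precisions (illustrative).
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}

TOTAL_PARAMS = 5.1e9      # with Per-Layer Embeddings, from the description
EFFECTIVE_PARAMS = 2.3e9  # effective parameters, from the description
HEAD_DIM = 256            # attention head dimension, from the spec list


def weight_gb(num_params, precision):
    """Memory for the weights alone, in decimal gigabytes."""
    return num_params * BYTES_PER_PARAM[precision] / 1e9


def kv_cache_gb(num_layers, num_kv_heads, context_tokens,
                head_dim=HEAD_DIM, bytes_per_value=2.0):
    """Upper-bound KV cache size: one key and one value vector per
    layer, per KV head, per token. num_layers and num_kv_heads must be
    supplied by the caller because the source does not list them."""
    values = 2 * num_layers * num_kv_heads * head_dim * context_tokens
    return values * bytes_per_value / 1e9


for precision in BYTES_PER_PARAM:
    total = weight_gb(TOTAL_PARAMS, precision)
    effective = weight_gb(EFFECTIVE_PARAMS, precision)
    print(f"{precision}: {total:.2f} GB (all weights), "
          f"{effective:.2f} GB (effective parameters only)")
```

At 16-bit precision this gives about 10.2 GB for all 5.1B parameters and about 4.6 GB for the 2.3B effective parameters. Whether the Per-Layer Embeddings need to reside in GPU memory depends on the runtime, so the smaller figure is a lower bound under that assumption rather than a guaranteed requirement.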


URL: https://apxml.com/models/gemma-4-e2b

Gemma 4 E2B: Specifications and GPU VRAM Requirements
