VOOZH about

URL: https://apxml.com/models/qwen35-27b

⇱ Qwen3.5-27B: Specifications and GPU VRAM Requirements


Qwen3.5-27B

Parameters

27B

Context Length

262K

Modality

Multimodal

Architecture

Dense

License

Apache 2.0

Release Date

24 Feb 2026

Knowledge Cutoff

-

Technical Specifications

Attention

Attention Structure

Grouped-Query Attention

Attention Heads

24

Key-Value Heads

4

Attention Head Dimension

256

Position Embedding

ROPE

RoPE Theta

10,000,000

Sliding Window Attention

No

Sliding Window Size

-

Normalization

RMS Normalization

Activation Function

SwigLU

Dimensions

Hidden Dimension Size

5,120

Number of Layers

64

FFN Intermediate Size (Dense)

17,408

Multi-Token Prediction Heads

1

Tokenizer

Vocabulary Size

248,320

Architecture Diagram

Qwen3.5-27B

Qwen3.5-27B is Alibaba Cloud's dense multimodal foundation model with 27B parameters, released February 2026. Unlike the MoE variants, it uses a dense architecture combining Gated Delta Networks and Feed Forward Networks. It achieves MMLU-Pro (86.1%), GPQA Diamond (85.5%), SWE-bench Verified (72.4%), and Terminal-Bench 2.0 (41.6%). Features unified vision-language capabilities, 262k native context (extensible to 1M), and excels across reasoning, coding, multimodal understanding, and multilingual tasks spanning 201 languages.

About Qwen 3.5

Qwen 3.5 is Alibaba Cloud's latest-generation foundation model family, released February 2026. It represents a significant leap forward, integrating breakthroughs in multimodal learning (unified vision-language foundation), efficient hybrid architecture (Gated Delta Networks with sparse Mixture-of-Experts), scalable reinforcement learning across million-agent environments, and global linguistic coverage spanning 201 languages. Available under Apache 2.0 license with open weights.


Other Qwen 3.5 Models

Evaluation Benchmarks

Rank

#53

BenchmarkScoreRank

General Text

Text Arena

1409

50

Web Development

WebDev Arena

1357

56

Rankings

Overall Rank

#53

Coding Rank

#65

Model Integrity

Total Score

B

69 / 100

GPU Requirements

Full Calculator

Choose the quantization method for model weights

Context Size: 1,024 tokens

1k
128k
256k

VRAM Required:

Recommended GPUs