VOOZH

URL: https://www.phoronix.com/news/Intel-LLM-Scaler-vLLM-1.3

⇱ Intel Releases LLM-Scaler-vLLM 1.3 With New LLM Model Support - Phoronix

👁 Phoronix

Articles & Reviews
News Archive
Forums
Premium
Contact
Categories

Computers Display Drivers Graphics Cards Linux Gaming Memory Motherboards Processors Software Storage Operating Systems Peripherals

Intel Releases LLM-Scaler-vLLM 1.3 With New LLM Model Support

Written by Michael Larabel in Intel on 30 January 2026 at 05:34 AM EST. Add A Comment

Intel today released the LLM-Scaler-vLLM 1.3 update with expanding the array of large language models that can run on Intel Arc Battlemage graphics cards with this Docker-based stack for deploying vLLM.

The new Intel llm-scaler-vllm 1.3 release via Docker and GitHub adds support for eight new models on capable Intel Arc Graphics hardware: Qwen3-Next-80B-A3B-Instruct, Qwen3-Next-80B-A3B-Thinking, InternVL3.5-30B-A3B, DeepSeek-OCR,PaddleOCR-VL, Seed-OSS-36B-Instruct, Qwen3-30B-A3B-Instruct-2507 and openai/whisper-large-v3.

In addition to those models, there is support for PaddleOCR models and GLM-4.6v-Flash support noted separately. There is also now sym_int4 support now for Qwen3-30B-A3B on TP 4/8 and Qwen3-235B-A22B on TP 16.

The LLM-Scaler-vLLM stack has upgraded against vLLM 0.11.1 and PyTorch 2.9. With the vLLM upgrade they have also enabled CPU KV cache offload, speculative decoding support with two more methods, experimental FP8 KV cache, and other enhancements.

Plus there are more bug fixes and other improvements with Intel LLM-Scaler-vLLM 1.3. Downloads and all the details via GitHub.

Add A Comment

Intel Compute Runtime Now Advertises Early Support For Nova Lake, Introduces Experimental "LEO"

Intel Performance Skills: New Open-Source Project Leveraging AI For Linux Performance Optimizations

Intel Ending Development Of BigDL: An Open-Source AI/LLM Effort Getting Axed

Intel Thermald 2.5.12 Released... With Initial Support For ARM

Intel's Open Image Denoise 2.5 Delivers Solid Performance Improvements For GPUs

Intel XPU Manager 2.0 Overhauls Windows & Linux Management For Arc Pro GPUs

Michael Larabel is the principal author of Phoronix.com and founded the site in 2004 with a focus on enriching the Linux hardware experience. Michael has written more than 20,000 articles covering the state of Linux hardware support, Linux performance, graphics drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated benchmarking software. He can be followed via Twitter, LinkedIn, or contacted via MichaelLarabel.com.

Arch Linux Now Believes Malware Incident Under Control: More Than 1,500 Affected Packages

ReactOS "Open-Source Windows" Reaches The Milestone Of Being Able To Run Half-Life

Arch Linux's AUR Sees More Than 400 Packages Compromised With Malware

Arch Linux AUR Hit By Another Wave Of Now More Sophisticated Malware Attack

Russian Spam & Profanities Are Now Plaguing The Arch Linux AUR

YSERVER: Modern X11 Server Written In Rust With The Help Of Claude Code

AMD Opens Pre-Orders For The Linux-Friendly Ryzen AI Halo Developer Platform

Linux 7.1 Released: New NTFS Driver, Intel FRED For Panther Lake, Faster Arc Graphics

Support Phoronix
While Having Ad-Free Browsing,
Single-Page Article Viewing

Legal Disclaimer, Privacy Policy, Cookies | | Contact
Copyright © 2004 - 2026 by Phoronix Media.
All trademarks used are properties of their respective owners. All rights reserved.