VOOZH about

URL: https://www.phoronix.com/news/Intel-llm-scaler-vllm-gpt-oss

⇱ Intel's LLM-Scaler Updated With OpenAI's GPT-OSS Model Support - Phoronix


👁 Phoronix

Intel's LLM-Scaler Updated With OpenAI's GPT-OSS Model Support

Written by Michael Larabel in Intel on 4 November 2025 at 06:06 AM EST. Add A Comment
Back in August was the announcement of LLM-Scaler as part of Project Battlematrix. LLM-Scaler is a new Intel software project to provide optimized AI inference capabilities on Intel graphics hardware. A new beta release of LLM-Scaler "llm-scaler-vllm" is now available with expanded LLM model coverage.

Since the original August debut there have been more releases of this Docker-based LLM-Scaler solution for delivering expanded model coverage and other new features geared for Battlemage GPUs. Out today is a new llm-scaler-vllm release to once again expand the scope of supported large language models.

The new version today is llm-scaler-vllm beta release 0.10.2-b5. Significant with this updated Docker image is now supporting OpenAI's GPT-OSS models for inferencing with Intel Arc (Pro) B-Series GPUs. The GPT-OSS support should now be in good shape with this LLM-Scaler solution for Intel GPUs.

👁 Intel Arc Pro B50 graphics card


The updated LLM-Scaler also now enables the Qwen3-VL series and Qwen3-Omni series models too. That's all for the listed changes with today's beta release.

Those wanting to grab the new Intel LLM-Scaler-vLLM beta release can find the details on GitHub.

Michael Larabel is the principal author of Phoronix.com and founded the site in 2004 with a focus on enriching the Linux hardware experience. Michael has written more than 20,000 articles covering the state of Linux hardware support, Linux performance, graphics drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated benchmarking software. He can be followed via Twitter, LinkedIn, or contacted via MichaelLarabel.com.