Qwen Unveils 480B Coder LLM and New Command-Line Tool for Local Use

Allan Witt • Jul 23, 2025 at 8:12am PDT

💬 0 Comments

In a significant development for the AI community, the Qwen team has announced the release of its most powerful open agentic code model to date, the Qwen3-Coder-480B-A35B-Instruct. Alongside this powerful new model, the team has also open-sourced Qwen Code, a new AI-powered command-line interface designed to fully leverage the model’s capabilities.

Meet the Qwen3 Coder 480B Model

The main highlight of the announcement is the Qwen3-Coder-480B-A35B-Instruct model. This isn’t a typical dense model; it utilizes a Mixture-of-Experts (MoE) architecture. It has a total of 480 billion parameters, but only activates 35 billion of them during any given task, offering a balance between massive scale and computational efficiency.

One of its most notable features is its extensive context window. The model natively supports a 256,000-token context, which can be scaled up to one million tokens using YaRN. This allows the model to handle and reason over entire code repositories, a significant capability for complex development tasks. The announcement highlights its state-of-the-art performance on challenging agent coding benchmarks like SWE-Bench Verified.

The release of a 480-billion parameter model naturally brings up the topic of local hardware. The Mixture-of-Experts design helps to make it more manageable than a dense model of the same size, but understanding the specific VRAM and system memory needed to run quantized versions is critical for any enthusiast looking to deploy it on their own local hardware.

Qwen Code: A New AI-Powered CLI

To complement the new model, the team also launched Qwen Code, a command-line tool for agentic coding. The tool is an adaptation of the open-source Gemini CLI and has been modified with custom prompts and function-calling protocols specifically to unlock the capabilities of the Qwen3 Coder model.

For local LLM enthusiasts, setting up Qwen Code is a straightforward process. It requires NodeJS to be installed, after which the tool can be installed globally with a single npm command. Configuration involves setting environment variables for an API key, which can be sourced from services like OpenRouter that provide access to the Qwen3 Coder model.

Practical Use and Initial Impressions

The purpose of Qwen Code is to provide a seamless interface for developers to interact with the powerful Qwen3 Coder model directly in their terminal. Initial demonstrations show its ability to handle tasks like generating simple applications from scratch, such as an HTML-based game, and creating multi-file landing pages with HTML, CSS, and JavaScript.

The tool can also analyze entire project structures, reading through multiple files to provide a comprehensive overview of a codebase’s architecture and key components. This showcases its utility not just for code generation, but also for understanding and working with existing projects. While early use has shown some multi-step tasks may require patience or refined prompts, the overall system presents a powerful new option for developers who prefer a terminal-centric workflow. The tool also inherits many features from the CLI it was forked from, providing a familiar foundation for many users.

👁 Google
Set as Preferred Source

👁 ms maia chip for llm inference in data center

URL: https://www.hardware-corner.net/qwen-unveils-480b-coder-local-llm-20250723/