Voozh

AI & ML interests

None defined yet.

Recent Activity

👁 Image

ColinZ22 updated a model about 3 hours ago

amd/GLM-5.2-MXFP4

👁 Image

bowenbaoamd published a model about 4 hours ago

amd/GLM-5.2-MXFP4

👁 Image

linzhao-amd updated a model about 5 hours ago

amd/GLM-5.2-MXFP4

View all activity

Papers

👁 Image

Stabilizing Efficient Reasoning with Step-Level Advantage Selection

👁 Image

Dynamic Chunking Diffusion Transformer

View all Papers

Articles

Join the AMD Open Robotics Hackathon

Nov 13, 2025

• 16

amd 's collections 47

zentorch Quantized Models - LLM-Compressor v0.11.0

LLM-Compressor v0.11.0 quantized models for AMD EPYC CPU inference

Text Generation • 11B • Updated about 16 hours ago

PARD-2

Ryzen AI 1.7.1 — NPU LFM2 Models

Liquid AI's Liquid Foundation (LFM2) ONNX based NPU models

Ryzen AI 1.7.1 — NPU 4K

Ryzen AI 1.7.1 models supporting context length up to 4K

Ryzen-AI-1.7-NPU-LLM_V2

Ryzen-AI-1.7.1 — SD Models

Stable Diffusion models for AMD NPU

Ryzen-AI-1.7-NPU-creativity-models

Ryzen-AI-1.7-Hybrid-LLM

SAND

Ryzen AI Whisper NPU Optimized ONNX models

Ryzen-AI-1.6-Hybrid-LLM

Quark ByteDance Models

Dell Pro AI Studio

Model for Dell Pro AI studio

RyzenAI-1.5_LLM_Hybrid_Models

Gumiho

Official Model Parameters for "Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding"

OGA CPU LLM Collection

This collection contains AMD-Quark quantized OGA exported models for CPU execution

Quark Quantized DeepSeek Models

RyzenAI-1.4_LLM_NPU_Models

Instella-VL✨

1B • Updated Mar 7, 2025 • 177 • 8

AMD-HybridLM-Models ✨

AMD-HybridLM is a family of post-trained, highly efficient hybrid models, designed to combine performance with speed and memory efficiency.

AMDGPU onnx

optimized image generation ONNX models for AMD Ryzen (TM) AI GPUs and Radeon Discrete GPUs

RyzenAI-1.3_LLM_Hybrid_Models

Models quantized by Quark and prepared for the OGA-based hybrid execution flow (Ryzen AI 1.3)

AMD-OLMo

AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinct™ MI250 GPUs based on OLMo.

Quark Quantized OCP FP8 Models

zentorch Quantized Models - LLM-Compressor v0.10.0.2

LLM-Compressor v0.10.0.2 quantized models for AMD EPYC CPU inference

zentorch TorchAO Quantized Models - PyTorch 2.10

TorchAO quantized models for AMD EPYC CPU inference. The inference stack includes vLLM (0.15.0 to 0.18.0), PyTorch 2.10, and zentorch 5.2.1.

Ryzen AI 1.7.1 — NPU 16K

Ryzen AI 1.7.1 models supporting context length up to 16K

Ryzen AI 1.7.1 — Hybrid

Ryzen AI 1.7.1 hybrid (NPU and GPU) execution models

LuminaSFT

Micro-World

Action-controlled Interactive world model.

Hummingbird is a series of video generation models built on AMD Instinct™ GPUs, including text-to-video, image-to-videos models.

Ryzen-AI-1.6-NPU-LLM

Quark Quantized Auto Mixed Precision (AMP) Models

OGA_DML_8_6_2025

Models are quantized using quark-0.9, transformers-4.50.0, OGA-0.7.1, ORT-1.21.1 followed by OGA-DML export.

Quark Quantized PTPC FP8 Models

PTPC model quantized by quark

RyzenAI-1.5_LLM_NPU_Models

PARD

Official Model Parameters for "PARD: Accelerating LLM Inference with Low-Cost PARallel Draft Model Adaptation"

Quark Quantized MXFP4 Models

AMDGPU OnnxGenAI

Collection ONNX GenAI compatible Language Models to run on AMD Ryzen(TM) GPUs and Radeon Discrete GPUs

RyzenAI-1.4_LLM_Hybrid_Models

Instella ✨

Announcing Instella, a series of 3 billion parameter language models developed by AMD, trained from scratch on 128 Instinct MI300X GPUs.

AMD-RyzenAI-Deepseek-R1-Distill-Hybrid

RyzenAI-1.3_LLM_NPU_Models

Models quantized by Quark and prepared for the OGA-based NPU-only execution flow (Ryzen AI 1.3)

Nitro Diffusion 💥

Nitro Diffusion is a series of efficient text-to-image diffusion models built on AMD Instinct™ GPUs.

Quark Quantized ONNX LLMs for Ryzen AI 1.3 EA

ONNX Runtime generate() API based models quantized by Quark and optimized for Ryzen AI Strix Point NPU

zentorch Quantized Models - LLM-Compressor v0.11.0

LLM-Compressor v0.11.0 quantized models for AMD EPYC CPU inference

Text Generation • 11B • Updated about 16 hours ago

zentorch Quantized Models - LLM-Compressor v0.10.0.2

LLM-Compressor v0.10.0.2 quantized models for AMD EPYC CPU inference

PARD-2

zentorch TorchAO Quantized Models - PyTorch 2.10

TorchAO quantized models for AMD EPYC CPU inference. The inference stack includes vLLM (0.15.0 to 0.18.0), PyTorch 2.10, and zentorch 5.2.1.

Ryzen AI 1.7.1 — NPU LFM2 Models

Liquid AI's Liquid Foundation (LFM2) ONNX based NPU models

Ryzen AI 1.7.1 — NPU 16K

Ryzen AI 1.7.1 models supporting context length up to 16K

Ryzen AI 1.7.1 — NPU 4K

Ryzen AI 1.7.1 models supporting context length up to 4K

Ryzen AI 1.7.1 — Hybrid

Ryzen AI 1.7.1 hybrid (NPU and GPU) execution models

Ryzen-AI-1.7-NPU-LLM_V2

LuminaSFT

Ryzen-AI-1.7.1 — SD Models

Stable Diffusion models for AMD NPU

Micro-World

Action-controlled Interactive world model.

Ryzen-AI-1.7-NPU-creativity-models

Ryzen-AI-1.7-NPU-LLM

List will be updated

Ryzen-AI-1.7-Hybrid-LLM

ReasonLite

SAND

Hummingbird

Hummingbird is a series of video generation models built on AMD Instinct™ GPUs, including text-to-video, image-to-videos models.

Ryzen AI Whisper NPU Optimized ONNX models

Ryzen-AI-1.6-NPU-LLM

Ryzen-AI-1.6-Hybrid-LLM

Quark Quantized Auto Mixed Precision (AMP) Models

Quark ByteDance Models

OGA_DML_8_6_2025

Models are quantized using quark-0.9, transformers-4.50.0, OGA-0.7.1, ORT-1.21.1 followed by OGA-DML export.

Dell Pro AI Studio

Model for Dell Pro AI studio

Quark Quantized PTPC FP8 Models

PTPC model quantized by quark

RyzenAI-1.5_LLM_Hybrid_Models

RyzenAI-1.5_LLM_NPU_Models

Gumiho

Official Model Parameters for "Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding"

PARD

Official Model Parameters for "PARD: Accelerating LLM Inference with Low-Cost PARallel Draft Model Adaptation"

OGA CPU LLM Collection

This collection contains AMD-Quark quantized OGA exported models for CPU execution

Quark Quantized MXFP4 Models

Quark Quantized DeepSeek Models

AMDGPU OnnxGenAI

Collection ONNX GenAI compatible Language Models to run on AMD Ryzen(TM) GPUs and Radeon Discrete GPUs

RyzenAI-1.4_LLM_NPU_Models

RyzenAI-1.4_LLM_Hybrid_Models

Instella-VL✨

1B • Updated Mar 7, 2025 • 177 • 8

Instella ✨

Announcing Instella, a series of 3 billion parameter language models developed by AMD, trained from scratch on 128 Instinct MI300X GPUs.

AMD-HybridLM-Models ✨

AMD-HybridLM is a family of post-trained, highly efficient hybrid models, designed to combine performance with speed and memory efficiency.

AMD-RyzenAI-Deepseek-R1-Distill-Hybrid

AMDGPU onnx

optimized image generation ONNX models for AMD Ryzen (TM) AI GPUs and Radeon Discrete GPUs

RyzenAI-1.3_LLM_NPU_Models

Models quantized by Quark and prepared for the OGA-based NPU-only execution flow (Ryzen AI 1.3)

RyzenAI-1.3_LLM_Hybrid_Models

Models quantized by Quark and prepared for the OGA-based hybrid execution flow (Ryzen AI 1.3)

Nitro Diffusion 💥

Nitro Diffusion is a series of efficient text-to-image diffusion models built on AMD Instinct™ GPUs.

AMD-OLMo

AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinct™ MI250 GPUs based on OLMo.

Quark Quantized ONNX LLMs for Ryzen AI 1.3 EA

ONNX Runtime generate() API based models quantized by Quark and optimized for Ryzen AI Strix Point NPU

Quark Quantized OCP FP8 Models

URL: https://huggingface.co/amd/collections

⇱ amd (AMD)

AI & ML interests

Recent Activity

Papers

Articles

Join the AMD Open Robotics Hackathon

amd 's collections 47