zentorch Quantized Models - LLM-Compressor v0.11.0 LLM-Compressor v0.11.0 quantized models for AMD EPYC CPU inference Text Generation • 11B • Updated about 16 hours ago
PARD-2 Text Generation • 0.8B • Updated 3 days ago • 931 Text Generation • 1B • Updated 6 days ago • 63 Text Generation • 0.8B • Updated 6 days ago • 68
Ryzen AI 1.7.1 — NPU LFM2 Models Liquid AI's Liquid Foundation (LFM2) ONNX based NPU models Updated Apr 19 • 29 • 1 Updated Apr 19 • 8 Updated Apr 19 • 26 • 1
Ryzen AI 1.7.1 — NPU 4K Ryzen AI 1.7.1 models supporting context length up to 4K Text Generation • Updated Mar 31 • 7 Text Generation • Updated Mar 31 • 9 Text Generation • Updated Mar 31 • 10 Text Generation • Updated Mar 31 • 7
Ryzen-AI-1.7.1 — SD Models Stable Diffusion models for AMD NPU Text-to-Image • Updated Feb 11 • 6 Text-to-Image • Updated Feb 24 • 1 Text-to-Image • Updated Feb 24 • 2 Text-to-Image • Updated Feb 24 • 1
Ryzen-AI-1.7-NPU-creativity-models Updated Jan 21 Updated Jan 21 Updated Feb 4 Image Segmentation • Updated Jan 21
Ryzen-AI-1.7-Hybrid-LLM Text Generation • Updated Jan 27 • 5 Text Generation • Updated Jan 27 Text Generation • Updated Jan 26 Updated Jan 26
SAND Text Generation • 33B • Updated Dec 6, 2025 • 15 • • 3 Text Generation • 33B • Updated Dec 6, 2025 • 20 • • 2 Viewer • Updated Dec 6, 2025 • 27.9k • 246 • 3 Viewer • Updated Oct 17, 2025 • 16.9k • 351 • 3
Ryzen AI Whisper NPU Optimized ONNX models Updated Jan 15 Updated Jan 30 Updated Jan 15 • 4 Updated Feb 10
Ryzen-AI-1.6-Hybrid-LLM Updated Oct 23, 2025 • 8 Updated Oct 23, 2025 • 8 • 2 Updated Oct 23, 2025 • 10 • 1 Updated Oct 23, 2025 • 15
Quark ByteDance Models 342B • Updated Dec 12, 2025 • 10 • 1 38B • Updated Nov 6, 2025 • 15.5k • 2 218B • Updated Nov 6, 2025 • 2.28k • 1
Dell Pro AI Studio Model for Dell Pro AI studio Updated Jul 30, 2025 • 4 Updated Nov 17, 2025 • 3 Updated Jul 17, 2025 • 2 Updated Oct 6, 2025 • 1
RyzenAI-1.5_LLM_Hybrid_Models Text Generation • Updated Aug 27, 2025 • 12 Text Generation • Updated Sep 16, 2025 • 12 Updated Sep 16, 2025 • 17 Text Generation • Updated Sep 16, 2025 • 11
Gumiho Official Model Parameters for "Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding" Paper • 2503.10135 • Published Mar 13, 2025 Updated Jun 12, 2025 Updated Jun 12, 2025 Updated Jun 12, 2025
OGA CPU LLM Collection This collection contains AMD-Quark quantized OGA exported models for CPU execution Updated Apr 12, 2025 Updated Apr 12, 2025 Text Generation • Updated Jan 30, 2025 Updated Apr 28, 2025
Quark Quantized DeepSeek Models 371B • Updated Apr 13 • 126k • 5 363B • Updated Nov 6, 2025 • 565 • 1 356B • Updated Feb 26 • 23.7k • 2 342B • Updated Dec 12, 2025 • 10 • 1
RyzenAI-1.4_LLM_NPU_Models Text Generation • Updated Aug 27, 2025 • 19 • 2 Text Generation • Updated Sep 16, 2025 • 15 • 3 Updated Sep 16, 2025 • 9 Text Generation • Updated Jun 28, 2025 • 8 • 1
AMD-HybridLM-Models ✨ AMD-HybridLM is a family of post-trained, highly efficient hybrid models, designed to combine performance with speed and memory efficiency. Updated Sep 23, 2025 • 251 Updated Sep 23, 2025 • 5 Updated Sep 23, 2025 • 4 Updated Sep 23, 2025 • 3
AMDGPU onnx optimized image generation ONNX models for AMD Ryzen (TM) AI GPUs and Radeon Discrete GPUs Text-to-Image • Updated Dec 17, 2025 • 4 Text-to-Image • Updated Dec 17, 2025 • 22 Updated Apr 3, 2025 • 3 Text-to-Image • Updated Apr 3, 2025 • 14
RyzenAI-1.3_LLM_Hybrid_Models Models quantized by Quark and prepared for the OGA-based hybrid execution flow (Ryzen AI 1.3) Text Generation • Updated Aug 27, 2025 • 12 Text Generation • Updated Sep 16, 2025 • 12 Updated Sep 16, 2025 • 17 Text Generation • Updated Sep 16, 2025 • 11
AMD-OLMo AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinct™ MI250 GPUs based on OLMo. Text Generation • Updated Nov 17, 2025 • 83 Text Generation • 1B • Updated Nov 17, 2025 • 30 • 25 Text Generation • 1B • Updated Nov 17, 2025 • 29 • 21 Text Generation • 1B • Updated Nov 17, 2025 • 60 • 23
Quark Quantized OCP FP8 Models 8B • Updated Dec 19, 2024 • 34.6k • 6 71B • Updated Dec 19, 2024 • 1.97k • 5 406B • Updated Dec 19, 2024 • 1.83k • 5 3B • Updated Dec 19, 2024 • 10.2k • 3
zentorch Quantized Models - LLM-Compressor v0.10.0.2 LLM-Compressor v0.10.0.2 quantized models for AMD EPYC CPU inference Text Generation • 21B • Updated 15 days ago • 468 Text Generation • 2B • Updated about 14 hours ago Text Generation • 7B • Updated about 14 hours ago
zentorch TorchAO Quantized Models - PyTorch 2.10 TorchAO quantized models for AMD EPYC CPU inference. The inference stack includes vLLM (0.15.0 to 0.18.0), PyTorch 2.10, and zentorch 5.2.1. Text Generation • Updated Apr 30 • 1.47k • 1 Image-Text-to-Text • Updated May 4 • 42 Text Generation • Updated May 4 • 1.4k Text Generation • Updated May 4 • 1.63k
Ryzen AI 1.7.1 — NPU 16K Ryzen AI 1.7.1 models supporting context length up to 16K Text Generation • Updated Mar 31 • 9 Text Generation • Updated Mar 31 • 8 Text Generation • Updated Mar 31 • 7 Text Generation • Updated Mar 31 • 9
Ryzen AI 1.7.1 — Hybrid Ryzen AI 1.7.1 hybrid (NPU and GPU) execution models Text Generation • Updated Mar 30 Text Generation • Updated Mar 31 Text Generation • Updated Mar 31 Text Generation • Updated Mar 31
LuminaSFT Viewer • Updated Mar 2 • 207k • 53 • 1 Viewer • Updated Mar 2 • 1.12M • 44 Viewer • Updated Mar 2 • 1.3M • 32 Viewer • Updated Mar 2 • 851k • 78
Micro-World Action-controlled Interactive world model. Updated Feb 5 • 7 • 3 Updated Feb 5 • 10 • 7 Viewer • Updated Feb 6 • 2k • 2.25k • 3
Ryzen-AI-1.7-NPU-LLM List will be updated Updated Jan 21 • 2 Updated Dec 13, 2025 • 2 Text Generation • Updated Jan 21 Text Generation • Updated Oct 23, 2025 • 31 • 1
ReasonLite 0.8B • Updated Jan 22 • 305 • 11 0.8B • Updated Jan 22 • 67 • 7 Viewer • Updated Jan 22 • 6.16M • 361 • 13
Hummingbird Hummingbird is a series of video generation models built on AMD Instinct™ GPUs, including text-to-video, image-to-videos models. Text-to-Video • Updated Mar 4, 2025 • 9 Updated Sep 8, 2025 • 9 Updated Feb 24 • 9
Ryzen-AI-1.6-NPU-LLM Text Generation • Updated Oct 23, 2025 • 31 • 1 Updated Oct 23, 2025 • 11 Text Generation • Updated Oct 8, 2025 • 12 Text Generation • Updated Oct 23, 2025 • 26
Quark Quantized Auto Mixed Precision (AMP) Models 55B • Updated Sep 26, 2025 • 5 37B • Updated Nov 3, 2025 • 23 6B • Updated Sep 26, 2025 • 3.94k • 2 11B • Updated Jan 21 • 7.01k • 2
OGA_DML_8_6_2025 Models are quantized using quark-0.9, transformers-4.50.0, OGA-0.7.1, ORT-1.21.1 followed by OGA-DML export. Text Generation • Updated Aug 8, 2025 Text Generation • Updated Aug 8, 2025
Quark Quantized PTPC FP8 Models PTPC model quantized by quark 31B • Updated Dec 24, 2025 • 9 • 1 236B • Updated Dec 24, 2025 • 7 671B • Updated Dec 24, 2025 • 5 684B • Updated Nov 28, 2025 • 19
RyzenAI-1.5_LLM_NPU_Models Text Generation • Updated Aug 27, 2025 • 19 • 2 Text Generation • Updated Sep 16, 2025 • 15 • 3 Updated Sep 16, 2025 • 9 Text Generation • Updated Jun 28, 2025 • 8 • 1
PARD Official Model Parameters for "PARD: Accelerating LLM Inference with Low-Cost PARallel Draft Model Adaptation" Text Generation • 1B • Updated May 19, 2025 • 17.9k • • 2 Text Generation • 2B • Updated May 19, 2025 • 29 • • 2 Text Generation • 0.6B • Updated May 19, 2025 • 57 • Text Generation • 0.8B • Updated Jul 9, 2025 • 3.67k • • 2
Quark Quantized MXFP4 Models 371B • Updated Apr 13 • 126k • 5 363B • Updated Nov 6, 2025 • 565 • 1 356B • Updated Feb 26 • 23.7k • 2 342B • Updated Dec 12, 2025 • 10 • 1
AMDGPU OnnxGenAI Collection ONNX GenAI compatible Language Models to run on AMD Ryzen(TM) GPUs and Radeon Discrete GPUs Updated Apr 8, 2025 Updated Apr 10, 2025 Updated Jul 29, 2025 Updated Jul 29, 2025
RyzenAI-1.4_LLM_Hybrid_Models Text Generation • Updated Aug 27, 2025 • 12 Text Generation • Updated Sep 16, 2025 • 12 Updated Sep 16, 2025 • 17 Text Generation • Updated Sep 16, 2025 • 11
Instella ✨ Announcing Instella, a series of 3 billion parameter language models developed by AMD, trained from scratch on 128 Instinct MI300X GPUs. Text Generation • 3B • Updated Nov 14, 2025 • 48 • 13 Text Generation • 3B • Updated Nov 14, 2025 • 110 • 42 Text Generation • 3B • Updated Nov 14, 2025 • 55 • 11 Text Generation • 3B • Updated Nov 14, 2025 • 728 • 59
AMD-RyzenAI-Deepseek-R1-Distill-Hybrid Updated Sep 16, 2025 • 23 • 1 Updated Jun 23, 2025 • 33 • 1 Updated Sep 16, 2025 • 9 • 4
RyzenAI-1.3_LLM_NPU_Models Models quantized by Quark and prepared for the OGA-based NPU-only execution flow (Ryzen AI 1.3) Text Generation • Updated Aug 27, 2025 • 19 • 2 Text Generation • Updated Sep 16, 2025 • 15 • 3 Updated Sep 16, 2025 • 9 Text Generation • Updated Jun 28, 2025 • 8 • 1
Nitro Diffusion 💥 Nitro Diffusion is a series of efficient text-to-image diffusion models built on AMD Instinct™ GPUs. Text-to-Image • Updated Jun 25, 2025 • 5 • 9 Text-to-Image • Updated Jun 25, 2025 • 18 • 6 Text-to-Image • Updated Jul 9, 2025 • 15 • 5 Text-to-Image • Updated Jul 9, 2025 • 10 • 7
Quark Quantized ONNX LLMs for Ryzen AI 1.3 EA ONNX Runtime generate() API based models quantized by Quark and optimized for Ryzen AI Strix Point NPU Text Generation • Updated Jun 28, 2025 • 8 • 1 Updated Sep 16, 2025 • 9 Text Generation • Updated Jun 28, 2025 • 10 Text Generation • Updated Jun 28, 2025 • 13 • 2
zentorch Quantized Models - LLM-Compressor v0.11.0 LLM-Compressor v0.11.0 quantized models for AMD EPYC CPU inference Text Generation • 11B • Updated about 16 hours ago
zentorch Quantized Models - LLM-Compressor v0.10.0.2 LLM-Compressor v0.10.0.2 quantized models for AMD EPYC CPU inference Text Generation • 21B • Updated 15 days ago • 468 Text Generation • 2B • Updated about 14 hours ago Text Generation • 7B • Updated about 14 hours ago
PARD-2 Text Generation • 0.8B • Updated 3 days ago • 931 Text Generation • 1B • Updated 6 days ago • 63 Text Generation • 0.8B • Updated 6 days ago • 68
zentorch TorchAO Quantized Models - PyTorch 2.10 TorchAO quantized models for AMD EPYC CPU inference. The inference stack includes vLLM (0.15.0 to 0.18.0), PyTorch 2.10, and zentorch 5.2.1. Text Generation • Updated Apr 30 • 1.47k • 1 Image-Text-to-Text • Updated May 4 • 42 Text Generation • Updated May 4 • 1.4k Text Generation • Updated May 4 • 1.63k
Ryzen AI 1.7.1 — NPU LFM2 Models Liquid AI's Liquid Foundation (LFM2) ONNX based NPU models Updated Apr 19 • 29 • 1 Updated Apr 19 • 8 Updated Apr 19 • 26 • 1
Ryzen AI 1.7.1 — NPU 16K Ryzen AI 1.7.1 models supporting context length up to 16K Text Generation • Updated Mar 31 • 9 Text Generation • Updated Mar 31 • 8 Text Generation • Updated Mar 31 • 7 Text Generation • Updated Mar 31 • 9
Ryzen AI 1.7.1 — NPU 4K Ryzen AI 1.7.1 models supporting context length up to 4K Text Generation • Updated Mar 31 • 7 Text Generation • Updated Mar 31 • 9 Text Generation • Updated Mar 31 • 10 Text Generation • Updated Mar 31 • 7
Ryzen AI 1.7.1 — Hybrid Ryzen AI 1.7.1 hybrid (NPU and GPU) execution models Text Generation • Updated Mar 30 Text Generation • Updated Mar 31 Text Generation • Updated Mar 31 Text Generation • Updated Mar 31
LuminaSFT Viewer • Updated Mar 2 • 207k • 53 • 1 Viewer • Updated Mar 2 • 1.12M • 44 Viewer • Updated Mar 2 • 1.3M • 32 Viewer • Updated Mar 2 • 851k • 78
Ryzen-AI-1.7.1 — SD Models Stable Diffusion models for AMD NPU Text-to-Image • Updated Feb 11 • 6 Text-to-Image • Updated Feb 24 • 1 Text-to-Image • Updated Feb 24 • 2 Text-to-Image • Updated Feb 24 • 1
Micro-World Action-controlled Interactive world model. Updated Feb 5 • 7 • 3 Updated Feb 5 • 10 • 7 Viewer • Updated Feb 6 • 2k • 2.25k • 3
Ryzen-AI-1.7-NPU-creativity-models Updated Jan 21 Updated Jan 21 Updated Feb 4 Image Segmentation • Updated Jan 21
Ryzen-AI-1.7-NPU-LLM List will be updated Updated Jan 21 • 2 Updated Dec 13, 2025 • 2 Text Generation • Updated Jan 21 Text Generation • Updated Oct 23, 2025 • 31 • 1
Ryzen-AI-1.7-Hybrid-LLM Text Generation • Updated Jan 27 • 5 Text Generation • Updated Jan 27 Text Generation • Updated Jan 26 Updated Jan 26
ReasonLite 0.8B • Updated Jan 22 • 305 • 11 0.8B • Updated Jan 22 • 67 • 7 Viewer • Updated Jan 22 • 6.16M • 361 • 13
SAND Text Generation • 33B • Updated Dec 6, 2025 • 15 • • 3 Text Generation • 33B • Updated Dec 6, 2025 • 20 • • 2 Viewer • Updated Dec 6, 2025 • 27.9k • 246 • 3 Viewer • Updated Oct 17, 2025 • 16.9k • 351 • 3
Hummingbird Hummingbird is a series of video generation models built on AMD Instinct™ GPUs, including text-to-video, image-to-videos models. Text-to-Video • Updated Mar 4, 2025 • 9 Updated Sep 8, 2025 • 9 Updated Feb 24 • 9
Ryzen AI Whisper NPU Optimized ONNX models Updated Jan 15 Updated Jan 30 Updated Jan 15 • 4 Updated Feb 10
Ryzen-AI-1.6-NPU-LLM Text Generation • Updated Oct 23, 2025 • 31 • 1 Updated Oct 23, 2025 • 11 Text Generation • Updated Oct 8, 2025 • 12 Text Generation • Updated Oct 23, 2025 • 26
Ryzen-AI-1.6-Hybrid-LLM Updated Oct 23, 2025 • 8 Updated Oct 23, 2025 • 8 • 2 Updated Oct 23, 2025 • 10 • 1 Updated Oct 23, 2025 • 15
Quark Quantized Auto Mixed Precision (AMP) Models 55B • Updated Sep 26, 2025 • 5 37B • Updated Nov 3, 2025 • 23 6B • Updated Sep 26, 2025 • 3.94k • 2 11B • Updated Jan 21 • 7.01k • 2
Quark ByteDance Models 342B • Updated Dec 12, 2025 • 10 • 1 38B • Updated Nov 6, 2025 • 15.5k • 2 218B • Updated Nov 6, 2025 • 2.28k • 1
OGA_DML_8_6_2025 Models are quantized using quark-0.9, transformers-4.50.0, OGA-0.7.1, ORT-1.21.1 followed by OGA-DML export. Text Generation • Updated Aug 8, 2025 Text Generation • Updated Aug 8, 2025
Dell Pro AI Studio Model for Dell Pro AI studio Updated Jul 30, 2025 • 4 Updated Nov 17, 2025 • 3 Updated Jul 17, 2025 • 2 Updated Oct 6, 2025 • 1
Quark Quantized PTPC FP8 Models PTPC model quantized by quark 31B • Updated Dec 24, 2025 • 9 • 1 236B • Updated Dec 24, 2025 • 7 671B • Updated Dec 24, 2025 • 5 684B • Updated Nov 28, 2025 • 19
RyzenAI-1.5_LLM_Hybrid_Models Text Generation • Updated Aug 27, 2025 • 12 Text Generation • Updated Sep 16, 2025 • 12 Updated Sep 16, 2025 • 17 Text Generation • Updated Sep 16, 2025 • 11
RyzenAI-1.5_LLM_NPU_Models Text Generation • Updated Aug 27, 2025 • 19 • 2 Text Generation • Updated Sep 16, 2025 • 15 • 3 Updated Sep 16, 2025 • 9 Text Generation • Updated Jun 28, 2025 • 8 • 1
Gumiho Official Model Parameters for "Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding" Paper • 2503.10135 • Published Mar 13, 2025 Updated Jun 12, 2025 Updated Jun 12, 2025 Updated Jun 12, 2025
PARD Official Model Parameters for "PARD: Accelerating LLM Inference with Low-Cost PARallel Draft Model Adaptation" Text Generation • 1B • Updated May 19, 2025 • 17.9k • • 2 Text Generation • 2B • Updated May 19, 2025 • 29 • • 2 Text Generation • 0.6B • Updated May 19, 2025 • 57 • Text Generation • 0.8B • Updated Jul 9, 2025 • 3.67k • • 2
OGA CPU LLM Collection This collection contains AMD-Quark quantized OGA exported models for CPU execution Updated Apr 12, 2025 Updated Apr 12, 2025 Text Generation • Updated Jan 30, 2025 Updated Apr 28, 2025
Quark Quantized MXFP4 Models 371B • Updated Apr 13 • 126k • 5 363B • Updated Nov 6, 2025 • 565 • 1 356B • Updated Feb 26 • 23.7k • 2 342B • Updated Dec 12, 2025 • 10 • 1
Quark Quantized DeepSeek Models 371B • Updated Apr 13 • 126k • 5 363B • Updated Nov 6, 2025 • 565 • 1 356B • Updated Feb 26 • 23.7k • 2 342B • Updated Dec 12, 2025 • 10 • 1
AMDGPU OnnxGenAI Collection ONNX GenAI compatible Language Models to run on AMD Ryzen(TM) GPUs and Radeon Discrete GPUs Updated Apr 8, 2025 Updated Apr 10, 2025 Updated Jul 29, 2025 Updated Jul 29, 2025
RyzenAI-1.4_LLM_NPU_Models Text Generation • Updated Aug 27, 2025 • 19 • 2 Text Generation • Updated Sep 16, 2025 • 15 • 3 Updated Sep 16, 2025 • 9 Text Generation • Updated Jun 28, 2025 • 8 • 1
RyzenAI-1.4_LLM_Hybrid_Models Text Generation • Updated Aug 27, 2025 • 12 Text Generation • Updated Sep 16, 2025 • 12 Updated Sep 16, 2025 • 17 Text Generation • Updated Sep 16, 2025 • 11
Instella ✨ Announcing Instella, a series of 3 billion parameter language models developed by AMD, trained from scratch on 128 Instinct MI300X GPUs. Text Generation • 3B • Updated Nov 14, 2025 • 48 • 13 Text Generation • 3B • Updated Nov 14, 2025 • 110 • 42 Text Generation • 3B • Updated Nov 14, 2025 • 55 • 11 Text Generation • 3B • Updated Nov 14, 2025 • 728 • 59
AMD-HybridLM-Models ✨ AMD-HybridLM is a family of post-trained, highly efficient hybrid models, designed to combine performance with speed and memory efficiency. Updated Sep 23, 2025 • 251 Updated Sep 23, 2025 • 5 Updated Sep 23, 2025 • 4 Updated Sep 23, 2025 • 3
AMD-RyzenAI-Deepseek-R1-Distill-Hybrid Updated Sep 16, 2025 • 23 • 1 Updated Jun 23, 2025 • 33 • 1 Updated Sep 16, 2025 • 9 • 4
AMDGPU onnx optimized image generation ONNX models for AMD Ryzen (TM) AI GPUs and Radeon Discrete GPUs Text-to-Image • Updated Dec 17, 2025 • 4 Text-to-Image • Updated Dec 17, 2025 • 22 Updated Apr 3, 2025 • 3 Text-to-Image • Updated Apr 3, 2025 • 14
RyzenAI-1.3_LLM_NPU_Models Models quantized by Quark and prepared for the OGA-based NPU-only execution flow (Ryzen AI 1.3) Text Generation • Updated Aug 27, 2025 • 19 • 2 Text Generation • Updated Sep 16, 2025 • 15 • 3 Updated Sep 16, 2025 • 9 Text Generation • Updated Jun 28, 2025 • 8 • 1
RyzenAI-1.3_LLM_Hybrid_Models Models quantized by Quark and prepared for the OGA-based hybrid execution flow (Ryzen AI 1.3) Text Generation • Updated Aug 27, 2025 • 12 Text Generation • Updated Sep 16, 2025 • 12 Updated Sep 16, 2025 • 17 Text Generation • Updated Sep 16, 2025 • 11
Nitro Diffusion 💥 Nitro Diffusion is a series of efficient text-to-image diffusion models built on AMD Instinct™ GPUs. Text-to-Image • Updated Jun 25, 2025 • 5 • 9 Text-to-Image • Updated Jun 25, 2025 • 18 • 6 Text-to-Image • Updated Jul 9, 2025 • 15 • 5 Text-to-Image • Updated Jul 9, 2025 • 10 • 7
AMD-OLMo AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinct™ MI250 GPUs based on OLMo. Text Generation • Updated Nov 17, 2025 • 83 Text Generation • 1B • Updated Nov 17, 2025 • 30 • 25 Text Generation • 1B • Updated Nov 17, 2025 • 29 • 21 Text Generation • 1B • Updated Nov 17, 2025 • 60 • 23
Quark Quantized ONNX LLMs for Ryzen AI 1.3 EA ONNX Runtime generate() API based models quantized by Quark and optimized for Ryzen AI Strix Point NPU Text Generation • Updated Jun 28, 2025 • 8 • 1 Updated Sep 16, 2025 • 9 Text Generation • Updated Jun 28, 2025 • 10 Text Generation • Updated Jun 28, 2025 • 13 • 2
Quark Quantized OCP FP8 Models 8B • Updated Dec 19, 2024 • 34.6k • 6 71B • Updated Dec 19, 2024 • 1.97k • 5 406B • Updated Dec 19, 2024 • 1.83k • 5 3B • Updated Dec 19, 2024 • 10.2k • 3