Voozh

AI & ML interests

None defined yet.

Recent Activity

👁 Image

rishitdagli submitted a paper about 5 hours ago

Adaptive Volumetric Mechanical Property Fields Invariant to Resolution

👁 Image

taesiri submitted a paper about 5 hours ago

ENPIRE: Agentic Robot Policy Self-Improvement in the Real World

👁 Image

mwatson-nvidia updated a Space about 13 hours ago

nvidia/AlpasimE2EClosedLoopChallenge2026

View all activity

Papers

👁 Image

ENPIRE: Agentic Robot Policy Self-Improvement in the Real World

👁 Image

Adaptive Volumetric Mechanical Property Fields Invariant to Resolution

View all Papers

Articles

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

15 days ago

• 11

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

15 days ago

• 57

Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining

15 days ago

• 17

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

18 days ago

• 83

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

27 days ago

• 34

Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation

May 18

• 21

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

Apr 28

• 62

Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding

Mar 19

• 47

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

Mar 17

• 66

The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare Robotics

Mar 16

• 31

Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline

Mar 13

• 40

Build an Agent That Thinks Like a Data Scientist: How We Hit #1 on DABStep with Reusable Tool Generation

Mar 13

• 18

How NVIDIA AI-Q Reached \#1 on DeepResearch Bench I and II

Mar 12

• 33

Code Concepts: A Large-Scale Synthetic Dataset Generated from Programming Concept Seeds

Mar 11

• 6

How NVIDIA Builds Open Data for AI

Mar 10

• 16

Deploying Open Source Vision Language Models (VLM) on Jetson

Feb 24

• 37

「データ不足」の壁を越える：合成ペルソナが日本のAI開発を加速

Feb 19

• 3

From Scarcity to Scale: How Synthetic Personas Can Bootstrap Japanese AI Development

Feb 19

• 3

NVIDIA Nemotron 2 Nano 9B Japanese: 日本のソブリンAIを支える最先端小規模言語モデル

Feb 17

• 25

NVIDIA Nemotron 2 Nano 9B Japanese: State-of-the-Art Small Language Model Customized for Japanese Sovereign AI

Feb 17

• 3

Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model

Feb 4

• 28

Introducing NVIDIA Cosmos Policy for Advanced Robot Control

Jan 29

• 48

Nemotron-Personas-Brazil: Co-Designed Data for Sovereign AI

Jan 28

• 11

Nemotron-Personas-Singapore: Co-Designed Data for Sovereign AI

Jan 27

• 10

NVIDIA Earth-2 Open Models Span the Whole Weather Stack

Jan 26

• 36

Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models

Jan 6

• 28

Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot

Jan 5

• 25

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

Jan 5

• 64

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

Jan 5

• 87

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

Dec 17, 2025

• 50

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

Dec 15, 2025

• 111

Custom Policy Enforcement with Reasoning: Faster, Safer AI Applications

Dec 2, 2025

• 26

How to Build a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac for Healthcare

Oct 28, 2025

• 20

🛡️ Nemotron PII: Synthesized Data for Privacy-Preserving AI

Oct 28, 2025

• 35

Nemotron-Personas-USA: Synthesized Data for Sovereign AI

Oct 28, 2025

• 12

NVIDIA Isaac GR00T in LeRobot

Oct 28, 2025

• 29

Can Your LLM Think Like a Professional? Introducing ProfBench

Oct 28, 2025

• 21

NVIDIA Releases 8 Million Sample Open Dataset and Tooling for OCR, Image Reasoning, Image and Video QA Tasks

Oct 28, 2025

• 17

Cosmos Predict 2.5 & Transfer 2.5: Evolving the World Foundation Models for Physical AI

Oct 28, 2025

• 21

Nemotron’s Open Secret: Accelerating AI Development with Open Models, Data, and Recipes

Oct 22, 2025

• 11

Llama‑Embed‑Nemotron‑8B Text Embedding Model Ranks First on Multilingual MTEB Leaderboard

Oct 21, 2025

• 14

Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models

Oct 20, 2025

• 19

Nemotron-Personas-India: Synthesized Data for Sovereign AI

Oct 13, 2025

• 14

Nemotron-Personas-Japan: ソブリン AI のための合成データセット

Sep 26, 2025

• 10

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

Sep 23, 2025

• 27

NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset

Aug 20, 2025

• 19

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

Aug 18, 2025

• 32

📢 NVIDIA Releases Nemotron-CC-Math Pre-Training Dataset: A High-Quality, Web-Scale Math Corpus for Pretraining Large Language Models

Aug 18, 2025

• 5

NVIDIA Releases Improved Pretraining Dataset: Preserves High Value Math & Code, and Augments with Multi-Lingual

Aug 18, 2025

• 4

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

Aug 11, 2025

• 76

Measuring Open-Source Llama Nemotron Models on DeepResearch Bench

Aug 4, 2025

• 5

Accelerate a World of LLMs on Hugging Face with NVIDIA NIM

Jul 21, 2025

• 5

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

Jul 18, 2025

• 51

Llama-NeMoRetriever-ColEmbed: Developer-Focused Guide to NVIDIA's State-of-the-Art Text-Image Retrieval

Jul 9, 2025

• 4

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

Jun 27, 2025

• 31

Introducing Cosmos Predict-2: A Foundation For Your Own World Model

Jun 17, 2025

• 9

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

Jun 11, 2025

• 134

Supercharge Edge AI with High Accuracy Reasoning Using Llama Nemotron Nano 4B

Jun 10, 2025

• 7

Nemotron-Personas: Improve AI Training With the First Synthetic Personas Dataset Aligned to Real-World Distributions

Jun 10, 2025

• 25

Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes

Jun 4, 2025

• 23

Mastering Long Contexts in LLMs with KVPress

Jan 23, 2025

• 77

View all articles

nvidia 's collections 116

NVIDIA Cosmos

NVIDIA OmniDreams

NVIDIA OmniDreams model checkpoints and sample datasets.

Nemotron-Labs-Diffusion

A Tri-Mode Language Model Family Unifying Autoregressive, Diffusion, and Self-Speculation Decoding

Nemotron-Labs-Elastic

Cosmos1

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

AnyFlow

Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

Nemotron Vision-Language

Image-text paired datasets for building vision-language models (VLMs).

Nemotron Safety & Content Moderation

Datasets for building safe models with refusals, content moderation, PII detection, agentic safety, and audio safety capabilities.

Nemotron Code & SWE

Datasets for building models that write, debug, and reason about code. Covers competitive programming, software engineering, and code pretraining.

Nemotron Reward Modeling

Human preference data, reward model training sets, and generative reward modeling data for training Nemotron reward models.

NVIDIA Ising

NVIDIA Ising is a new Model Family to enable building useful Quantum Computers with AI.

PixelDiT

NvPanoptix-3D

3D panoptic reconstruction segmentation model

BioNeMo - Design

NVIDIA BioNeMo Models for Design

GR00T-N1.6

NVIDIA Isaac GR00T N1.6 open vision-language-action (VLA) model for generalized humanoid

Cosmos3

Omnimodal World Models for Physical AI

Nemotron Speech

Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S

Nemotron OCR and Object Detection

Steering Reasoning VLAs

Steering Reasoning VLA in robotics manipulation https://www.arxiv.org/abs/2510.16281

Cosmos2

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

Nemotron-Post-Training-v3

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3.

Nemotron RAG

Set of tools to build retrieval-augmented generation (RAG) systems, improve search and ranking accuracy, and extract structured data from complex docs

Cosmos Policy

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/c

NVIDIA Nemotron V2

Open, Production-ready Enterprise Models. Nvidia Open Model license.

ChronoEdit

ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation

Cosmos-Drive-Dreams

A collection of tokenizers, diffusion models, and datasets relevant to the cosmos-drive-dreams platform.

Llama Nemotron

Open, Production-ready Enterprise Models

BioNeMo - Understand

NVIDIA BioNeMo Models for Understanding Biology

Cosmos-Predict2.5

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

Nemotron-Personas

A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions.

GEN3C

3D-Informed World-Consistent Video Generation with Precise Camera Control

BioNeMo

Accelerated models for digital biology by the NVIDIA BioNeMo team. https://www.nvidia.com/en-us/clara/biopharma/

OpenReasoning-Nemotron

Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science.

Audio2Face-3D

Open-weight Audio2Face-3D and Audio2Emotion networks and a sample dataset for training and evaluation

Cosmos-Transfer2.5

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

Updated Feb 12 • 8.22k • 67

Nemotron-H

Mamba-Transformer hybrid models

Describe Anything

Multimodal Large Language Models for Detailed Localized Image and Video Captioning

OpenMathReasoning

Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset"

OpenCodeReasoning-II

Reasoning data for supervised finetuning of LLMs to advance code generation and critique

Scoring Verifiers

Benchmarks for evaluating synthetic verifiers like test case generation and code reward models (as found in https://www.arxiv.org/abs/2502.13820).

Cosmos-Reason1

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

Cosmos-Predict1

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos-predict2

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024.

QLIP

QLIP is a family of image tokenizers with SOTA reconstruction quality and zero-shot image understanding.

DMC

LLMs equipped with Dynamic Memory Compression to accelerate generation.

NemoGuard

Essential datasets and models for content safety, topic-following, and security guardrails

NeMo Audio Codecs

A series of Neural Audio Codecs

Optimized ONNX models for NVIDIA RTX GPUs

Collection of optimized ONNX model checkpoints for NVIDIA RTX GPUs

OpenMath-2

A collection of models and datasets introduced in "OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data"

SteerLM

A collection of models and datasets relating to SteerLM and HelpSteer.

Canary ASR/AST

A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤

OpenMath

A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset"

NV-Embed

NV-Embed is a generalist embedding model encompassing retrieval, reranking, classification, clustering, STS tasks.

SSMs

A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers.

BigVGAN

BigVGAN is a universal neural vocoder that generates audio waveform using mel spectrogram as input.

PS3: Scaling Vision Pre-Training to 4K Resolution

Enabling 4k resolution for VLMs, CVPR 2025, https://nvlabs.github.io/PS3/

RADIO

A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.).

NeMo Curator - Classifier Models

Classifier models that can be used in NeMo Curator for labelling/filtering datasets.

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models

Open-SWE-Traces

Open-SWE-Traces: Advancing Dual-Mode Multilingual Distillation for Software Engineering Agents

Viewer • Updated 1 day ago • 207k • 568 • 13

Inference Optimized Checkpoints (with Model Optimizer)

A collection of generative models quantized and optimized for inference with Model Optimizer.

swe-zero-to-swe-hero

Datasets and Models for SWE-ZERO to SWE-HERO paper (https://arxiv.org/abs/2604.01496)

Efficient-DLM

Nemotron Supervised Fine-Tuning

SFT datasets covering math, code, chat, safety, agentic, VLM, multilingual, and specialized domains.

Nemotron Agentic & Tool-Use

Datasets for building models capable of function calling, multi-step agentic tasks, terminal use, and SWE workflows.

Nemotron Chat & Instruction Following

Datasets for building helpful, multi-turn, instruction-following conversational models across single and multi-turn settings.

Nemotron Math & Reasoning

Datasets for building models that excel at math reasoning, proofs, and quantitative problem-solving. Covers SFT, RL, and pretraining data.

Lyra

Project Lyra: Open Generative 3D World Models

NVIDIA EGM

Efficient Grounding Models

Nemotron-Cascade 2

Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Kimodo-v1

Models for human(oid) motion generation

GR00T-N1.7

NVIDIA Isaac GR00T N1.7 open vision-language-action (VLA) model for generalized humanoid

MedTech Open Models

Open models for physical AI and medical imaging — robot control, surgical simulation, segmentation, reconstruction, generation, and reasoning.

Nemotron-Terminal

We are releasing Nemotron-Terminal models and training datasets.

Speculative Decoding Modules

A collection of speculative decoding modules created using Model Optimizer.

Nemotron ColEmbed V2

State-of-the-Art Late Interaction Vision-Language Embedding Models

Earth-2

Open, state of the art models for Climate and Weather forecasting. Nowcasting, Medium range, S2S range, Downscaling.

Nemotron-Cascade

Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models

Nemotron-Pre-Training-Datasets

Large scale pre-training datasets used in the Nemotron family of models.

NeMo Gym

Collection of RL verifiable data for NeMo Gym

KVzap

Alpamayo

A collection related to the Alpamayo ecosystem, containing Reasoning VLA models, Physical AI data, simulation frameworks, training utilities, and more

PhysicsNeMo

Framework of PyTorch composable modules for developing physics guided machine learning training pipelines. https://github.com/NVIDIA/physicsnemo

Reward Models 10-2025

A collection of great reward models for research and production

Clara Medical

NVIDIA Clara Open Models for medical imaging AI: segment, generate, and reason across CT, MRI, and X-ray. Built on MONAI by NVIDIA.

BioNeMo - Optimize

NVIDIA BioNeMo Models for Optimization

Cosmos-Reason2

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

Llama-Embed-Nemotron-8B

State-of-the-Art Text Embedding Model

Reasoning Efficiency Research

Ultra-efficient reasoning model! SOTA Accuracy / CoT Length trade-offs

ViPE

Cosmos-Predict2

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos-predict25

Reward Models 06-2025

Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge

AceReason

Math and Code reasoning model trained through reinforcement learning (RL)

Cosmos-Embed1

Joint video-text embedding for physical AI

AceMath-RL

Math reasoning models trained through reinforcement learning (RL)

Text Generation • 8B • Updated Apr 23, 2025 • 452 • • 26

OpenCodeReasoning

Reasoning data for supervised finetuning of LLMs to advance data distillation for competitive coding

Llama Nemotron Feedback-Edit Inference-Time Scaling

Novel ITS approach for open-ended tasks - No. 1 on Arena Hard on 18 Mar 2025

Nemotron-UltraLong

Cosmos-Transfer1

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

Cosmos-Tokenizer1

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

Physical AI

Collection of open, commercial-grade datasets for physical AI developers

Cosmos-Preidct1

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

AceMath

We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark.

Eagle

Eagle is a family of frontier vision-language models with data-centric strategies. The model supports both HD image and long-context video input.

Hymba

A series of Hybrid Small Language Models.

NVLM 1.0

A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks.

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models.

Parakeet ASR

NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants.

InstructRetro

InstructRetro is an autoregressive decoder-only language model (LM) with retrieval-augmented pretraining and instruction tuning.

RLHF

A collection of models trained with Reinforcement Learning from Human Feedback (RLHF).

Llama3-ChatQA-1.5

Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG).

Nemotron 3 8B

The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise.

MambaVision

MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models.

Minitron

A family of compressed models obtained via pruning and knowledge distillation

Llama3-ChatQA-2

This is the collection that presents ChatQA-2, a suite of 128K long-context models, that also have exceptional RAG capabilities

Nemotron v3 Pre-Training

Large scale pre-training datasets used in the Nemotron family of models.

NVIDIA Cosmos

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models

NVIDIA OmniDreams

NVIDIA OmniDreams model checkpoints and sample datasets.

Open-SWE-Traces

Open-SWE-Traces: Advancing Dual-Mode Multilingual Distillation for Software Engineering Agents

Viewer • Updated 1 day ago • 207k • 568 • 13

Nemotron-Labs-Diffusion

A Tri-Mode Language Model Family Unifying Autoregressive, Diffusion, and Self-Speculation Decoding

Inference Optimized Checkpoints (with Model Optimizer)

A collection of generative models quantized and optimized for inference with Model Optimizer.

Nemotron-Labs-Elastic

swe-zero-to-swe-hero

Datasets and Models for SWE-ZERO to SWE-HERO paper (https://arxiv.org/abs/2604.01496)

Cosmos1

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

Efficient-DLM

AnyFlow

Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

Nemotron Supervised Fine-Tuning

SFT datasets covering math, code, chat, safety, agentic, VLM, multilingual, and specialized domains.

Nemotron Vision-Language

Image-text paired datasets for building vision-language models (VLMs).

Nemotron Agentic & Tool-Use

Datasets for building models capable of function calling, multi-step agentic tasks, terminal use, and SWE workflows.

Nemotron Safety & Content Moderation

Datasets for building safe models with refusals, content moderation, PII detection, agentic safety, and audio safety capabilities.

Nemotron Chat & Instruction Following

Datasets for building helpful, multi-turn, instruction-following conversational models across single and multi-turn settings.

Nemotron Code & SWE

Datasets for building models that write, debug, and reason about code. Covers competitive programming, software engineering, and code pretraining.

Nemotron Math & Reasoning

Datasets for building models that excel at math reasoning, proofs, and quantitative problem-solving. Covers SFT, RL, and pretraining data.

Nemotron Reward Modeling

Human preference data, reward model training sets, and generative reward modeling data for training Nemotron reward models.

Lyra

Project Lyra: Open Generative 3D World Models

NVIDIA Ising

NVIDIA Ising is a new Model Family to enable building useful Quantum Computers with AI.

NVIDIA EGM

Efficient Grounding Models

PixelDiT

Nemotron-Cascade 2

Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

NvPanoptix-3D

3D panoptic reconstruction segmentation model

Kimodo-v1

Models for human(oid) motion generation

BioNeMo - Design

NVIDIA BioNeMo Models for Design

GR00T-N1.7

NVIDIA Isaac GR00T N1.7 open vision-language-action (VLA) model for generalized humanoid

GR00T-N1.6

NVIDIA Isaac GR00T N1.6 open vision-language-action (VLA) model for generalized humanoid

MedTech Open Models

Open models for physical AI and medical imaging — robot control, surgical simulation, segmentation, reconstruction, generation, and reasoning.

Cosmos3

Omnimodal World Models for Physical AI

Nemotron-Terminal

We are releasing Nemotron-Terminal models and training datasets.

Nemotron Speech

Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S

Speculative Decoding Modules

A collection of speculative decoding modules created using Model Optimizer.

Nemotron OCR and Object Detection

Nemotron ColEmbed V2

State-of-the-Art Late Interaction Vision-Language Embedding Models

Steering Reasoning VLAs

Steering Reasoning VLA in robotics manipulation https://www.arxiv.org/abs/2510.16281

Earth-2

Open, state of the art models for Climate and Weather forecasting. Nowcasting, Medium range, S2S range, Downscaling.

Cosmos2

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

Nemotron-Cascade

Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models

Nemotron-Post-Training-v3

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3.

Nemotron-Pre-Training-Datasets

Large scale pre-training datasets used in the Nemotron family of models.

Nemotron RAG

Set of tools to build retrieval-augmented generation (RAG) systems, improve search and ranking accuracy, and extract structured data from complex docs

NeMo Gym

Collection of RL verifiable data for NeMo Gym

Cosmos Policy

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/c

KVzap

NVIDIA Nemotron V2

Open, Production-ready Enterprise Models. Nvidia Open Model license.

Alpamayo

A collection related to the Alpamayo ecosystem, containing Reasoning VLA models, Physical AI data, simulation frameworks, training utilities, and more

ChronoEdit

ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation

PhysicsNeMo

Framework of PyTorch composable modules for developing physics guided machine learning training pipelines. https://github.com/NVIDIA/physicsnemo

Cosmos-Drive-Dreams

A collection of tokenizers, diffusion models, and datasets relevant to the cosmos-drive-dreams platform.

Reward Models 10-2025

A collection of great reward models for research and production

Llama Nemotron

Open, Production-ready Enterprise Models

Clara Medical

NVIDIA Clara Open Models for medical imaging AI: segment, generate, and reason across CT, MRI, and X-ray. Built on MONAI by NVIDIA.

BioNeMo - Understand

NVIDIA BioNeMo Models for Understanding Biology

BioNeMo - Optimize

NVIDIA BioNeMo Models for Optimization

Cosmos-Predict2.5

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

Cosmos-Reason2

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

Nemotron-Personas

A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions.

Llama-Embed-Nemotron-8B

State-of-the-Art Text Embedding Model

GEN3C

3D-Informed World-Consistent Video Generation with Precise Camera Control

Reasoning Efficiency Research

Ultra-efficient reasoning model! SOTA Accuracy / CoT Length trade-offs

BioNeMo

Accelerated models for digital biology by the NVIDIA BioNeMo team. https://www.nvidia.com/en-us/clara/biopharma/

ViPE

OpenReasoning-Nemotron

Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science.

Cosmos-Predict2

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos-predict25

Audio2Face-3D

Open-weight Audio2Face-3D and Audio2Emotion networks and a sample dataset for training and evaluation

Reward Models 06-2025

Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge

Cosmos-Transfer2.5

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

Updated Feb 12 • 8.22k • 67

AceReason

Math and Code reasoning model trained through reinforcement learning (RL)

Nemotron-H

Mamba-Transformer hybrid models

Cosmos-Embed1

Joint video-text embedding for physical AI

Describe Anything

Multimodal Large Language Models for Detailed Localized Image and Video Captioning

AceMath-RL

Math reasoning models trained through reinforcement learning (RL)

Text Generation • 8B • Updated Apr 23, 2025 • 452 • • 26

OpenMathReasoning

Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset"

OpenCodeReasoning

Reasoning data for supervised finetuning of LLMs to advance data distillation for competitive coding

OpenCodeReasoning-II

Reasoning data for supervised finetuning of LLMs to advance code generation and critique

Llama Nemotron Feedback-Edit Inference-Time Scaling

Novel ITS approach for open-ended tasks - No. 1 on Arena Hard on 18 Mar 2025

Scoring Verifiers

Benchmarks for evaluating synthetic verifiers like test case generation and code reward models (as found in https://www.arxiv.org/abs/2502.13820).

Nemotron-UltraLong

Cosmos-Reason1

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

Cosmos-Transfer1

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

Cosmos-Predict1

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos-predict2

Cosmos-Tokenizer1

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024.

Physical AI

Collection of open, commercial-grade datasets for physical AI developers

QLIP

QLIP is a family of image tokenizers with SOTA reconstruction quality and zero-shot image understanding.

Cosmos-Preidct1

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3

DMC

LLMs equipped with Dynamic Memory Compression to accelerate generation.

AceMath

We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark.

NemoGuard

Essential datasets and models for content safety, topic-following, and security guardrails

Eagle

Eagle is a family of frontier vision-language models with data-centric strategies. The model supports both HD image and long-context video input.

NeMo Audio Codecs

A series of Neural Audio Codecs

Hymba

A series of Hybrid Small Language Models.

Optimized ONNX models for NVIDIA RTX GPUs

Collection of optimized ONNX model checkpoints for NVIDIA RTX GPUs

NVLM 1.0

A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks.

OpenMath-2

A collection of models and datasets introduced in "OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data"

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models.

SteerLM

A collection of models and datasets relating to SteerLM and HelpSteer.

Parakeet ASR

NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants.

Canary ASR/AST

A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤

InstructRetro

InstructRetro is an autoregressive decoder-only language model (LM) with retrieval-augmented pretraining and instruction tuning.

OpenMath

A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset"

RLHF

A collection of models trained with Reinforcement Learning from Human Feedback (RLHF).

NV-Embed

NV-Embed is a generalist embedding model encompassing retrieval, reranking, classification, clustering, STS tasks.

Llama3-ChatQA-1.5

Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG).

SSMs

A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers.

Nemotron 3 8B

The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise.

BigVGAN

BigVGAN is a universal neural vocoder that generates audio waveform using mel spectrogram as input.

MambaVision

MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models.

PS3: Scaling Vision Pre-Training to 4K Resolution

Enabling 4k resolution for VLMs, CVPR 2025, https://nvlabs.github.io/PS3/

Minitron

A family of compressed models obtained via pruning and knowledge distillation

RADIO

A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.).

Llama3-ChatQA-2

This is the collection that presents ChatQA-2, a suite of 128K long-context models, that also have exceptional RAG capabilities

NeMo Curator - Classifier Models

Classifier models that can be used in NeMo Curator for labelling/filtering datasets.

Nemotron v3 Pre-Training

Large scale pre-training datasets used in the Nemotron family of models.

URL: https://huggingface.co/nvidia/collections

⇱ nvidia (NVIDIA)

AI & ML interests

Recent Activity

Papers

Articles

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

Adaptive Ultrasound Imaging with Physics-Informed NV-Raw2Insights-US AI

Gemma 4 VLA Demo on Jetson Orin Nano Super

How to Ground a Korean AI Agent in Real Demographics with Synthetic Personas

Building a Fast Multilingual OCR Model with Synthetic Data

NVIDIA Isaac GR00T N1.7: Open Reasoning VLA Model for Humanoid Robots

Build a Domain-Specific Embedding Model in Under a Day

Nemotron 3 Content Safety 4B: Multimodal, Multilingual Content Moderation

Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare Robotics

Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline

Build an Agent That Thinks Like a Data Scientist: How We Hit #1 on DABStep with Reusable Tool Generation

How NVIDIA AI-Q Reached \#1 on DeepResearch Bench I and II

Code Concepts: A Large-Scale Synthetic Dataset Generated from Programming Concept Seeds

How NVIDIA Builds Open Data for AI

Deploying Open Source Vision Language Models (VLM) on Jetson

「データ不足」の壁を越える：合成ペルソナが日本のAI開発を加速

From Scarcity to Scale: How Synthetic Personas Can Bootstrap Japanese AI Development

NVIDIA Nemotron 2 Nano 9B Japanese: 日本のソブリンAIを支える最先端小規模言語モデル

NVIDIA Nemotron 2 Nano 9B Japanese: State-of-the-Art Small Language Model Customized for Japanese Sovereign AI

Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model

Introducing NVIDIA Cosmos Policy for Advanced Robot Control

Nemotron-Personas-Brazil: Co-Designed Data for Sovereign AI

Nemotron-Personas-Singapore: Co-Designed Data for Sovereign AI

**NVIDIA Earth-2 Open Models Span the Whole Weather Stack**

Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models

Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

Custom Policy Enforcement with Reasoning: Faster, Safer AI Applications

How to Build a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac for Healthcare

🛡️ Nemotron PII: Synthesized Data for Privacy-Preserving AI

Nemotron-Personas-USA: Synthesized Data for Sovereign AI

NVIDIA Isaac GR00T in LeRobot

Can Your LLM Think Like a Professional? Introducing ProfBench

NVIDIA Releases 8 Million Sample Open Dataset and Tooling for OCR, Image Reasoning, Image and Video QA Tasks

Cosmos Predict 2.5 & Transfer 2.5: Evolving the World Foundation Models for Physical AI

Nemotron’s Open Secret: Accelerating AI Development with Open Models, Data, and Recipes

Llama‑Embed‑Nemotron‑8B Text Embedding Model Ranks First on Multilingual MTEB Leaderboard

Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models

Nemotron-Personas-India: Synthesized Data for Sovereign AI

Nemotron-Personas-Japan: ソブリン AI のための合成データセット

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

📢 NVIDIA Releases Nemotron-CC-Math Pre-Training Dataset: A High-Quality, Web-Scale Math Corpus for Pretraining Large Language Models

NVIDIA Releases Improved Pretraining Dataset: Preserves High Value Math & Code, and Augments with Multi-Lingual

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

Measuring Open-Source Llama Nemotron Models on DeepResearch Bench

Accelerate a World of LLMs on Hugging Face with NVIDIA NIM

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

Llama-NeMoRetriever-ColEmbed: Developer-Focused Guide to NVIDIA's State-of-the-Art Text-Image Retrieval

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

Introducing Cosmos Predict-2: A Foundation For Your Own World Model

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

Supercharge Edge AI with High Accuracy Reasoning Using Llama Nemotron Nano 4B

Nemotron-Personas: Improve AI Training With the First Synthetic Personas Dataset Aligned to Real-World Distributions

Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes

Mastering Long Contexts in LLMs with KVPress

nvidia 's collections 116

Nemotron Speech Streaming

Magpietts Demo

Describe Anything

BigVGAN

Kimodo

NVIDIA Earth-2 Open Models Span the Whole Weather Stack