mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-nvfp4

This model was converted to MLX format from nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16 using mlx-vlm version 0.4.5. Refer to the original model card for more details on the model.

Use with mlx

pip install -U mlx-vlm

python -m mlx_vlm.generate --model mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-nvfp4 --max-tokens 100 --temperature 0.0 --prompt "Describe this image." --image <path_to_image>
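To run the same command from a script, for example over several images, one option is to wrap the CLI call in Python. The sketch below is an illustration and not part of mlx-vlm: `build_command` and `describe_image` are hypothetical helper names, and only the flags shown in the command above are used. It assumes `mlx-vlm` is installed in the interpreter that runs the script.

```python
import subprocess
import sys

MODEL_ID = "mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-nvfp4"


def build_command(image_path, prompt="Describe this image.",
                  max_tokens=100, temperature=0.0):
    """Return the argv list for the mlx_vlm.generate CLI call shown above.

    Hypothetical helper: it only assembles the flags that appear in this
    model card (--model, --max-tokens, --temperature, --prompt, --image).
    """
    return [
        sys.executable, "-m", "mlx_vlm.generate",
        "--model", MODEL_ID,
        "--max-tokens", str(max_tokens),
        "--temperature", str(temperature),
        "--prompt", prompt,
        "--image", str(image_path),
    ]


def describe_image(image_path, **options):
    """Run the CLI for one image and return whatever it wrote to stdout.

    Raises subprocess.CalledProcessError if the command exits with an error.
    """
    result = subprocess.run(
        build_command(image_path, **options),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

Calling `describe_image("<path_to_image>")` then returns the text the command prints. Passing the arguments as a list avoids shell quoting problems when a prompt contains spaces or quotes.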

Safetensors

Model size: 9B params

Tensor type: U32, BF16

Format: MLX (4-bit)

Model tree for mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-nvfp4

Base model: nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16

Quantized (53): this model

URL: https://huggingface.co/mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-nvfp4
