mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-4bit

This model was converted to MLX format from nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16 using mlx-vlm version 0.4.5. Refer to the original model card for more details on the model.

Use with mlx

pip install -U mlx-vlm

python -m mlx_vlm.generate --model mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-4bit --max-tokens 100 --temperature 0.0 --prompt "Describe this image." --image <path_to_image>

Downloads last month: 1,639

Safetensors

Model size

6B params

Tensor type

BF16

U32

MLX

Hardware compatibility

4-bit

Model tree for mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-4bit

Base model

nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16

Quantized

(53)

this model

Dataset used to train mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-4bit

Collection including mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-4bit

8 items • Updated Apr 28 • 2

URL: https://huggingface.co/mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-4bit

⇱ mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-4bit · Hugging Face

mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-4bit

Use with mlx

Model tree for mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-4bit

Dataset used to train mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-4bit

Collection including mlx-community/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-4bit