Voozh

VOOZH

URL: https://huggingface.co/mzh12345/RLLaVA_math_grpo_online_3b

⇱ mzh12345/RLLaVA_math_grpo_online_3b · Hugging Face

README.md exists but content is empty.

Downloads last month: 3

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for mzh12345/RLLaVA_math_grpo_online_3b

Base model

Qwen/Qwen2.5-VL-3B-Instruct

Finetuned

(793)

this model

Dataset used to train mzh12345/RLLaVA_math_grpo_online_3b

Collection including mzh12345/RLLaVA_math_grpo_online_3b

An RL-central Framework for Language and Vision Assistant, which decouples algorithm logic from distributed execution, enables modular customization. • 4 items • Updated Nov 28, 2025