Voozh

VOOZH

URL: https://huggingface.co/CASIA

⇱ CASIA (Chinese Academic of Science Institute of Automation)

AI & ML interests

None defined yet.

Recent Activity

jinzhuoran submitted a paper 3 days ago

Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It

jinzhuoran submitted a paper 4 days ago

Look Light, Think Heavy: What Multimodal Chain-of-Thought Reasoning Can and Cannot Do

MarkWang authored a paper about 1 month ago

Joint Training of Multi-Token Prediction in Reinforcement Learning via Optimal Coefficient Calibration

View all activity

Papers

Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It

Look Light, Think Heavy: What Multimodal Chain-of-Thought Reasoning Can and Cannot Do

View all Papers

models 0

None public yet

datasets 0

None public yet