VOOZH

URL: https://huggingface.co/dongguanting/collections

⇱ dongguanting (KABI)

👁 KABI's picture

KABI

dongguanting

👁 Image
tcy6's profile picture 👁 Image
hooooliday's profile picture 👁 Image
JoeYing's profile picture

·

https://dongguanting.github.io/

AI & ML interests

Reasoning and Alignment for Large Language Models

Recent Activity

upvoted a paper 5 days ago

Qwen-AgentWorld: Language World Models for General Agents

authored a paper 19 days ago

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

commentedon a paper 19 days ago

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

View all activity

Organizations

👁 Renmin University of China's profile picture
👁 BUPT AI PRIS's profile picture

dongguanting 's collections 4

The official datasets and model checkpoints of AEPO

Tool-Star is a reinforcement learning-based framework designed to empower LLMs to autonomously invoke multiple external tools during stepwise reasonin

The official datasets and model checkpoints of ARPO

The official datasets and model checkpoints of AEPO

The official datasets and model checkpoints of ARPO

Tool-Star is a reinforcement learning-based framework designed to empower LLMs to autonomously invoke multiple external tools during stepwise reasonin