AEPO The official datasets and model checkpoints of AEPO Paper • 2510.14545 • Published Oct 16, 2025 • 109 Text Generation • 8B • Updated Dec 20, 2025 • 29 • • 2 Text Generation • 33B • Updated Dec 20, 2025 • 5 • 2 Robotics • 15B • Updated Oct 21, 2025 • 4 • 1
Tool-Star Tool-Star is a reinforcement learning-based framework designed to empower LLMs to autonomously invoke multiple external tools during stepwise reasonin Paper • 2505.16410 • Published May 22, 2025 • 59 Viewer • Updated May 29, 2025 • 54k • 295 • 10 Viewer • Updated May 25, 2025 • 10k • 133 • 5 Text Generation • 8B • Updated Jun 30, 2025 • 24 • • 2
ARPO The official datasets and model checkpoints of ARPO Paper • 2507.19849 • Published Jul 26, 2025 • 161 8B • Updated Jul 29, 2025 • 14 • 2 33B • Updated Dec 20, 2025 • 4 • 1 Text Generation • 15B • Updated Aug 12, 2025 • 43 • 5
RAG-Critic Text Generation • 3B • Updated Jun 28, 2025 • 22 • • 4 Viewer • Updated Jun 28, 2025 • 100k • 15 • 3
AEPO The official datasets and model checkpoints of AEPO Paper • 2510.14545 • Published Oct 16, 2025 • 109 Text Generation • 8B • Updated Dec 20, 2025 • 29 • • 2 Text Generation • 33B • Updated Dec 20, 2025 • 5 • 2 Robotics • 15B • Updated Oct 21, 2025 • 4 • 1
ARPO The official datasets and model checkpoints of ARPO Paper • 2507.19849 • Published Jul 26, 2025 • 161 8B • Updated Jul 29, 2025 • 14 • 2 33B • Updated Dec 20, 2025 • 4 • 1 Text Generation • 15B • Updated Aug 12, 2025 • 43 • 5
Tool-Star Tool-Star is a reinforcement learning-based framework designed to empower LLMs to autonomously invoke multiple external tools during stepwise reasonin Paper • 2505.16410 • Published May 22, 2025 • 59 Viewer • Updated May 29, 2025 • 54k • 295 • 10 Viewer • Updated May 25, 2025 • 10k • 133 • 5 Text Generation • 8B • Updated Jun 30, 2025 • 24 • • 2
RAG-Critic Text Generation • 3B • Updated Jun 28, 2025 • 22 • • 4 Viewer • Updated Jun 28, 2025 • 100k • 15 • 3