Trainee2Trainer This is the checkpoints and dataset for: From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning Paper • 2606.17682 • Published 13 days ago • 26 Text Generation • 4B • Updated 11 days ago • 17 • 1 Viewer • Updated 11 days ago • 3.15k • 73 • 1
CodeScaler Text Classification • 8B • Updated Feb 23 • 71 Text Classification • 2B • Updated Feb 23 • 4 Text Classification • 4B • Updated Feb 23 • 4 Viewer • Updated Feb 23 • 51.1k • 23 • 1
EnvFactory This is the checkpoints and dataset for: EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL. Paper • 2605.18703 • Published May 18 • 50 Text Generation • 2B • Updated May 19 • 62 Text Generation • 4B • Updated May 19 • 5 Text Generation • 8B • Updated May 20 • 7 • 1
Trainee2Trainer This is the checkpoints and dataset for: From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning Paper • 2606.17682 • Published 13 days ago • 26 Text Generation • 4B • Updated 11 days ago • 17 • 1 Viewer • Updated 11 days ago • 3.15k • 73 • 1
EnvFactory This is the checkpoints and dataset for: EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL. Paper • 2605.18703 • Published May 18 • 50 Text Generation • 2B • Updated May 19 • 62 Text Generation • 4B • Updated May 19 • 5 Text Generation • 8B • Updated May 20 • 7 • 1
CodeScaler Text Classification • 8B • Updated Feb 23 • 71 Text Classification • 2B • Updated Feb 23 • 4 Text Classification • 4B • Updated Feb 23 • 4 Viewer • Updated Feb 23 • 51.1k • 23 • 1