Reinforcement Learning • 8B • Updated • 87 • 6
Yaolun Zhang
Mercury7353
👁 Image
tianyi0216's profile picture👁 Image
shtefcs's profile picture👁 Image
specimba's profile picture
tianyi0216's profile picture👁 Image
shtefcs's profile picture👁 Image
specimba's profile picture
·
AI & ML interests
Code LLM, LLM Agent, Multi-Agent System
Recent Activity
liked a model 18 days ago
ByteDance-Seed/Seed-OSS-36B-Instruct authored a paper 27 days ago
When Does Multi-Agent RL Improve LLM Workflows? Workflow, Scale, and Policy-Sharing Tradeoffs