👁 Image Submitted by 👁 Image Chen chao 26 From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning 👁 LARK-Lab LARK Lab@HKUST (GZ) 22 2
👁 Image Submitted by 👁 Image Zhou 16 Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It 👁 LARK-Lab LARK Lab@HKUST (GZ) 2
👁 Image Submitted by 👁 Image shawnxzhu 50 EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL 👁 LARK-Lab LARK Lab@HKUST (GZ) 73 1
👁 Image Submitted by 👁 Image Zhou 14 Efficient RLVR Training via Weighted Mutual Information Data Selection 👁 LARK-Lab LARK Lab@HKUST (GZ) 2