AI & ML interests

Large Language Models

Recent Activity

FYYDCC updated a collection 11 days ago

Trainee2Trainer

FYYDCC updated a dataset 11 days ago

LARK-Lab/MAPF-FrozenLake-Benchmark

FYYDCC updated a model 11 days ago

LARK-Lab/Trainee2Trainer

Papers

From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It

Submitted by Chen chao

From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning

LARK-Lab
LARK Lab@HKUST (GZ)

22 2

Submitted by Zhou

Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It

LARK-Lab
LARK Lab@HKUST (GZ)

Submitted by shawnxzhu

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

LARK-Lab
LARK Lab@HKUST (GZ)

73 1

Submitted by Zhou

Efficient RLVR Training via Weighted Mutual Information Data Selection

LARK-Lab
LARK Lab@HKUST (GZ)

URL: https://huggingface.co/LARK-Lab/papers

LARK-Lab (LARK Lab@HKUST (GZ))
