Voozh

VOOZH

URL: https://huggingface.co/LARK-Lab/Trainee2Trainer

⇱ LARK-Lab/Trainee2Trainer · Hugging Face

This is the checkpoint of the From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning

Downloads last month: 17

Safetensors

Model size

4B params

Tensor type

BF16

·

Model tree for LARK-Lab/Trainee2Trainer

Base model

Qwen/Qwen3-4B-Base

Finetuned

Finetuned

(744)

this model

Collection including LARK-Lab/Trainee2Trainer

This is the checkpoints and dataset for: From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning • 3 items • Updated 11 days ago

Paper for LARK-Lab/Trainee2Trainer

Paper • 2606.17682 • Published 13 days ago • 26