rlvr
Here are 46 public repositories matching this topic...
Awesome List for Agentic RL
- HTML
[EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data)
- Python
Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934
- Python
[ICLR 2026] An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"
- Python
A curated list of awesome resources about reward construction for AI agents. This repository covers cutting-edge research and practical guides on defining and collecting rewards to build more intelligent and aligned AI agents.
Procedural symbolic reasoning data generators suite for synthetic pretraining
- Python
🐝 SwarmBench: Benchmarking LLMs' Swarm Intelligence
- Python
The official repository of the paper "Do Reasoning Models Enhance Embedding Models?"
- Python
This is the official code of DeepSearch [ICLR 2026]
- Python
CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR
- Python
GRPO training for long-form QA and instruction following with a long-form reward model
- Python
MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs
- Python
[arXiv] "Not All Steps are Informative: On the Linearity of LLMs’ RLVR Training"
- Python
PersonaMem-v2: Towards Personalized Intelligence via Learning Implicit User Personas and Agentic Memory
- Python
