group-relative-policy-optimization

Here are 7 public repositories matching this topic...

eVI-group-SCU / Dr-Seg

[CVPR 2026 Official Implementation] Dr. Seg: Revisiting GRPO Training for Visual Large Language Models through Perception-Oriented Design

reinforcement-learning computer-vision segmentation object-detection multimodal-large-language-models visual-large-language-models reasoning-segmentation group-relative-policy-optimization

Updated
Python

jeffasante / grpo-maze-solver

Star

A reinforcement learning agent that learns to solve mazes using Group Relative Policy Optimization (GRPO).

reinforcement-learning-agent grpo deep-seek-r1-grpo group-relative-policy-optimization

Updated
Python

Mr-Wonderfool / Multimodal-Reinforce-CoT

Star

Fine-tuning Qwen2.5-VL-3B-Instruct to output high quality chain-of-thoughts on GQA dataset with reinforcement learning

reinforcement-learning visual-question-answering chain-of-thought visual-language-models group-relative-policy-optimization

Updated
Python

johnnycrab / easy-GRPO

Star

A simple and explained implementation of (Dr.) GRPO in PyTorch.

reasoning llm reasoning-language-models grpo group-relative-policy-optimization

Updated
Python

Dichotoom / Bachelor-Project

Star

Bachelor thesis codebase: GRPO training for improving mathematical reasoning in small language models using reinforcement learning

reinforcement-learning language-models mathematical-reasoning group-relative-policy-optimization

Updated
Python

MSWagner / qwen-lora-grpo-letter-counting

Star

Fine-tuning Qwen2.5-3B-Instruct model with LoRa (Low-Rank Adaptation) and Group Relative Policy Optimization (GRPO)

reinforcement-learning lora llm generative-ai low-rank-adaptation qwen2-5 group-relative-policy-optimization

Updated
Jupyter Notebook

K-UniLab / Improfessor-AI

Star

Fine-tune a model that generates exams based on lecture materials.

question-generation llm supervised-finetuning direct-preference-optimization group-relative-policy-optimization gemma-3-4b-it

Updated
Python

Improve this page

Add a description, image, and links to the group-relative-policy-optimization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the group-relative-policy-optimization topic, visit your repo's landing page and select "manage topics."

Learn more

URL: https://github.com/topics/group-relative-policy-optimization