VOOZH about

URL: https://huggingface.co/Tuwhy/Llama-3.2V-11B-Sherlock-SFT

⇱ Tuwhy/Llama-3.2V-11B-Sherlock-SFT · Hugging Face


Sherlock: Self-Correcting Reasoning in Vision-Language Models

Introduction

Sherlock is a training framework focus on improving Vision-Language Models reasoning and self-correction capabilities.

GitHub repo: https://github.com/DripNowhy/Sherlock

Project Page: https://dripnowhy.github.io/Sherlock/

arXiv: https://arxiv.org/abs/2505.22651

Downloads last month
5
Safetensors
Model size
11B params
Tensor type
BF16
·

Model tree for Tuwhy/Llama-3.2V-11B-Sherlock-SFT

Finetuned
(160)
this model

Dataset used to train Tuwhy/Llama-3.2V-11B-Sherlock-SFT

Collection including Tuwhy/Llama-3.2V-11B-Sherlock-SFT

Paper for Tuwhy/Llama-3.2V-11B-Sherlock-SFT