VOOZH

URL: https://huggingface.co/Tuwhy/Llama-3.2V-11B-Sherlock-SFT

⇱ Tuwhy/Llama-3.2V-11B-Sherlock-SFT · Hugging Face

Sherlock: Self-Correcting Reasoning in Vision-Language Models

Introduction

Sherlock is a training framework focus on improving Vision-Language Models reasoning and self-correction capabilities.

GitHub repo: https://github.com/DripNowhy/Sherlock

Project Page: https://dripnowhy.github.io/Sherlock/

arXiv: https://arxiv.org/abs/2505.22651

Downloads last month: 5

Safetensors

Model size

11B params

Tensor type

BF16

·

Model tree for Tuwhy/Llama-3.2V-11B-Sherlock-SFT

Base model

meta-llama/Llama-3.2-11B-Vision-Instruct

Finetuned

(160)

this model

Dataset used to train Tuwhy/Llama-3.2V-11B-Sherlock-SFT

Collection including Tuwhy/Llama-3.2V-11B-Sherlock-SFT

Series model of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models" • 5 items • Updated May 29, 2025 • 3

Paper for Tuwhy/Llama-3.2V-11B-Sherlock-SFT

Paper • 2505.22651 • Published May 28, 2025 • 47