Garnet-OCR-3B-0422-GGUF

Garnet-OCR-3B-0422 is a fine-tuned evolution of Megalodon-OCR-Sync-0713, built on the Qwen2.5-VL-3B-Instruct architecture. It is designed for high-precision mathematical formula extraction, structured Markdown generation, and accurate table reconstruction, which makes it well suited to technical, scientific, and other structured documents. It was trained on an enhanced mixture of document-centric datasets, including large-scale OCR-caption pairs and structured document corpora, to improve layout fidelity, symbolic reasoning, and content structuring across document types such as research papers, scanned PDFs, handwritten equations, and analytical reports. This repository provides GGUF builds of the model.

Model Files

File Name	Quant Type	File Size	File Link
Garnet-OCR-3B-0422.BF16.gguf	BF16	6.8 GB	Download
Garnet-OCR-3B-0422.F16.gguf	F16	6.8 GB	Download
Garnet-OCR-3B-0422.F32.gguf	F32	13.6 GB	Download
Garnet-OCR-3B-0422.Q8_0.gguf	Q8_0	3.62 GB	Download
Garnet-OCR-3B-0422.mmproj-bf16.gguf	mmproj-bf16	1.34 GB	Download
Garnet-OCR-3B-0422.mmproj-f16.gguf	mmproj-f16	1.34 GB	Download
Garnet-OCR-3B-0422.mmproj-f32.gguf	mmproj-f32	2.67 GB	Download
Garnet-OCR-3B-0422.mmproj-q8_0.gguf	mmproj-q8_0	848 MB	Download
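
The table lists two kinds of files: language-model weights and `mmproj` files. In llama.cpp-style multimodal setups, the `mmproj` file carries the vision encoder and projector and is loaded alongside the language-model file. The sketch below shows one way to pick a matching pair by quant type, fetch it, and assemble a command line. The helper names (`file_pair`, `download_pair`, `mtmd_command`) are hypothetical and exist only for illustration. The download step assumes the third-party `huggingface_hub` package is installed, and the command assumes a llama.cpp build that ships `llama-mtmd-cli` with the `-m`, `--mmproj`, `--image`, and `-p` options; check `--help` on your build before relying on it.

```python
from typing import List, Optional, Tuple

# Repository and file prefix as they appear on this card.
REPO_ID = "prithivMLmods/Garnet-OCR-3B-0422-GGUF"
PREFIX = "Garnet-OCR-3B-0422"

# Quant types listed in the Model Files table.
QUANTS = ("BF16", "F16", "F32", "Q8_0")


def file_pair(quant: str, mmproj_quant: Optional[str] = None) -> Tuple[str, str]:
    """Return (language-model file, mmproj file) names for a quant type.

    Hypothetical helper: by default it pairs a model file with the mmproj
    file of the same quant type, which is an assumption, not a requirement.
    """
    mmproj_quant = mmproj_quant or quant
    for q in (quant, mmproj_quant):
        if q not in QUANTS:
            raise ValueError(f"unknown quant type: {q!r}")
    model_name = f"{PREFIX}.{quant}.gguf"
    mmproj_name = f"{PREFIX}.mmproj-{mmproj_quant.lower()}.gguf"
    return model_name, mmproj_name


def download_pair(quant: str, local_dir: str = "models") -> Tuple[str, str]:
    """Download both files of a pair and return their local paths."""
    # Imported here so the rest of the sketch works without the package.
    from huggingface_hub import hf_hub_download

    model_name, mmproj_name = file_pair(quant)
    model_path = hf_hub_download(
        repo_id=REPO_ID, filename=model_name, local_dir=local_dir
    )
    mmproj_path = hf_hub_download(
        repo_id=REPO_ID, filename=mmproj_name, local_dir=local_dir
    )
    return model_path, mmproj_path


def mtmd_command(
    model_path: str, mmproj_path: str, image_path: str, prompt: str
) -> List[str]:
    """Build an argument list for llama.cpp's multimodal CLI (assumed flags)."""
    return [
        "llama-mtmd-cli",
        "-m", model_path,
        "--mmproj", mmproj_path,
        "--image", image_path,
        "-p", prompt,
    ]
```

For example, `file_pair("Q8_0")` returns `("Garnet-OCR-3B-0422.Q8_0.gguf", "Garnet-OCR-3B-0422.mmproj-q8_0.gguf")`. The list returned by `mtmd_command` can be passed to `subprocess.run` once both files are on disk.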

Quants Usage

(Sorted by size, not necessarily by quality. IQ-quants are often preferable to similar-sized non-IQ quants.)

Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):

[Figure: ikawrakow's comparison of lower-quality quant types]
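
As a rough worked example of how the listed file sizes relate to quant type: F32 stores 32 bits per weight, F16 and BF16 store 16, and Q8_0 stores about 8.5 (32 int8 values plus one 16-bit scale per block). Scaling the 6.8 GB F16 file by 8.5/16 gives about 3.61 GB, close to the 3.62 GB listed for Q8_0, and doubling it gives the 13.6 GB listed for F32. The sketch below encodes that arithmetic and picks the largest listed quant whose model and mmproj files fit a size budget. The function names are hypothetical, and the budget covers file size only, not runtime memory such as the KV cache or image activations.

```python
from typing import Dict, Optional

# Nominal bits per weight for the quant types on this card.
BITS_PER_WEIGHT: Dict[str, float] = {
    "F32": 32.0,
    "F16": 16.0,
    "BF16": 16.0,
    "Q8_0": 8.5,  # 34 bytes per block of 32 weights
}

# File sizes in GB, copied from the Model Files table.
MODEL_GB = {"F32": 13.6, "F16": 6.8, "BF16": 6.8, "Q8_0": 3.62}
MMPROJ_GB = {"F32": 2.67, "F16": 1.34, "BF16": 1.34, "Q8_0": 0.848}


def estimate_size_gb(reference_gb: float, reference_quant: str, target_quant: str) -> float:
    """Scale a known file size by the ratio of bits per weight.

    This ignores metadata and any tensors kept at a different precision,
    so it is an approximation only.
    """
    ratio = BITS_PER_WEIGHT[target_quant] / BITS_PER_WEIGHT[reference_quant]
    return reference_gb * ratio


def largest_fitting_quant(budget_gb: float) -> Optional[str]:
    """Return the quant whose model + mmproj files are largest but within budget."""
    totals = {q: MODEL_GB[q] + MMPROJ_GB[q] for q in MODEL_GB}
    fitting = {q: total for q, total in totals.items() if total <= budget_gb}
    if not fitting:
        return None
    return max(fitting, key=fitting.get)
```

With the sizes above, `largest_fitting_quant(8)` selects `"Q8_0"`, because the 16-bit pairs total roughly 8.14 GB on disk.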

Format: GGUF
Model size: 3B params
Architecture: qwen2vl
Hardware compatibility: 8-bit, 16-bit, 32-bit

Model tree for prithivMLmods/Garnet-OCR-3B-0422-GGUF

Base model: Qwen/Qwen2.5-VL-3B-Instruct
Finetuned: prithivMLmods/Megalodon-OCR-Sync-0713
Finetuned: prithivMLmods/Garnet-OCR-3B-0422
Quantized (3): this model