Voozh

Add ParseBench evaluation results

by boyang-runllama - opened 4 days ago

base: refs/heads/main

←

from: refs/pr/4

Discussion Files changed

4 days ago

This PR ensures your model shows up at https://huggingface.co/datasets/llamaindex/ParseBench.

This is based on the new evaluation results feature: https://huggingface.co/docs/hub/eval-results.

Note: this includes per-dimension performance across all 5 ParseBench dimensions (text_content, text_formatting, layout, chart, table) along with the overall mean score.

👁 Image

Add ParseBench evaluation results3e0df125

👁 Image

rhirae

4 days ago

Worth a caveat next to these numbers. On clean, structured pages it scores well, which is most of what ParseBench measures. On faint or low quality scans it hallucinates when it can't read the text it invents plausible content instead of leaving it blank.
In a side by side which I performed, it was the LEAST faithful of several OCR models -

olmOCR-2 > Qianfan-OCR > PaddleOCR-VL > HunyuanOCR > Unlimited-OCR

👁 Image

shihad22

4 days ago

can you share your training data

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment

URL: https://huggingface.co/baidu/Unlimited-OCR/discussions/4

⇱ baidu/Unlimited-OCR · Add ParseBench evaluation results

Add ParseBench evaluation results