URL: https://github.com/open-compass

OpenCompass · GitHub


What is OpenCompass? OpenCompass is a platform focused on understanding AGI, including large language models and multi-modality models.

We aim to:

  • develop high-quality libraries that reduce the difficulty of evaluation
  • provide convincing leaderboards that improve understanding of large models
  • create powerful toolchains targeting a variety of abilities and tasks
  • build solid benchmarks to support large model research
  • research inference with large models (analysis, reasoning, prompt engineering)

Toolkit

OpenCompass

VLMEvalKit

Models

CompassVerifier

CompassJudger

Benchmarks and Methods

Pinned

  1. opencompass Public

    OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

    Python 6.8k 754

  2. VLMEvalKit Public

    Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

    Python 4k 667

  3. MMBench Public

    Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?"

    295 17

  4. CompassVerifier Public

    [EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward

    Jupyter Notebook 68 2

  5. CompassJudger Public

    The all-in-one judge models introduced by OpenCompass

    119 6

  6. MMBench-GUI Public

    Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent in a hierarchical manner across multiple platforms, includi…

    Python 103 6

Repositories

Showing 10 of 47 repositories