VOOZH about

URL: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/discussions/110

โ‡ฑ deepseek-ai/DeepSeek-V4-Pro ยท Add community evaluation results for GPQA, GSM8K, HLE, MMLU-PRO, SWE-BENCH_PRO, SWE-BENCH_VERIFIED, TERMINAL-BENCH-2.0


Add community evaluation results for GPQA, GSM8K, HLE, MMLU-PRO, SWE-BENCH_PRO, SWE-BENCH_VERIFIED, TERMINAL-BENCH-2.0

#110
by nielsr HF Staff - opened

This PR adds community-provided evaluation results for the following benchmarks:

These results were extracted from the model card. This is based on the new evaluation results feature.

Note: This is an automated PR. Please review the evaluation results before merging.

Ready to merge
This branch is ready to get merged automatically.

ยท Sign up or log in to comment