Nathan Habib PRO
AI & ML interests
Evals
Recent Activity
Liked a model 3 days ago: nex-agi/Nex-N2-Pro
Liked a model 3 days ago: prefeitura-rio/Rio-3.5-Open-397B
Organizations
Hugging Face
Evaluation datasets
Hugging Test Lab
HuggingFaceGECLM
BigCode
Hugging Face H4
BigCode Data
Blog-explorers
Hugging Face Smol Models Research
Hugging Face Smol Cluster
Open LLM Leaderboard
huggingPartyParis
Qwen
gg-hf
Nanotron Research
FineData
HF-contamination-detection
Top Contributors: Dataset Downloads
hsramall
La Leaderboard
gg-tt
HuggingFaceEval
Novel Challenge
LLHF
SLLHF
lbhf
Lighteval testing org
Hugging Face Science
Coordination Nationale pour l'IA
open-llm-leaderboard-react
Prompt Leaderboard
wut?
smolagents
Your Bench
Open R1
gg-hf-g
OpenEvals
arc-agi-community
yofo
Inference Provider Automated Testing
gg-hf-gg
ML intern explorers
Exgentic
the-best-team
benchmarks
RULER Datasets Falcon-H1-3B-Base
RULER Datasets Lamma3-Instruct
RULER Datasets Qwen2.5-Instruct
RULER Datasets Qwen-3-Instruct
RULER Datasets Qwen-3
agents
Agents resources
All the resources I found or used when getting up to speed with agents.
