Paper โข 2308.09583 โข Published โข 8
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
๐ Home Page
๐ค HF Repo โข๐ฑ Github Repo โข ๐ฆ Twitter
๐ [WizardLM] โข ๐ [WizardCoder] โข ๐ [WizardMath]
๐ Join our Discord
News
[2024/01/04] ๐ฅ We released WizardCoder-33B-V1.1 trained from deepseek-coder-33b-base, the SOTA OSS Code LLM on EvalPlus Leaderboard, achieves 79.9 pass@1 on HumanEval, 73.2 pass@1 on HumanEval-Plus, 78.9 pass@1 on MBPP, and 66.9 pass@1 on MBPP-Plus.
[2024/01/04] ๐ฅ WizardCoder-33B-V1.1 outperforms ChatGPT 3.5, Gemini Pro, and DeepSeek-Coder-33B-instruct on HumanEval and HumanEval-Plus pass@1.
[2024/01/04] ๐ฅ WizardCoder-33B-V1.1 is comparable with ChatGPT 3.5, and surpasses Gemini Pro on MBPP and MBPP-Plus pass@1.
| Model | Checkpoint | Paper | HumanEval | HumanEval+ | MBPP | MBPP+ | License |
|---|---|---|---|---|---|---|---|
| GPT-4-Turbo (Nov 2023) | - | - | 85.4 | 81.7 | 83.0 | 70.7 | - |
| GPT-4 (May 2023) | - | - | 88.4 | 76.8 | - | - | - |
| GPT-3.5-Turbo (Nov 2023) | - | - | 72.6 | 65.9 | 81.7 | 69.4 | - |
| Gemini Pro | - | - | 63.4 | 55.5 | 72.9 | 57.9 | - |
| DeepSeek-Coder-33B-instruct | - | - | 78.7 | 72.6 | 78.7 | 66.7 | - |
| WizardCoder-33B-V1.1 | ๐ค HF Link | ๐ [WizardCoder] | 79.9 | 73.2 | 78.9 | 66.9 | MSFTResearch |
| WizardCoder-Python-34B-V1.0 | ๐ค HF Link | ๐ [WizardCoder] | 73.2 | 64.6 | 73.2 | 59.9 | Llama2 |
| WizardCoder-15B-V1.0 | ๐ค HF Link | ๐ [WizardCoder] | 59.8 | 52.4 | -- | -- | OpenRAIL-M |
| WizardCoder-Python-13B-V1.0 | ๐ค HF Link | ๐ [WizardCoder] | 64.0 | -- | -- | -- | Llama2 |
| WizardCoder-Python-7B-V1.0 | ๐ค HF Link | ๐ [WizardCoder] | 55.5 | -- | -- | -- | Llama2 |
| WizardCoder-3B-V1.0 | ๐ค HF Link | ๐ [WizardCoder] | 34.8 | -- | -- | -- | OpenRAIL-M |
| WizardCoder-1B-V1.0 | ๐ค HF Link | ๐ [WizardCoder] | 23.8 | -- | -- | -- | OpenRAIL-M |
- ๐ฅ [08/11/2023] We release WizardMath Models.
- ๐ฅ Our WizardMath-70B-V1.0 model slightly outperforms some closed-source LLMs on the GSM8K, including ChatGPT 3.5, Claude Instant 1 and PaLM 2 540B.
- ๐ฅ Our WizardMath-70B-V1.0 model achieves 81.6 pass@1 on the GSM8k Benchmarks, which is 24.8 points higher than the SOTA open-source LLM.
- ๐ฅ Our WizardMath-70B-V1.0 model achieves 22.7 pass@1 on the MATH Benchmarks, which is 9.2 points higher than the SOTA open-source LLM.
| Model | Checkpoint | Paper | GSM8k | MATH | Online Demo | License |
|---|---|---|---|---|---|---|
| WizardMath-70B-V1.0 | ๐ค HF Link | ๐ [WizardMath] | 81.6 | 22.7 | Demo | Llama 2 |
| WizardMath-13B-V1.0 | ๐ค HF Link | ๐ [WizardMath] | 63.9 | 14.0 | Demo | Llama 2 |
| WizardMath-7B-V1.0 | ๐ค HF Link | ๐ [WizardMath] | 54.9 | 10.7 | Demo | Llama 2 |
- Downloads last month
- 299
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ 1 Ask for provider support
Model tree for WizardLMTeam/WizardCoder-15B-V1.0
Spaces using WizardLMTeam/WizardCoder-15B-V1.0 100
Papers for WizardLMTeam/WizardCoder-15B-V1.0
Evaluation results
- pass@1 on HumanEvalself-reported0.573
