OpenCompass/opencompass/configs/datasets/humaneval_pro
Dongsheng Zhu d939e32438 add bench
2025-05-09 02:36:39 +00:00
..
humaneval_pro_gen_3dc067.py add bench 2025-05-09 02:36:39 +00:00
humaneval_pro.py add bench 2025-05-09 02:36:39 +00:00
README.md add bench 2025-05-09 02:36:39 +00:00

HumanEval pro

OC results

model pass@1
qwen2.5-coder-7b-instruct-hf 65
qwen2.5-14b-instruct-hf 67
deepseek-v2-lite-chat-hf 35

CodeEval-pro results

model pass@1
qwen2.5-coder-7b-instruct-hf 65
qwen2.5-14b-instruct-hf 65
deepseek-v2-lite-chat-hf 28