OpenCompass/opencompass/datasets/livecodebench
Dongsheng Zhu 7a7a4517ab
[Update] History code bench pass@k update (#2102)
* bigcodebench

* humaneval

* humanevalx

* humanevalx

* livecodebench

* mbpp

* humaneval_plus

* fix bug

* template

* max_out fix

* template update
2025-05-19 17:03:33 +08:00
..
__init__.py [Update] Update Fullbench (#1712) 2024-11-26 14:26:55 +08:00
evaluator.py [Update] History code bench pass@k update (#2102) 2025-05-19 17:03:33 +08:00
execute_utils.py [Feature] Support LiveCodeBench (#1617) 2024-10-21 20:50:39 +08:00
extract_utils.py [Update] Update Skywork/Qwen-QwQ (#1728) 2024-12-05 19:30:43 +08:00
livecodebench.py [Update] Code evaluation alignment (#1909) 2025-03-04 18:49:38 +08:00
pass_k_utils.py [Update] Add configurations for llmjudge dataset (#1940) 2025-03-13 17:30:04 +08:00
prompts.py [Feature] Support LiveCodeBench (#1617) 2024-10-21 20:50:39 +08:00
testing_util.py [Feature] Support LiveCodeBench (#1617) 2024-10-21 20:50:39 +08:00