OpenCompass

mirror of https://github.com/open-compass/opencompass.git synced 2025-05-30 16:03:24 +08:00

History

Dongsheng Zhu 7a7a4517ab [Update] History code bench pass@k update (#2102 ) * bigcodebench * humaneval * humanevalx * humanevalx * livecodebench * mbpp * humaneval_plus * fix bug * template * max_out fix * template update		2025-05-19 17:03:33 +08:00
..
__init__.py	[Update] Update Fullbench (#1712 )	2024-11-26 14:26:55 +08:00
evaluator.py	[Update] History code bench pass@k update (#2102 )	2025-05-19 17:03:33 +08:00
execute_utils.py	[Feature] Support LiveCodeBench (#1617 )	2024-10-21 20:50:39 +08:00
extract_utils.py	[Update] Update Skywork/Qwen-QwQ (#1728 )	2024-12-05 19:30:43 +08:00
livecodebench.py	[Update] Code evaluation alignment (#1909 )	2025-03-04 18:49:38 +08:00
pass_k_utils.py	[Update] Add configurations for llmjudge dataset (#1940 )	2025-03-13 17:30:04 +08:00
prompts.py	[Feature] Support LiveCodeBench (#1617 )	2024-10-21 20:50:39 +08:00
testing_util.py	[Feature] Support LiveCodeBench (#1617 )	2024-10-21 20:50:39 +08:00