OpenCompass/opencompass/datasets/livecodebench
Linchen Xiao 408f5caff4
[Dataset] Add SuperGPQA subfield configs (#2124)
* update

* fix lint

* fix lint

* update precommit

* update precommit

* fix lint
2025-05-28 14:12:58 +08:00
..
__init__.py [Update] Update Fullbench (#1712) 2024-11-26 14:26:55 +08:00
evaluator.py [Update] History code bench pass@k update (#2102) 2025-05-19 17:03:33 +08:00
execute_utils.py [Dataset] Add SuperGPQA subfield configs (#2124) 2025-05-28 14:12:58 +08:00
extract_utils.py [Update] Update Skywork/Qwen-QwQ (#1728) 2024-12-05 19:30:43 +08:00
livecodebench.py [Update] Code evaluation alignment (#1909) 2025-03-04 18:49:38 +08:00
pass_k_utils.py [Update] Add configurations for llmjudge dataset (#1940) 2025-03-13 17:30:04 +08:00
prompts.py [Dataset] Add SuperGPQA subfield configs (#2124) 2025-05-28 14:12:58 +08:00
testing_util.py [Dataset] Add SuperGPQA subfield configs (#2124) 2025-05-28 14:12:58 +08:00