OpenCompass/opencompass/configs/datasets/cmmlu
Songyang Zhang 0d8df541bc
[Update] Update O1-style Benchmark and Prompts (#1742)
* Update JuderBench

* Support O1-style Prompts

* Update Code

* Update OpenAI

* Update BigCodeBench

* Update BigCodeBench

* Update BigCodeBench

* Update BigCodeBench

* Update BigCodeBench

* Update

* Update

* Update

* Update
2024-12-09 13:48:56 +08:00
..
cmmlu_0shot_cot_gen_305931.py [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
cmmlu_0shot_nocot_llmjudge_gen_e1cd9a.py [Update] Update Skywork/Qwen-QwQ (#1728) 2024-12-05 19:30:43 +08:00
cmmlu_gen_c13365.py [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
cmmlu_gen.py [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
cmmlu_ppl_8b9c76.py [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
cmmlu_ppl_041cbf.py [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
cmmlu_ppl.py [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
cmmlu_stem_0shot_nocot_llmjudge_gen_3653db.py [Update] Update O1-style Benchmark and Prompts (#1742) 2024-12-09 13:48:56 +08:00