OpenCompass/opencompass/configs/datasets/OlympiadBench
Songyang Zhang fd6fbf01a2
[Update] Support AIME-24 Evaluation for DeepSeek-R1 series (#1888)
* Update

* Update

* Update

* Update
2025-02-25 20:34:41 +08:00
..
OlympiadBench_0shot_gen_be8b13.py [Feature] Support OlympiadBench Benchmark (#1841) 2025-01-24 10:00:01 +08:00
OlympiadBench_0shot_llmverify_gen_be8b13.py [Update] Support AIME-24 Evaluation for DeepSeek-R1 series (#1888) 2025-02-25 20:34:41 +08:00
OlympiadBench_categories.py [Feature] Support OlympiadBench Benchmark (#1841) 2025-01-24 10:00:01 +08:00