OpenCompass/opencompass/configs/datasets/OlympiadBench
2025-05-14 10:17:34 +00:00
..
OlympiadBench_0shot_cascade_eval_gen_be8b13.py Update CascadeEvaluator 2025-05-14 10:17:34 +00:00
OlympiadBench_0shot_gen_be8b13.py [Feature] Support OlympiadBench Benchmark (#1841) 2025-01-24 10:00:01 +08:00
OlympiadBench_0shot_llmverify_gen_be8b13.py [Update] Support AIME-24 Evaluation for DeepSeek-R1 series (#1888) 2025-02-25 20:34:41 +08:00
OlympiadBench_categories.py [Update] Support OlympiadBench-Math/OmniMath/LiveMathBench-Hard (#1899) 2025-03-03 18:56:11 +08:00
OlympiadBenchMath_0shot_llmverify_gen_9c22f2.py [Update] Support OlympiadBench-Math/OmniMath/LiveMathBench-Hard (#1899) 2025-03-03 18:56:11 +08:00