OpenCompass/opencompass/configs/datasets/OlympiadBench
2025-02-26 15:29:19 +00:00
File | Last commit | Date
OlympiadBench_0shot_gen_be8b13.py | [Feature] Support OlympiadBench Benchmark (#1841) | 2025-01-24 10:00:01 +08:00
OlympiadBench_0shot_llmverify_gen_be8b13.py | [Update] Support AIME-24 Evaluation for DeepSeek-R1 series (#1888) | 2025-02-25 20:34:41 +08:00
OlympiadBench_categories.py | [Update] Support OlympiadBench-Math/OmniMath/LiveMathBench-Hard with LLM Verify | 2025-02-26 15:29:19 +00:00
OlympiadBenchMath_0shot_llmverify_gen_9c22f2.py | [Update] Support OlympiadBench-Math/OmniMath/LiveMathBench-Hard with LLM Verify | 2025-02-26 15:29:19 +00:00