OpenCompass/opencompass/configs/datasets/OlympiadBench
Songyang Zhang aa2b89b6f8
[Update] Add CascadeEvaluator with Data Replica (#2022)
* Update CascadeEvaluator

* Update CascadeEvaluator

* Update CascadeEvaluator

* Update Config

* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Update

* Update
2025-05-20 16:46:55 +08:00
..
OlympiadBench_0shot_cascade_eval_gen_be8b13.py [Update] Add CascadeEvaluator with Data Replica (#2022) 2025-05-20 16:46:55 +08:00
OlympiadBench_0shot_gen_be8b13.py [Feature] Support OlympiadBench Benchmark (#1841) 2025-01-24 10:00:01 +08:00
OlympiadBench_0shot_llmverify_gen_be8b13.py [Update] Support AIME-24 Evaluation for DeepSeek-R1 series (#1888) 2025-02-25 20:34:41 +08:00
OlympiadBench_categories.py [Update] Support OlympiadBench-Math/OmniMath/LiveMathBench-Hard (#1899) 2025-03-03 18:56:11 +08:00
OlympiadBenchMath_0shot_llmverify_gen_9c22f2.py [Update] Support OlympiadBench-Math/OmniMath/LiveMathBench-Hard (#1899) 2025-03-03 18:56:11 +08:00