OpenCompass/opencompass/configs/datasets/OlympiadBench
Songyang Zhang c84bc18ac1
[Update] Support OlympiadBench-Math/OmniMath/LiveMathBench-Hard (#1899)
* [Update] Support OlympiadBench-Math/OmniMath/LiveMathBench-Hard with LLM Verify

* Update

* Update

* Update DeepSeek-R1 example

* Update DeepSeek-R1 example

* Update DeepSeek-R1 example
2025-03-03 18:56:11 +08:00
..
OlympiadBench_0shot_gen_be8b13.py [Feature] Support OlympiadBench Benchmark (#1841) 2025-01-24 10:00:01 +08:00
OlympiadBench_0shot_llmverify_gen_be8b13.py [Update] Support AIME-24 Evaluation for DeepSeek-R1 series (#1888) 2025-02-25 20:34:41 +08:00
OlympiadBench_categories.py [Update] Support OlympiadBench-Math/OmniMath/LiveMathBench-Hard (#1899) 2025-03-03 18:56:11 +08:00
OlympiadBenchMath_0shot_llmverify_gen_9c22f2.py [Update] Support OlympiadBench-Math/OmniMath/LiveMathBench-Hard (#1899) 2025-03-03 18:56:11 +08:00