OpenCompass/opencompass/datasets/livemathbench
Songyang Zhang 8fdb72f567
[Update] Update o1 eval prompt (#1806)
* Update XML prediction post-process

* Update LiveMathBench

* Update LiveMathBench

* Update New O1 Evaluation
2025-01-07 00:14:32 +08:00
..
__init__.py [Feature] Support LiveMathBench (#1727) 2024-11-30 00:07:19 +08:00
livemathbench.py [Update] Update o1 eval prompt (#1806) 2025-01-07 00:14:32 +08:00
prompts.py [Feature] Support LiveMathBench (#1727) 2024-11-30 00:07:19 +08:00
utils.py [Feature] Support G-Pass@k and LiveMathBench (#1772) 2024-12-30 16:59:39 +08:00