Songyang Zhang
|
c98599271b
|
[Update] Update OlympiadBench and Update LLM Judge (#1954)
|
2025-03-18 20:15:20 +08:00 |
|
Songyang Zhang
|
8fdb72f567
|
[Update] Update o1 eval prompt (#1806)
* Update XML prediction post-process
* Update LiveMathBench
* Update New O1 Evaluation
|
2025-01-07 00:14:32 +08:00 |
|
Songyang Zhang
|
fc0556ec8e
|
[Fix] Fix generic_llm_evaluator output_path (#1798)
* Fix output_path
* Add Logger
|
2024-12-31 13:05:05 +08:00 |
|
Songyang Zhang
|
98435dd98e
|
[Feature] Update o1 evaluation with JudgeLLM (#1795)
* Update Generic LLM Evaluator
* Update o1 style evaluator
|
2024-12-30 17:31:00 +08:00 |
|