OpenCompass/configs/datasets/math
liushz a6f67e1a65
[Fix] Fix Math Evaluation with Judge Model Evaluator & Add README (#1103)
* Add Math Evaluation with Judge Model Evaluator

* Fix Llama-3 meta template

* Fix MATH with JudgeLM Evaluation

---------

Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>
2024-04-28 21:58:58 +08:00
deprecated_math_agent_evaluatorv2_gen_861b4f.py [Sync] Sync Internal (#941) 2024-03-04 14:42:36 +08:00
deprecated_math_evaluatorv2_gen_265cce.py [Sync] update taco (#1030) 2024-04-09 17:50:23 +08:00
math_0shot_gen_393424.py [Sync] update taco (#1030) 2024-04-09 17:50:23 +08:00
math_agent_evaluatorv2_gen_0c1b4e.py [Sync] Sync with internal codes 2023.01.08 (#777) 2024-01-08 14:07:24 +00:00
math_agent_gen_0c1b4e.py [Sync] Sync with internal codes 2023.01.08 (#777) 2024-01-08 14:07:24 +00:00
math_agent_gen_861b4f.py [Sync] minor test (#683) 2023-12-11 17:42:53 +08:00
math_agent_gen_af2293.py [Feat] Update math/agent (#716) 2023-12-19 21:20:42 +08:00
math_evaluatorv2_gen_9d2049.py [Sync] Sync Internal (#941) 2024-03-04 14:42:36 +08:00
math_evaluatorv2_gen_cecb31.py [Sync] update taco (#1030) 2024-04-09 17:50:23 +08:00
math_gen_1ed9c2.py [Sync] Sync Internal (#941) 2024-03-04 14:42:36 +08:00
math_gen_5e8458.py Update configs (#9) 2023-07-06 12:27:41 +08:00
math_gen_78ced2.py [Fix] Fix Math Evaluation with Judge Model Evaluator & Add README (#1103) 2024-04-28 21:58:58 +08:00
math_gen_265cce.py [Sync] Sync Internal (#941) 2024-03-04 14:42:36 +08:00
math_gen_943d32.py [Feat] Support cibench (#538) 2023-11-07 19:11:44 +08:00
math_gen_0957ff.py [Sync] Sync Internal (#941) 2024-03-04 14:42:36 +08:00
math_gen_559593.py Update configs (#9) 2023-07-06 12:27:41 +08:00
math_gen_736506.py [Sync] Updata dataset cfg for internMath (#837) 2024-01-24 16:30:32 +08:00
math_gen.py Update configs (#9) 2023-07-06 12:27:41 +08:00
math_intern_evaluator_gen_265cce.py [Sync] Sync Internal (#941) 2024-03-04 14:42:36 +08:00
math_llm_judge.py [Fix] Fix Math Evaluation with Judge Model Evaluator & Add README (#1103) 2024-04-28 21:58:58 +08:00