OpenCompass/opencompass/utils
bittersweet1999 6ba1c4937d
[Feature] Support Math evaluation via judgemodel (#1094)
* support openai math evaluation

* support math llm judge
2024-04-26 14:56:23 +08:00
__init__.py [Feat] support humaneval and mbpp pass@k (#598) 2023-11-16 21:22:06 +08:00
abbr.py [Feature] Add multi-model judge and fix some problems (#1016) 2024-04-02 11:52:06 +08:00
auxiliary.py [Feat] support humaneval and mbpp pass@k (#598) 2023-11-16 21:22:06 +08:00
build.py [Fix] Fixed repeated loading of VLLM (#1051) 2024-04-17 20:36:08 +08:00
collect_env.py [Docs] add issue and pr template (#12) 2023-07-06 11:55:01 +08:00
dependency.py [Feature]: Use multimodal (#73) 2023-08-03 11:07:50 +08:00
file.py [Feature] Simplify entry script (#204) 2023-08-25 17:36:30 +08:00
fileio.py initial commit 2023-07-04 21:34:55 +08:00
lark.py [Feature] Several enhancements (#142) 2023-08-01 18:19:49 +08:00
logging.py [Enhance] Supress warning raised by get_logger (#353) 2023-09-04 15:27:08 +08:00
menu.py [Feat] Support local runner for windows (#515) 2023-10-27 17:16:22 +08:00
prompt.py [Sync] update taco (#1030) 2024-04-09 17:50:23 +08:00
run.py [Feature] Support Math evaluation via judgemodel (#1094) 2024-04-26 14:56:23 +08:00
text_postprocessors.py [Sync] update taco (#1030) 2024-04-09 17:50:23 +08:00
types.py [Sync] Initial support of subjective evaluation (#421) 2023-09-22 15:42:31 +08:00