OpenCompass

mirror of https://github.com/open-compass/opencompass.git synced 2025-05-30 16:03:24 +08:00

History

bittersweet1999 fa54aa62f6 [Feature] Add Judgerbench and reorg subeval (#1593 ) * fix pip version * fix pip version * update (#1522) Co-authored-by: zhulin1 <zhulin1@pjlab.org.cn> * [Feature] Update Models (#1518) * Update Models * Update * Update humanevalx * Update * Update * [Feature] Dataset prompts update for ARC, BoolQ, Race (#1527) add judgerbench and reorg sub add judgerbench and reorg subeval add judgerbench and reorg subeval * add judgerbench and reorg subeval * add judgerbench and reorg subeval * add judgerbench and reorg subeval * add judgerbench and reorg subeval --------- Co-authored-by: zhulinJulia24 <145004780+zhulinJulia24@users.noreply.github.com> Co-authored-by: zhulin1 <zhulin1@pjlab.org.cn> Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com> Co-authored-by: Linchen Xiao <xxllcc1993@gmail.com>		2024-10-15 16:36:05 +08:00
..
postprocessors	[Feature] Add custom model postprocess function (#1519 )	2024-09-18 14:40:51 +08:00
__init__.py	[Feature] Add Judgerbench and reorg subeval (#1593 )	2024-10-15 16:36:05 +08:00
abbr.py	[Feature] Add multi-model judge and fix some problems (#1016 )	2024-04-02 11:52:06 +08:00
auxiliary.py	[Feat] support humaneval and mbpp pass@k (#598 )	2023-11-16 21:22:06 +08:00
build.py	[Feature] Add huggingface apply_chat_template (#1098 )	2024-05-14 14:50:16 +08:00
collect_env.py	[Feature] Update pip install (#1324 )	2024-07-29 18:32:50 +08:00
datasets_info.py	[Feature] Update Models (#1518 )	2024-09-12 23:35:30 +08:00
datasets.py	[Feature] Add Ruler datasets (#1310 )	2024-08-20 11:40:11 +08:00
dependency.py	[Feature]: Use multimodal (#73 )	2023-08-03 11:07:50 +08:00
dict_postprocessors.py	[Feature] Add Judgerbench and reorg subeval (#1593 )	2024-10-15 16:36:05 +08:00
file.py	fix output typing, change mutable list to immutable tuple (#989 )	2024-04-26 23:07:34 +08:00
fileio.py	[Fix] Fix Slurm ENV (#1392 )	2024-08-06 01:35:20 +08:00
lark.py	[Feature] Several enhancements (#142 )	2023-08-01 18:19:49 +08:00
logging.py	[Enhance] Supress warning raised by get_logger (#353 )	2023-09-04 15:27:08 +08:00
menu.py	[Feat] Support local runner for windows (#515 )	2023-10-27 17:16:22 +08:00
model_postprocessors.py	[Feature] Add GaoKaoMath Dataset for Evaluation & MATH Model Eval Config (#1589 )	2024-10-12 19:13:06 +08:00
prompt.py	Support wildbench (#1266 )	2024-06-24 13:16:27 +08:00
run.py	[Feature] Integrate lmdeploy pipeline api (#1198 )	2024-10-09 22:58:06 +08:00
text_postprocessors.py	[Feature] Fullbench v0.1 language update (#1463 )	2024-08-28 14:01:05 +08:00
types.py	[Sync] Initial support of subjective evaluation (#421 )	2023-09-22 15:42:31 +08:00