OpenCompass/opencompass/utils
Mo Li b50d163265
[Fix] Refactor Needlebench Configs for CLI Testing Support (#1020)
* add needlebench datasets suffix

* fix import

* update run.py args for summarizer key and dataset suffix

* update utils/run.py
2024-04-07 15:12:56 +08:00
..
__init__.py [Feat] support humaneval and mbpp pass@k (#598) 2023-11-16 21:22:06 +08:00
abbr.py [Feature] Add multi-model judge and fix some problems (#1016) 2024-04-02 11:52:06 +08:00
auxiliary.py [Feat] support humaneval and mbpp pass@k (#598) 2023-11-16 21:22:06 +08:00
build.py [Sync] Add InternLM2 Keyset Evaluation Demo (#807) 2024-01-17 13:48:12 +08:00
collect_env.py [Docs] add issue and pr template (#12) 2023-07-06 11:55:01 +08:00
dependency.py [Feature]: Use multimodal (#73) 2023-08-03 11:07:50 +08:00
file.py [Feature] Simplify entry script (#204) 2023-08-25 17:36:30 +08:00
fileio.py initial commit 2023-07-04 21:34:55 +08:00
lark.py [Feature] Several enhancements (#142) 2023-08-01 18:19:49 +08:00
logging.py [Enhance] Supress warning raised by get_logger (#353) 2023-09-04 15:27:08 +08:00
menu.py [Feat] Support local runner for windows (#515) 2023-10-27 17:16:22 +08:00
prompt.py [Refactor] Move fix_id_list to Retriever (#442) 2023-10-07 12:53:41 +08:00
run.py [Fix] Refactor Needlebench Configs for CLI Testing Support (#1020) 2024-04-07 15:12:56 +08:00
text_postprocessors.py [Sync] Merge branch 'dev' into zfz/update-keyset-demo (#876) 2024-02-05 23:29:10 +08:00
types.py [Sync] Initial support of subjective evaluation (#421) 2023-09-22 15:42:31 +08:00