OpenCompass

mirror of https://github.com/open-compass/opencompass.git synced 2025-05-30 16:03:24 +08:00

History

Junnan Liu 73c80953c6 [Feature] Support Dataset Repeat and G-Pass Compute for Each Evaluator (#1886 ) * support dataset repeat and g-pass compute for each evaluator * fix pre-commit errors * delete print * delete gpassk_evaluator and fix potential errors * change `repeat` to `n` * fix `repeat` to `n` in openicl_eval * update doc for multi-run and g-pass * update latex equation in doc * update eng doc for multi-run and g-pass * update datasets.md * update datasets.md * fix multi-line equation * fix multi-line equation * fix multi-line equation * fix multi-line equation * fix multi-line equation * fix multi-line equation * fix multi-line equation in zh_cn user_guides * mmodify pre-commit-zh-cn * recover pre-commit and edit math expr in doc * del [TIP] * del cite tag in doc * del extract_model param in livemathbench config		2025-02-26 19:43:12 +08:00
..
outer_eval	[Fix] fix alpacaeval while add caching path (#1139 )	2024-05-11 14:02:26 +08:00
__init__.py	[Deperecate] Remove multi-modal related stuff (#1072 )	2024-04-26 21:20:14 +08:00
base.py	[Update] Update Skywork/Qwen-QwQ (#1728 )	2024-12-05 19:30:43 +08:00
llm_eval.py	Add release contribution	2023-07-05 03:15:31 +00:00
openicl_attack.py	force register (#1311 )	2024-07-11 19:59:35 +08:00
openicl_eval.py	[Feature] Support Dataset Repeat and G-Pass Compute for Each Evaluator (#1886 )	2025-02-26 19:43:12 +08:00
openicl_infer.py	Update openicl_infer.py (#1308 )	2024-08-23 10:39:22 +08:00
subjective_eval.py	[Feature] Support subjective evaluation for reasoning model (#1868 )	2025-02-20 12:19:46 +08:00