OpenCompass

mirror of https://github.com/open-compass/opencompass.git synced 2025-05-30 16:03:24 +08:00

History

Junnan Liu 73c80953c6 [Feature] Support Dataset Repeat and G-Pass Compute for Each Evaluator (#1886 ) * support dataset repeat and g-pass compute for each evaluator * fix pre-commit errors * delete print * delete gpassk_evaluator and fix potential errors * change `repeat` to `n` * fix `repeat` to `n` in openicl_eval * update doc for multi-run and g-pass * update latex equation in doc * update eng doc for multi-run and g-pass * update datasets.md * update datasets.md * fix multi-line equation * fix multi-line equation * fix multi-line equation * fix multi-line equation * fix multi-line equation * fix multi-line equation * fix multi-line equation in zh_cn user_guides * mmodify pre-commit-zh-cn * recover pre-commit and edit math expr in doc * del [TIP] * del cite tag in doc * del extract_model param in livemathbench config		2025-02-26 19:43:12 +08:00
..
icl_evaluator	[Feature] Support Dataset Repeat and G-Pass Compute for Each Evaluator (#1886 )	2025-02-26 19:43:12 +08:00
icl_inferencer	[Fix] Compatible with old versions (#1616 )	2024-10-21 10:16:29 +08:00
icl_retriever	[Update] Support AIME-24 Evaluation for DeepSeek-R1 series (#1888 )	2025-02-25 20:34:41 +08:00
utils	[Enhancement] Test linting in CI and fix existing linting errors (#69 )	2023-07-17 15:59:10 +08:00
__init__.py	[Enhancement] Test linting in CI and fix existing linting errors (#69 )	2023-07-17 15:59:10 +08:00
icl_dataset_reader.py	[Sync] Initial support of subjective evaluation (#421 )	2023-09-22 15:42:31 +08:00
icl_prompt_template.py	[Sync] update taco (#1030 )	2024-04-09 17:50:23 +08:00