OpenCompass

mirror of https://github.com/open-compass/opencompass.git synced 2025-05-30 16:03:24 +08:00

Junnan Liu 73c80953c6 [Feature] Support Dataset Repeat and G-Pass Compute for Each Evaluator (#1886)	2025-02-26 19:43:12 +08:00
* support dataset repeat and g-pass compute for each evaluator
* fix pre-commit errors
* delete print
* delete gpassk_evaluator and fix potential errors
* change `repeat` to `n`
* fix `repeat` to `n` in openicl_eval
* update doc for multi-run and g-pass
* update latex equation in doc
* update eng doc for multi-run and g-pass
* update datasets.md
* update datasets.md
* fix multi-line equation
* fix multi-line equation
* fix multi-line equation
* fix multi-line equation
* fix multi-line equation
* fix multi-line equation
* fix multi-line equation in zh_cn user_guides
* mmodify pre-commit-zh-cn
* recover pre-commit and edit math expr in doc
* del [TIP]
* del cite tag in doc
* del extract_model param in livemathbench config
hf_metrics	[Feature] Use local accuracy from hf implements (#416)	2023-09-20 16:35:22 +08:00
__init__.py	[Feature] Support Dataset Repeat and G-Pass Compute for Each Evaluator (#1886)	2025-02-26 19:43:12 +08:00
icl_agent_evaluator.py	Support GSM8k evaluation with tools by Lagent and LangChain (#277)	2023-09-22 15:28:22 +08:00
icl_aucroc_evaluator.py	[Enhancement] Test linting in CI and fix existing linting errors (#69)	2023-07-17 15:59:10 +08:00
icl_base_evaluator.py	[Feature] Support Dataset Repeat and G-Pass Compute for Each Evaluator (#1886)	2025-02-26 19:43:12 +08:00
icl_bpc_evaluator.py	[Feature] Adding support for LLM Compression Evaluation (#1108)	2024-04-30 10:51:01 +08:00
icl_circular_evaluator.py	[Sync] update configs (#734)	2023-12-25 21:59:16 +08:00
icl_em_evaluator.py	[Sync] update (#517)	2023-10-27 20:31:22 +08:00
icl_hf_evaluator.py	[Sync] update evaluator (#1175)	2024-05-21 14:22:46 +08:00
icl_jieba_rouge_evaluator.py	fix jieba rouge (#467)	2023-10-12 10:25:19 +08:00
icl_korbench_evaluator.py	[Feature] Add Korbench dataset (#1713)	2024-11-25 20:11:27 +08:00
icl_misc_evaluator.py	[Feature] Support inference ppl datasets (#1315)	2024-07-22 17:59:30 +08:00
icl_plugin_evaluator.py	[Sync] Fix TEvalEvaluator (#929)	2024-02-28 16:05:30 +08:00
icl_toxic_evaluator.py	[Fix] Fix CI (#70)	2023-07-17 19:10:59 +08:00
lm_evaluator.py	[Feature] Support subjective evaluation for reasoning model (#1868)	2025-02-20 12:19:46 +08:00
math_evaluator.py	[Feature] Math Verify with model post_processor (#1881)	2025-02-20 19:32:12 +08:00