OpenCompass/opencompass/openicl
Junnan Liu 73c80953c6
[Feature] Support Dataset Repeat and G-Pass Compute for Each Evaluator (#1886)
* support dataset repeat and g-pass compute for each evaluator

* fix pre-commit errors

* delete print

* delete gpassk_evaluator and fix potential errors

* change `repeat` to `n`

* fix `repeat` to `n` in openicl_eval

* update doc for multi-run and g-pass

* update latex equation in doc

* update eng doc for multi-run and g-pass

* update datasets.md

* update datasets.md

* fix multi-line equation

* fix multi-line equation

* fix multi-line equation

* fix multi-line equation

* fix multi-line equation

* fix multi-line equation

* fix multi-line equation in zh_cn user_guides

* mmodify pre-commit-zh-cn

* recover pre-commit and edit math expr in doc

* del [TIP]

* del cite tag in doc

* del extract_model param in livemathbench config
2025-02-26 19:43:12 +08:00
..
icl_evaluator [Feature] Support Dataset Repeat and G-Pass Compute for Each Evaluator (#1886) 2025-02-26 19:43:12 +08:00
icl_inferencer [Fix] Compatible with old versions (#1616) 2024-10-21 10:16:29 +08:00
icl_retriever [Update] Support AIME-24 Evaluation for DeepSeek-R1 series (#1888) 2025-02-25 20:34:41 +08:00
utils [Enhancement] Test linting in CI and fix existing linting errors (#69) 2023-07-17 15:59:10 +08:00
__init__.py [Enhancement] Test linting in CI and fix existing linting errors (#69) 2023-07-17 15:59:10 +08:00
icl_dataset_reader.py [Sync] Initial support of subjective evaluation (#421) 2023-09-22 15:42:31 +08:00
icl_prompt_template.py [Sync] update taco (#1030) 2024-04-09 17:50:23 +08:00