OpenCompass/opencompass/tasks
Junnan Liu 73c80953c6
[Feature] Support Dataset Repeat and G-Pass Compute for Each Evaluator (#1886)
* support dataset repeat and g-pass compute for each evaluator

* fix pre-commit errors

* delete print

* delete gpassk_evaluator and fix potential errors

* change `repeat` to `n`

* fix `repeat` to `n` in openicl_eval

* update doc for multi-run and g-pass

* update latex equation in doc

* update eng doc for multi-run and g-pass

* update datasets.md

* update datasets.md

* fix multi-line equation

* fix multi-line equation

* fix multi-line equation

* fix multi-line equation

* fix multi-line equation

* fix multi-line equation

* fix multi-line equation in zh_cn user_guides

* mmodify pre-commit-zh-cn

* recover pre-commit and edit math expr in doc

* del [TIP]

* del cite tag in doc

* del extract_model param in livemathbench config
2025-02-26 19:43:12 +08:00
..
outer_eval [Fix] fix alpacaeval while add caching path (#1139) 2024-05-11 14:02:26 +08:00
__init__.py [Deperecate] Remove multi-modal related stuff (#1072) 2024-04-26 21:20:14 +08:00
base.py [Update] Update Skywork/Qwen-QwQ (#1728) 2024-12-05 19:30:43 +08:00
llm_eval.py Add release contribution 2023-07-05 03:15:31 +00:00
openicl_attack.py force register (#1311) 2024-07-11 19:59:35 +08:00
openicl_eval.py [Feature] Support Dataset Repeat and G-Pass Compute for Each Evaluator (#1886) 2025-02-26 19:43:12 +08:00
openicl_infer.py Update openicl_infer.py (#1308) 2024-08-23 10:39:22 +08:00
subjective_eval.py [Feature] Support subjective evaluation for reasoning model (#1868) 2025-02-20 12:19:46 +08:00