OpenCompass/opencompass/tasks
bittersweet1999 2d4e559763
[Feature] Add multi-model judge and fix some problems (#1016)
* support multi-model judge and moe judge

* test_moe

* test_moe

* test

* add moe judge

* support multi-judge-model
2024-04-02 11:52:06 +08:00
..
outer_eval [Feature] Support AlpacaEval_V2 (#1006) 2024-03-28 16:49:04 +08:00
__init__.py [Feat] implementation for support promptbench (#239) 2023-09-15 15:06:53 +08:00
base.py [Fix] Use a copy of the config object in Task (#174) 2023-08-09 15:24:49 +08:00
llm_eval.py Add release contribution 2023-07-05 03:15:31 +00:00
mm_infer.py [Feature] Update xunfei api (#572) 2023-11-10 22:46:06 +08:00
openicl_attack.py [Feat] implementation for support promptbench (#239) 2023-09-15 15:06:53 +08:00
openicl_eval.py [Sync] Merge branch 'dev' into zfz/update-keyset-demo (#876) 2024-02-05 23:29:10 +08:00
openicl_infer.py [Sync] Add InternLM2 Keyset Evaluation Demo (#807) 2024-01-17 13:48:12 +08:00
subjective_eval.py [Feature] Add multi-model judge and fix some problems (#1016) 2024-04-02 11:52:06 +08:00