OpenCompass/opencompass/tasks
bittersweet1999 2ee8e8a1a1
[Feature] add mtbench (#829)
* add mtbench

* add mtbench

* Update configs/datasets/subjective/multiround/mtbench_judgeby_gpt4.py

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

* Update configs/datasets/subjective/multiround/mtbench_judgeby_gpt4.py

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

* Update opencompass/datasets/subjective/__init__.py

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

* Update opencompass/datasets/subjective/mtbench.py

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

* fix mtbench

---------

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
2024-01-24 12:11:47 +08:00
..
__init__.py [Feat] implementation for support promptbench (#239) 2023-09-15 15:06:53 +08:00
base.py [Fix] Use a copy of the config object in Task (#174) 2023-08-09 15:24:49 +08:00
llm_eval.py Add release contribution 2023-07-05 03:15:31 +00:00
mm_infer.py [Feature] Update xunfei api (#572) 2023-11-10 22:46:06 +08:00
openicl_attack.py [Feat] implementation for support promptbench (#239) 2023-09-15 15:06:53 +08:00
openicl_eval.py [Sync] Sync with internal codes 2023.01.08 (#777) 2024-01-08 14:07:24 +00:00
openicl_infer.py [Sync] Add InternLM2 Keyset Evaluation Demo (#807) 2024-01-17 13:48:12 +08:00
subjective_eval.py [Feature] add mtbench (#829) 2024-01-24 12:11:47 +08:00