OpenCompass/opencompass
bittersweet1999 2ee8e8a1a1
[Feature] add mtbench (#829)
* add mtbench

* add mtbench

* Update configs/datasets/subjective/multiround/mtbench_judgeby_gpt4.py

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

* Update configs/datasets/subjective/multiround/mtbench_judgeby_gpt4.py

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

* Update opencompass/datasets/subjective/__init__.py

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

* Update opencompass/datasets/subjective/mtbench.py

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

* fix mtbench

---------

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
2024-01-24 12:11:47 +08:00
datasets [Feature] add mtbench (#829) 2024-01-24 12:11:47 +08:00
lagent [Sync] Add InternLM2 Keyset Evaluation Demo (#807) 2024-01-17 13:48:12 +08:00
metrics [Feat] Support multi-modal evaluation on MME benchmark. (#197) 2023-08-21 15:53:20 +08:00
models Add LightllmApi KeyError log & Update doc (#816) 2024-01-18 22:23:38 +08:00
multimodal [Feature]: To be compatible with the latest version of MiniGPT-4 (#539) 2023-11-04 09:50:36 +08:00
openicl [Feature] add mtbench (#829) 2024-01-24 12:11:47 +08:00
partitioners [Sync] Sync with internal codes 2023.01.08 (#777) 2024-01-08 14:07:24 +00:00
runners [Sync] Add InternLM2 Keyset Evaluation Demo (#807) 2024-01-17 13:48:12 +08:00
summarizers [Feature] add mtbench (#829) 2024-01-24 12:11:47 +08:00
tasks [Feature] add mtbench (#829) 2024-01-24 12:11:47 +08:00
utils [Sync] Add InternLM2 Keyset Evaluation Demo (#807) 2024-01-17 13:48:12 +08:00
__init__.py [Sync] Bump version to 0.2.1 (#778) 2024-01-08 14:56:28 +00:00
registry.py [Sync] update github token (#475) 2023-10-13 06:50:54 -05:00