OpenCompass/configs/datasets/subjective
bittersweet1999 1f9f728f22
[Feature] support compassbench Checklist evaluation (#1339)
* fix pip version

* fix pip version

* support checklist eval

* init

* add lan

* fix typo
2024-07-19 16:40:44 +08:00
alignbench [Refactor] Reorganize subjective eval (#1284) 2024-07-05 22:11:37 +08:00
alpaca_eval [Refactor] Reorganize subjective eval (#1284) 2024-07-05 22:11:37 +08:00
arena_hard [Fix] Change abbr for arenahard dataset (#1302) 2024-07-11 12:42:03 +08:00
compassarena [Refactor] Reorganize subjective eval (#1284) 2024-07-05 22:11:37 +08:00
compassbench [Feature] support compassbench Checklist evaluation (#1339) 2024-07-19 16:40:44 +08:00
creationbench [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
fofo [Refactor] Reorganize subjective eval (#1284) 2024-07-05 22:11:37 +08:00
multiround [Fix] add bc for alignbench summarizer (#1306) 2024-07-12 11:06:20 +08:00
subjective_cmp [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
wildbench Support wildbench (#1266) 2024-06-24 13:16:27 +08:00