OpenCompass/opencompass/summarizers/subjective
bittersweet1999 e404b72c52
[Feature] support arenahard evaluation (#1096)
* support arenahard

* support arenahard

* support arenahard
2024-04-26 15:42:00 +08:00
..
__init__.py [Feature] support arenahard evaluation (#1096) 2024-04-26 15:42:00 +08:00
alignmentbench.py [Sync] update taco (#1030) 2024-04-09 17:50:23 +08:00
all_obj.py [Feature] Support Math evaluation via judgemodel (#1094) 2024-04-26 14:56:23 +08:00
alpacaeval.py [Feature] Add multi-model judge and fix some problems (#1016) 2024-04-02 11:52:06 +08:00
arenahard.py [Feature] support arenahard evaluation (#1096) 2024-04-26 15:42:00 +08:00
compass_arena.py [Sync] deprecate old mbpps (#1064) 2024-04-19 20:49:46 +08:00
corev2.py reorganize subject files (#801) 2024-01-16 18:03:11 +08:00
creationbench.py reorganize subject files (#801) 2024-01-16 18:03:11 +08:00
information_retrival.py reorganize subject files (#801) 2024-01-16 18:03:11 +08:00
mtbench.py [Sync] deprecate old mbpps (#1064) 2024-04-19 20:49:46 +08:00
multiround.py [Fix] Fix MultiRound Subjective Evaluation(#1043) 2024-04-22 12:06:03 +08:00
subjective_post_process.py reorganize subject files (#801) 2024-01-16 18:03:11 +08:00
utils.py fix compass arena (#854) 2024-01-30 16:34:38 +08:00