OpenCompass/opencompass/summarizers/subjective
bittersweet1999 6ba1c4937d
[Feature] Support Math evaluation via judgemodel (#1094)
* support openai math evaluation

* support openai math evaluation

* support openai math evaluation

* support math llm judge

* support math llm judge
2024-04-26 14:56:23 +08:00
..
__init__.py [Feature] Support Math evaluation via judgemodel (#1094) 2024-04-26 14:56:23 +08:00
alignmentbench.py [Sync] update taco (#1030) 2024-04-09 17:50:23 +08:00
all_obj.py [Feature] Support Math evaluation via judgemodel (#1094) 2024-04-26 14:56:23 +08:00
alpacaeval.py [Feature] Add multi-model judge and fix some problems (#1016) 2024-04-02 11:52:06 +08:00
compass_arena.py [Sync] deprecate old mbpps (#1064) 2024-04-19 20:49:46 +08:00
corev2.py reorganize subject files (#801) 2024-01-16 18:03:11 +08:00
creationbench.py reorganize subject files (#801) 2024-01-16 18:03:11 +08:00
information_retrival.py reorganize subject files (#801) 2024-01-16 18:03:11 +08:00
mtbench.py [Sync] deprecate old mbpps (#1064) 2024-04-19 20:49:46 +08:00
multiround.py [Fix] Fix MultiRound Subjective Evaluation(#1043) 2024-04-22 12:06:03 +08:00
subjective_post_process.py reorganize subject files (#801) 2024-01-16 18:03:11 +08:00
utils.py fix compass arena (#854) 2024-01-30 16:34:38 +08:00