OpenCompass

mirror of https://github.com/open-compass/opencompass.git synced 2025-05-30 16:03:24 +08:00

History

bittersweet1999 6ba1c4937d [Feature] Support Math evaluation via judgemodel (#1094 ) * support openai math evaluation * support openai math evaluation * support openai math evaluation * support math llm judge * support math llm judge		2024-04-26 14:56:23 +08:00
..
__init__.py	[Feature] Support Math evaluation via judgemodel (#1094 )	2024-04-26 14:56:23 +08:00
alignmentbench.py	[Sync] update taco (#1030 )	2024-04-09 17:50:23 +08:00
all_obj.py	[Feature] Support Math evaluation via judgemodel (#1094 )	2024-04-26 14:56:23 +08:00
alpacaeval.py	[Feature] Add multi-model judge and fix some problems (#1016 )	2024-04-02 11:52:06 +08:00
compass_arena.py	[Sync] deprecate old mbpps (#1064 )	2024-04-19 20:49:46 +08:00
corev2.py	reorganize subject files (#801 )	2024-01-16 18:03:11 +08:00
creationbench.py	reorganize subject files (#801 )	2024-01-16 18:03:11 +08:00
information_retrival.py	reorganize subject files (#801 )	2024-01-16 18:03:11 +08:00
mtbench.py	[Sync] deprecate old mbpps (#1064 )	2024-04-19 20:49:46 +08:00
multiround.py	[Fix] Fix MultiRound Subjective Evaluation(#1043 )	2024-04-22 12:06:03 +08:00
subjective_post_process.py	reorganize subject files (#801 )	2024-01-16 18:03:11 +08:00
utils.py	fix compass arena (#854 )	2024-01-30 16:34:38 +08:00