OpenCompass

mirror of https://github.com/open-compass/opencompass.git synced 2025-05-30 16:03:24 +08:00

Yike Yuan a6552224cb [Feat] Support multi-modal evaluation on MME benchmark. (#197)	2023-08-21 15:53:20 +08:00
* [Feat] Support multi-modal evaluation on MME benchmark.
* [Fix] Remove debug code.
* [Fix] Remove redundant codes and add type hints.
* [Fix] Rename in config.
* [Fix] Rebase main.
* [Fix] Fix isort and yapf conflict.
datasets	[Dataset] LongBench (#236)	2023-08-21 14:15:20 +08:00
models	[Feat] Add codegeex2 and Humanevalx (#210)	2023-08-17 11:03:16 +08:00
multimodal	[Feat] Support multi-modal evaluation on MME benchmark. (#197)	2023-08-21 15:53:20 +08:00
summarizers	[Dataset] LongBench (#236)	2023-08-21 14:15:20 +08:00
eval_codegeex2.py	[Feat] Add codegeex2 and Humanevalx (#210)	2023-08-17 11:03:16 +08:00
eval_demo.py	[Doc] Update logo icon (#32)	2023-07-08 16:40:24 +08:00
eval_gpt3.5.py	[Feature] Calculate max_out_len without hard code for OpenAI model (#158)	2023-08-08 15:16:56 +08:00
eval_gpt4.py	[Enhancement] Add humaneval postprocessor for GPT models & eval config for GPT4, enhance the original humaneval postprocessor (#129)	2023-08-10 16:31:12 +08:00
eval_internlm_7b.py	Add models (#18)	2023-07-06 16:02:39 +08:00
eval_internlm_chat_7b_turbomind.py	[Feature] Support turbomind (#166)	2023-08-10 16:25:11 +08:00
eval_internLM.py	[Feature] Support intern lanuage model (#51)	2023-07-27 18:49:36 +08:00
eval_LEval.py	[Feature] Add LEval datasets	2023-08-11 17:38:31 +08:00
eval_llama_7b.py	[Feature] add llama-oriented dataset configs (#82)	2023-08-11 12:48:05 +08:00