OpenCompass/configs
Yike Yuan a6552224cb
[Feat] Support multi-modal evaluation on MME benchmark. (#197)
* [Feat] Support multi-modal evaluation on MME benchmark.

* [Fix] Remove debug code.

* [Fix] Remove redundant codes and add type hints.

* [Fix] Rename in config.

* [Fix] Rebase main.

* [Fix] Fix isort and yapf conflict.
2023-08-21 15:53:20 +08:00
..
datasets [Dataset] LongBench (#236) 2023-08-21 14:15:20 +08:00
models [Feat] Add codegeex2 and Humanevalx (#210) 2023-08-17 11:03:16 +08:00
multimodal [Feat] Support multi-modal evaluation on MME benchmark. (#197) 2023-08-21 15:53:20 +08:00
summarizers [Dataset] LongBench (#236) 2023-08-21 14:15:20 +08:00
eval_codegeex2.py [Feat] Add codegeex2 and Humanevalx (#210) 2023-08-17 11:03:16 +08:00
eval_demo.py [Doc] Update logo icon (#32) 2023-07-08 16:40:24 +08:00
eval_gpt3.5.py [Feature] Calculate max_out_len without hard code for OpenAI model (#158) 2023-08-08 15:16:56 +08:00
eval_gpt4.py [Enhancement] Add humaneval postprocessor for GPT models & eval config for GPT4, enhance the original humaneval postprocessor (#129) 2023-08-10 16:31:12 +08:00
eval_internlm_7b.py Add models (#18) 2023-07-06 16:02:39 +08:00
eval_internlm_chat_7b_turbomind.py [Feature] Support turbomind (#166) 2023-08-10 16:25:11 +08:00
eval_internLM.py [Feature] Support intern lanuage model (#51) 2023-07-27 18:49:36 +08:00
eval_LEval.py [Feature] Add LEval datasets 2023-08-11 17:38:31 +08:00
eval_llama_7b.py [Feature] add llama-oriented dataset configs (#82) 2023-08-11 12:48:05 +08:00