OpenCompass

mirror of https://github.com/open-compass/opencompass.git synced 2025-05-30 16:03:24 +08:00

History

Yike Yuan 8d368d1cd6 [Feat] Support visualglm and llava for MMBench evaluation. (#211 ) * [Feat] Support visualglm inference on MMBench. * [Feat] Support llava inference on MMBench. * [Fix] Fix pre-commit format. * [Fix] Add docstring for llava * [Fix] Fix multi-process inference error of LlaVA and add comments. 1. Set `low_cpu_mem_usage` to False to address device issue. 2. Add docstring and type hints. 3. Rename class and remove registry. * [Fix] Pre-commit fix. * [Fix] add forward entry, add dynamic import to seedbench * [Fix] Fix pre-commit. * [Fix] Fix missing context. * [Fix] Fix docstring.		2023-08-21 15:57:30 +08:00
..
datasets	[Dataset] LongBench (#236 )	2023-08-21 14:15:20 +08:00
models	[Feat] Add codegeex2 and Humanevalx (#210 )	2023-08-17 11:03:16 +08:00
multimodal	[Feat] Support visualglm and llava for MMBench evaluation. (#211 )	2023-08-21 15:57:30 +08:00
summarizers	[Dataset] LongBench (#236 )	2023-08-21 14:15:20 +08:00
eval_codegeex2.py	[Feat] Add codegeex2 and Humanevalx (#210 )	2023-08-17 11:03:16 +08:00
eval_demo.py	[Doc] Update logo icon (#32 )	2023-07-08 16:40:24 +08:00
eval_gpt3.5.py	[Feature] Calculate max_out_len without hard code for OpenAI model (#158 )	2023-08-08 15:16:56 +08:00
eval_gpt4.py	[Enhancement] Add humaneval postprocessor for GPT models & eval config for GPT4, enhance the original humaneval postprocessor (#129 )	2023-08-10 16:31:12 +08:00
eval_internlm_7b.py	Add models (#18 )	2023-07-06 16:02:39 +08:00
eval_internlm_chat_7b_turbomind.py	[Feature] Support turbomind (#166 )	2023-08-10 16:25:11 +08:00
eval_internLM.py	[Feature] Support intern lanuage model (#51 )	2023-07-27 18:49:36 +08:00
eval_LEval.py	[Feature] Add LEval datasets	2023-08-11 17:38:31 +08:00
eval_llama_7b.py	[Feature] add llama-oriented dataset configs (#82 )	2023-08-11 12:48:05 +08:00