OpenCompass/configs
Yike Yuan 8d368d1cd6
[Feat] Support visualglm and llava for MMBench evaluation. (#211)
* [Feat] Support visualglm inference on MMBench.

* [Feat] Support llava inference on MMBench.

* [Fix] Fix pre-commit format.

* [Fix] Add docstring for llava

* [Fix] Fix multi-process inference error of LlaVA and add comments.
1. Set `low_cpu_mem_usage` to False to address device issue.
2. Add docstring and type hints.
3. Rename class and remove registry.

* [Fix] Pre-commit fix.

* [Fix] add forward entry, add dynamic import to seedbench

* [Fix] Fix pre-commit.

* [Fix] Fix missing context.

* [Fix] Fix docstring.
2023-08-21 15:57:30 +08:00
..
datasets [Dataset] LongBench (#236) 2023-08-21 14:15:20 +08:00
models [Feat] Add codegeex2 and Humanevalx (#210) 2023-08-17 11:03:16 +08:00
multimodal [Feat] Support visualglm and llava for MMBench evaluation. (#211) 2023-08-21 15:57:30 +08:00
summarizers [Dataset] LongBench (#236) 2023-08-21 14:15:20 +08:00
eval_codegeex2.py [Feat] Add codegeex2 and Humanevalx (#210) 2023-08-17 11:03:16 +08:00
eval_demo.py [Doc] Update logo icon (#32) 2023-07-08 16:40:24 +08:00
eval_gpt3.5.py [Feature] Calculate max_out_len without hard code for OpenAI model (#158) 2023-08-08 15:16:56 +08:00
eval_gpt4.py [Enhancement] Add humaneval postprocessor for GPT models & eval config for GPT4, enhance the original humaneval postprocessor (#129) 2023-08-10 16:31:12 +08:00
eval_internlm_7b.py Add models (#18) 2023-07-06 16:02:39 +08:00
eval_internlm_chat_7b_turbomind.py [Feature] Support turbomind (#166) 2023-08-10 16:25:11 +08:00
eval_internLM.py [Feature] Support intern lanuage model (#51) 2023-07-27 18:49:36 +08:00
eval_LEval.py [Feature] Add LEval datasets 2023-08-11 17:38:31 +08:00
eval_llama_7b.py [Feature] add llama-oriented dataset configs (#82) 2023-08-11 12:48:05 +08:00