OpenCompass

mirror of https://github.com/open-compass/opencompass.git synced 2025-05-30 16:03:24 +08:00

Author	SHA1	Message	Date
Songyang Zhang	3f36db3b06	[Feature] Support turbomind (#166 ) * support turbomind * update doc * Update docs/en/advanced_guides/evaluation_turbomind.md Co-authored-by: Tong Gao <gaotongxiao@gmail.com> * Update docs/zh_cn/advanced_guides/evaluation_turbomind.md Co-authored-by: Tong Gao <gaotongxiao@gmail.com> * Update docs/zh_cn/advanced_guides/evaluation_turbomind.md Co-authored-by: Tong Gao <gaotongxiao@gmail.com> * Update docs/en/advanced_guides/evaluation_turbomind.md Co-authored-by: Tong Gao <gaotongxiao@gmail.com> * update --------- Co-authored-by: Tong Gao <gaotongxiao@gmail.com>	2023-08-10 16:25:11 +08:00
Leymore	e7fc54baf1	[Feature] Add Xiezhi SQuAD2.0 ANLI (#101 ) * add Xiezhi SQuAD2.0 ANLI; update WSC * update * update * update doc string	2023-08-10 14:04:18 +08:00
Yuan Liu	a205629ff3	[Feature]: Refactor input and output (#176 ) * [Feature]: Refactor input and output * [Feature]: Update tasks	2023-08-10 14:01:28 +08:00
Leymore	876ade71a5	[Fix] Fix AGIEval multiple choice (#137 ) * update agieval data * rename variables	2023-08-10 11:38:24 +08:00
dependabot[bot]	0555d59a6a	Bump requests from 2.28.1 to 2.31.0 (#178 ) Bumps [requests](https://github.com/psf/requests) from 2.28.1 to 2.31.0. - [Release notes](https://github.com/psf/requests/releases) - [Changelog](https://github.com/psf/requests/blob/main/HISTORY.md) - [Commits](https://github.com/psf/requests/compare/v2.28.1...v2.31.0) --- updated-dependencies: - dependency-name: requests dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-08-09 19:41:09 +08:00
Tong Gao	e6194df29e	[Fix] Use a copy of the config object in Task (#174 )	2023-08-09 15:24:49 +08:00
Haodong Duan	d5d4f47371	[API] Refine OpenAI (#175 )	2023-08-09 12:38:57 +08:00
Zaida Zhou	af436f5951	[Feature] Calculate max_out_len without hard code for OpenAI model (#158 ) * calulate max_out_len without hard code * set default value * update configs * Update configs/eval_gpt3.5.py Co-authored-by: Tong Gao <gaotongxiao@gmail.com> --------- Co-authored-by: Tong Gao <gaotongxiao@gmail.com>	2023-08-08 15:16:56 +08:00
Yuan Liu	2f1949e7a1	[Feature]: Add mm suport for local (#169 )	2023-08-08 14:21:58 +08:00
Songyang Zhang	5b80d83866	[Docs] update readme (#165 )	2023-08-08 12:49:04 +08:00
Haodong Duan	6ca2be6626	[Script] Add scripts to evaluate MMBench (#161 ) * update * update * Update README.md Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com> * refine * update default * update CN README --------- Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>	2023-08-07 16:53:36 +08:00
Tong Gao	1bab316624	update internal readme (#162 )	2023-08-07 14:27:15 +08:00
Tong Gao	bbdedc6c95	[Enhancement] Optimize OpenAI models (#128 ) * [Feature] Enhance OpenAI API, add example config for GPT evaluation	2023-08-03 14:55:16 +08:00
Haodong Duan	d17a5b94fa	[Refine] Refine PR #122 (#123 ) * update * update	2023-08-03 14:54:38 +08:00
Yuan Liu	191a3f6f9d	[Feature]: Use multimodal (#73 ) * [Feature]: Add minigpt-4 * [Feature]: Add mm local runner * [Feature]: Add instructblip * [Feature]: Delete redundant file * [Feature]: Delete redundant file * [Feature]: Add README to InstructBLIP * [Feature]: Update MiniGPT-4 * [Fix]: Fix lint * [Feature]add omnibenchmark readme (#49) * add omnibenchmark readme * fix * Update OmniMMBench.md * Update OmniMMBench.md * Update OmniMMBench.md * [Fix]: Refine name (#54) * [Feature]: Unify out and err * [Fix]: Fix lint * [Feature]: Rename to mmbench and change weight path * [Feature]: Delete Omni in instructblip * [Feature]: Check the avaliablity of lavis * [Fix]: Fix lint * [Feature]: Refactor MM * [Refactor]: Refactor path * [Feature]: Delete redundant files * [Refactor]: Delete redundant files --------- Co-authored-by: Wangbo Zhao(黑色枷锁) <56866854+wangbo-zhao@users.noreply.github.com>	2023-08-03 11:07:50 +08:00
Zaida Zhou	289e0567bd	Fix typo in readme (#152 )	2023-08-02 19:01:39 +08:00
Leymore	bbe45c68a3	[Doc] update acknowledgements (#147 )	2023-08-02 10:16:53 +08:00
Tong Gao	8b163bd8e9	[Feature] Several enhancements (#142 )	2023-08-01 18:19:49 +08:00
Tong Gao	c00179d46b	[Feature] Evaluating acc based on minimum edit distance, update SIQA (#130 ) * [Feature] Support evaluating acc based on minimum edit distance, update SIQA * update	2023-08-01 14:24:27 +08:00
Ezra-Yu	e9b7b8ab02	[DOC] Add metric doc (#118 ) * update * update * update metric docs * update index.rst * update metrics	2023-08-01 11:47:04 +08:00
Songyang Zhang	d860b61d04	[Enhancement] Update README.md (#119 ) * Update README.md * update README_zh-CN.md * update get_started --------- Co-authored-by: Leymore <zfz-960727@163.com>	2023-07-31 18:26:46 +08:00
Leymore	262ab794fb	[Docs] Update prompt docs (#46 ) * [Docs] Update prompt docs * update * [Docs] Prompt docs (#112) * update docs * update * update * Update en prompt template * Update en prompt doc * fix * fix --------- Co-authored-by: Tong Gao <gaotongxiao@gmail.com>	2023-07-29 00:46:13 +08:00
Anakin Skywalker	e04f88424d	edit doc (#125 )	2023-07-28 17:33:51 +08:00
Leymore	d862f570aa	[Feature] Add SC (#126 ) * add self-consistency * add CoT method Self-Consistency * fix typo error and update openicl_eval * add tydiQA-GoldP task * fix sc * rename gsm8k_sc * fix sc * add self-consistency doc * refine sc --------- Authored-by: liushz <qq1791167085@163.com>	2023-07-28 17:29:37 +08:00
Haodong Duan	538b439302	[Fix] Fix seed in HFEvaluator (#122 )	2023-07-28 11:29:01 +08:00
Haodong Duan	46c9645753	[Feature] Allow explicitly setting the temperature for API model (#121 ) * allow explicitly setting the temperature * update	2023-07-28 11:28:15 +08:00
Tong Gao	80ce18f860	[Docs] Update issue templates for proper guidance to discussions (#116 )	2023-07-27 19:38:41 +08:00
gowithme	57fcfc975a	[Feature] Support intern lanuage model (#51 ) * support internLM * support internLM * simplify intern model files * update storage_manager * support internLM * Modify the file organization structure * support internLM * support internLM * support internLM * support internLM * change some details	2023-07-27 18:49:36 +08:00
vansin	8a4d0867ab	Doc: add twitter link (#111 )	2023-07-27 17:19:35 +08:00
Ezra-Yu	d1ec6047af	[Doc] Update Readme and Fix failed links (#108 ) * update reame and fix failed links * update * update review	2023-07-27 17:15:25 +08:00
Hubert	aa13067735	[Feat] add auto assignee bot (#105 ) * [Feat] add auto assignee bot * minor fix	2023-07-26 16:43:31 +08:00
Hubert	b7184e9db5	[Refactor] Update crows-pairs evaluation (#98 ) * [Refactor] Update crows-pairs evaluation * [Refactor] Update crows-pairs evaluation * minor	2023-07-26 11:21:32 +08:00
Haodong Duan	4b0aa80466	[Fix] MMBench Doc Fix (#96 ) * update * update * fix lint	2023-07-25 10:43:22 +08:00
Tong Gao	3715be6595	[Fix] Fix llama configs (#72 ) Co-authored-by: Leymore <zfz-960727@163.com>	2023-07-25 10:21:31 +08:00
Haonan Li	e9cdb24ddd	[Feature] Add CMMLU dataset (#91 ) * add CMMLU * debug cmmlu * add slurm args `qos` * fix format: space before comment * remove unused variable * change the location of `answer is` --------- Co-authored-by: 李浩楠 <lihaonan@lihaonandeMacBook-Air.local> Co-authored-by: 李浩楠 <haonan.li> Co-authored-by: Leymore <zfz-960727@163.com>	2023-07-25 10:14:27 +08:00
Haodong Duan	6e885d668b	force utf-8 encoding for all non-dataset fileios (#97 )	2023-07-25 10:06:01 +08:00
Leymore	3fe5ee096c	[Feature] Add heuristic size partitioner (#63 ) * [Feature] Add heuristic size partitioner * update	2023-07-20 11:53:24 +08:00
Leymore	eea8b04417	[Feature] Add llama-2 models (#81 ) * add llama-2 models * update docs --------- Co-authored-by: gaotongxiao <gaotongxiao@gmail.com>	2023-07-19 19:51:29 +08:00
Hubert	f83e125e5a	[Feat] Support CValues Responsibility dataset (#78 ) * [Feat] support CValues * minor fix	2023-07-18 18:45:15 +08:00
LZH	26e2f171f4	[Feature] Support load PEFT adapter for HuggingFace model (#74 ) * support peft for HuggingFace model * add docstring	2023-07-18 16:21:43 +08:00
liushz	f36c0496f3	[Feature] Add tydiqa-goldp (#75 ) Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>	2023-07-18 14:54:35 +08:00
Hubert	29598e3619	[Feat] add falcon-40b (#76 ) * [Feat] add falcon-40b * minor fix	2023-07-18 14:40:16 +08:00
Tong Gao	311bf0daa7	[Fix] Fix CI (#70 ) * [Fix] Fix CI * [Fix] Fix CI * [Fix] Fix CI * update	2023-07-17 19:10:59 +08:00
Tong Gao	29006e39c0	[Fix] Fix circular import of PromptTemplate (#71 )	2023-07-17 19:09:38 +08:00
Tong Gao	1e44541730	[Enhancement] Test linting in CI and fix existing linting errors (#69 ) * [Enhancement] Test linting in CI * fix linting	2023-07-17 15:59:10 +08:00
Leymore	9a16448905	[Fix] eval_llama_7b (#68 )	2023-07-17 15:28:21 +08:00
Leymore	edb23d15d1	[Feature] Add baichuan13b model configs (#60 ) * [Feature] Add baichuan13b * update num_gpus	2023-07-17 14:38:12 +08:00
Leymore	1326aff77e	[Feature] Add logger info and remove dataset bugs (#61 ) * Add logger info and remove dataset bugs * fix typo	2023-07-17 14:26:30 +08:00
Tong Gao	77a1cc4486	[Docs] Update evaluation doc (#39 )	2023-07-17 14:12:19 +08:00
Leymore	e19a0c1cf8	[Feature] add --dry-run option (#59 )	2023-07-17 10:41:38 +08:00

... 16 17 18 19 20

953 Commits