OpenCompass

mirror of https://github.com/open-compass/opencompass.git synced 2025-05-30 16:03:24 +08:00

Author	SHA1	Message	Date
Leymore	fbf5089c40	[Sync] update github token (#475 )	2023-10-13 06:50:54 -05:00
Leymore	362c33dff4	fix jieba rouge (#467 )	2023-10-12 10:25:19 +08:00
Leymore	d7ff933a73	[Fix] Use jieba rouge in lcsts (#459 ) * use jieba rouge in lcsts * use rouge_chinese	2023-10-09 10:10:33 +08:00
Tong Gao	c903e7f662	[Docs] Fix dead links in readme (#455 )	2023-10-07 00:14:29 -05:00
Tong Gao	119bfd1569	[Refactor] Move fix_id_list to Retriever (#442 ) * [Refactor] Move fix_id_list to Retriever * update * move to base * fix	2023-10-07 12:53:41 +08:00
Tong Gao	767c12a660	[Docs] update get_started (#435 ) * [Docs] update get_started * [Docs] Refactor get_started * update * add zh FAQ * add cn doc * update * fix dead links --------- Co-authored-by: Leymore <zfz-960727@163.com>	2023-10-07 11:49:40 +08:00
Lyu Han	6738247142	Integrate turbomind inference via its RPC API instead of its python API (#414 ) * support tis * integrate turbomind inference via its RPC API instead of its python API * update guide * update ip address spec * update according to reviewer's comments	2023-10-07 10:27:48 +08:00
Leymore	9db5652638	[Feature] re-implement ceval load dataset (#446 )	2023-09-27 21:18:48 +08:00
Hubert	d9f3e88dfe	[Fix] fix clp potential error and support bs>1 (#439 ) * [Fix] fix clp potential error and support bs>1 * [Fix] fix clp potential error and support bs>1 * minor fix * minor fix	2023-09-27 16:32:57 +08:00
philipwangOvO	3bb3d330eb	[Sync] Update LongEval (#443 )	2023-09-27 16:32:40 +08:00
Leymore	2bb7beeca3	[Fix] use eval field check (#441 )	2023-09-27 15:11:14 +08:00
Leymore	d6261e109d	[Doc] Update dataset list (#437 ) * add new dataset list * add new dataset list * add new dataset list * update * update * update readme --------- Co-authored-by: gaotongxiao <gaotongxiao@gmail.com>	2023-09-27 15:02:09 +08:00
Kevin Wang	dc1b82c346	[SIG] add GLUE_MRPC dataset (#440 )	2023-09-27 11:44:54 +08:00
Kevin Wang	14fdecfecc	[Dataset] add GLUE QQP dataset (#438 )	2023-09-27 11:36:43 +08:00
Kevin Wang	d8354fe5d8	[SIG] add GLUE_CoLA dataset (#406 ) * [Dataset] add GLUE_CoLA dataset * [update] use HFDataset to load glue/cola dataset * update --------- Co-authored-by: gaotongxiao <gaotongxiao@gmail.com>	2023-09-27 11:30:44 +08:00
Kevin Wang	012546666b	[SIG] add WikiText-2&103 (#397 ) * fix conflict * add eval_cfg	2023-09-26 14:31:15 +08:00
Tong Gao	9b21613d17	Bump version to 0.1.5 (#432 )	2023-09-22 19:17:23 +08:00
chenbohua3	b2926eac8f	[Feature] support customize config path (#423 ) * support customize config path * support customize config path * support customize config path	2023-09-22 19:12:02 +08:00
Yixiao Fang	524579b5af	[Docs] Add multimodal docs (#334 ) * add multimodal docs * fix lint --------- Co-authored-by: Tong Gao <gaotongxiao@gmail.com>	2023-09-22 18:58:29 +08:00
liushz	c5224c2a91	[Feature] Add kaoshi dataset (#392 ) * Add ToT method * Update ToT * Update ToT * Update ToT * Update ToT * Update ToT * Add Koashi * Update Kaoshi * Update Kaoshi * Update kaoshi * Update kaoshi * Update Kaoshi * Update Kaoshi * Update Kaoshi * Update Kaoshi * update Kaoshi * update * update * fix --------- Co-authored-by: gaotongxiao <gaotongxiao@gmail.com>	2023-09-22 18:46:33 +08:00
TTTTTiam	2a62bea1a4	add evaluation of scibench (#393 ) * add evaluation of scibench * add evaluation of scibench * update scibench * remove scibench evaluator --------- Co-authored-by: Leymore <zfz-960727@163.com>	2023-09-22 17:42:08 +08:00
Tong Gao	07574fddbb	[Fix] keep keys (#431 )	2023-09-22 17:30:54 +08:00
Tong Gao	a1ea3c094a	[Sync] Initial support of subjective evaluation (#421 ) Co-authored-by: Leymore <zfz-960727@163.com>	2023-09-22 15:42:31 +08:00
Ma Zerun	0f2c388280	Support GSM8k evaluation with tools by Lagent and LangChain (#277 ) * Support GSM8k evaluation with tools by Lagent and LangChain * Avoid to use MMEngine new feature * update document --------- Co-authored-by: Leymore <zfz-960727@163.com>	2023-09-22 15:28:22 +08:00
Tong Gao	681d3013de	[Feature] Log gold answer in prediction output (#419 ) * [Feature] Log gold answer in prediction output * support clp golden ans * minor fix --------- Co-authored-by: yingfhu <yingfhu@gmail.com>	2023-09-22 12:44:40 +08:00
Yike Yuan	97fdc51102	[Fix] Fix performance issue of visualglm. (#424 ) * [Fix] Visualglm performance fixed. * [Fix] Hide ckpt path.	2023-09-21 19:54:23 +08:00
Hubert	8803f7f7a6	[Feat] support anthropics evals dataset (#422) * [Feat] support anthropics ai risk dataset * [Feat] support anthropics evals dataset * [Feat] support anthropics evals dataset	2023-09-20 18:36:44 +08:00
Leymore	ae0cd8752f	[Feature] Use local accuracy from hf implements (#416 ) * use local accuracy from hf implements * add load from hf fallback	2023-09-20 16:35:22 +08:00
Leymore	94755f8e2f	update news (#420 )	2023-09-20 16:08:49 +08:00
Zequn Liu	ff2c15a09f	[fix] summarizer debug logger (#417 )	2023-09-20 15:29:26 +08:00
Tong Gao	3e980a9737	[Docs] Add intro figure to README (#413 ) * [Docs] Add intro figure to README * update	2023-09-19 20:19:35 +08:00
Yike Yuan	bd50bad8b5	[Feat] Support mm models on public dataset and fix several issues. (#412 ) * [Feat] Add public dataset support for visualglm, qwenvl, and flamingo * [Fix] MMBench related changes. * [Fix] Openflamingo inference. * [Fix] Hide ckpt path. * [Fix] Pre-commit. --------- Co-authored-by: Haodong Duan <dhd.efz@gmail.com>	2023-09-19 19:08:44 +08:00
Yuanhan Zhang	7c2726c23b	[Model] Yhzhang/add mlugowl llamaadapter (#405 ) * refine gitignore * [Feature]: Add minigpt-4 * [Feature]: Add mm local runner * [Feature]: Add instructblip * add otter and llama-adapter * add owl * add llama2-adapter and owl * lint * [Feature]: Add minigpt-4 * [Feature]: Add instructblip * add otter and llama-adapter * add owl * add llama2-adapter and owl * lint * lint * update * lint * lint * add __init__.py * update * update * update * update * [Feature]: Add minigpt-4 * [Feature]: Add mm local runner * [Feature]: Add instructblip * add otter and llama-adapter * add owl * add llama2-adapter and owl * lint * [Feature]: Add minigpt-4 * [Feature]: Add instructblip * add otter and llama-adapter * add owl * add llama2-adapter and owl * lint * lint * update * lint * lint * add __init__.py * update * update * update * update * optimize mmbench dataset args * update * update * run commit hook --------- Co-authored-by: liuyuan <3463423099@qq.com> Co-authored-by: kennymckormick <dhd@pku.edu.cn> Co-authored-by: kennymckormick <dhd.efz@gmail.com>	2023-09-19 14:21:26 +08:00
so2liu	267401bded	[Feat] add custom summarizer argument in CLI run mode (#411) * feat: add custom summarizer in CLI run mode * feat: search local config by match_cfg_file	2023-09-18 18:11:22 +08:00
philipwangOvO	f57c0702f7	[Docs] Readme in longeval (#389 ) * [Docs] Readme in longeval * [Docs] Readme in longeval * [Docs] Readme in longeval * [Docs] Readme in longeval * [Docs] Readme in longeval * [Docs] Readme in longeval * [Docs] Readme in longeval	2023-09-18 17:06:00 +08:00
Hubert	2c15a0c01d	[Feat] refine docs and codes for more user guides (#409 )	2023-09-18 16:12:13 +08:00
Hubert	a11cb45c83	[Feat] implementation for support promptbench (#239 ) * [Feat] support adv_glue dataset for adversarial robustness * reorg files * minor fix * minor fix * support prompt bench demo * minor fix * minor fix * minor fix * minor fix * minor fix * minor fix * minor fix * minor fix	2023-09-15 15:06:53 +08:00
Hubert	de8a154795	[Feat] support ds1000 dataset (#395) * [Feat] support ds1000 dataset	2023-09-15 12:50:27 +08:00
Yuan Liu	545d50a4c0	[Fix]: Add has_image to scienceqa (#391 ) Co-authored-by: bensenliu <bensenliu@tencent.com>	2023-09-13 13:07:14 +08:00
Xidong Wang	47a752cd56	[Dataset] Add CMB (#376 ) * Add CMB * modify CMB --------- Co-authored-by: wangxidong <xidongw@163.com>	2023-09-12 19:16:41 +08:00
Tong Gao	4d89533fbc	[Docs] Add FAQ (#384 ) * [Docs] Add FAQ * [Docs] Add FAQ	2023-09-12 12:11:38 +08:00
cdpath	722eb39526	fix potential oom issue (#387 )	2023-09-12 10:41:03 +08:00
Tong Gao	b9b145c335	[Docs] Fix incorrect name in get_started (#380 )	2023-09-11 16:10:09 +08:00
Tong Gao	ca68637c4e	[CI] Publish to Pypi (#366 )	2023-09-11 14:26:14 +08:00
liushz	63ced828d8	Update get_started.md (#377 )	2023-09-11 10:58:17 +08:00
Songyang Zhang	acc75c3d27	update news (#375 )	2023-09-11 10:11:54 +08:00
Tong Gao	c7a8b8fe98	Bump version to 0.1.4 (#367 )	2023-09-08 20:51:38 +08:00
Leymore	2c915218e8	[Feature] Add new models: baichuan2, tigerbot, vicuna v1.5 (#373) * add bag of new models: baichuan2, tigerbot, vicuna v1.5 * update * re-organize models * update readme * update	2023-09-08 15:41:20 +08:00
Leymore	b48d084020	[Fix] update bbh implement & fix bbh suffix (#371 )	2023-09-08 15:14:30 +08:00
Yixiao Fang	fada77a31c	[Feature] Add open source dataset eval config of instruct-blip (#370 ) * add configs * refactor model * add post processor and prompt constructor	2023-09-08 15:07:09 +08:00


232 Commits