OpenCompass

mirror of https://github.com/open-compass/opencompass.git synced 2025-05-30 16:03:24 +08:00

Author	SHA1	Message	Date
Songyang Zhang	aa2b89b6f8	[Update] Add CascadeEvaluator with Data Replica (#2022 ) * Update CascadeEvaluator * Update CascadeEvaluator * Update CascadeEvaluator * Update Config * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update * Update	2025-05-20 16:46:55 +08:00
Songyang Zhang	c98599271b	[Update] Update OlympiadBench and Update LLM Judge (#1954 )	2025-03-18 20:15:20 +08:00
Jason Cheung	5d2d253d83	[BUG] Fix model_kwargs pass logic for vllm (#1958 )	2025-03-18 20:08:15 +08:00
liushz	5c8e91f329	[Fix] Fix vllm max_seq_len parameter transfer (#1745 ) * [Fix] Fix vllm max_seq_len parameter transfer * [Fix] Fix vllm max_seq_len parameter transfer * Update pr-run-test.yml * Update pr-run-test.yml --------- Co-authored-by: zhulinJulia24 <145004780+zhulinJulia24@users.noreply.github.com>	2024-12-16 21:44:36 +08:00
Lyu Han	b52ba65c26	[Feature] Integrate lmdeploy pipeline api (#1198 ) * integrate lmdeploy's pipeline api * fix linting * update user guide * rename * update * update * update * rollback class name * update * remove unused code * update * update * fix ci check * compatibility * remove concurrency * Update configs/models/hf_internlm/lmdeploy_internlm2_chat_7b.py * Update docs/zh_cn/advanced_guides/evaluation_lmdeploy.md * [Bug] fix lint --------- Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com> Co-authored-by: tonysy <sy.zhangbuaa@gmail.com>	2024-10-09 22:58:06 +08:00
Linchen Xiao	94b6bd65fc	[Fix] Fix cli evaluation for multiple models (#1454 ) * update * update	2024-08-23 17:15:36 +08:00
Songyang Zhang	5485207fbe	[Bump] Bump version to 0.3.1 (#1450 ) * [Bump] Bump version 0.3.1 * Update	2024-08-23 10:47:57 +08:00
Linchen Xiao	0fe9756c5d	[Doc] Update Readme (#1439 ) * update * update * update * update * update * update * update * update * update * update * update * update	2024-08-22 14:48:45 +08:00
liushz	d3963bceae	[Bug] Add model support for 'huggingface_above_v4_33' when using '-a' (#1430 ) Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>	2024-08-22 13:40:24 +08:00
Fengzhe Zhou	1d3a26c732	[Doc] quick start swap tabs (#1263 ) * [doc] quick start swap tabs * update docs * update * update * update * update * update * update * update	2024-07-05 23:51:42 +08:00
bittersweet1999	7c381e5be8	[Fix] fix summarizer (#1217 ) * fix summarizer * fix summarizer	2024-05-31 11:40:47 +08:00
Fengzhe Zhou	a77b8a5cec	[Sync] format (#1214 )	2024-05-30 00:21:58 +08:00
Fengzhe Zhou	d656e818f8	[Docs] Remove --no-batch-padding and Use --hf-num-gpus (#1205 ) * [Docs] Remove --no-batch-padding and Use -hf-num-gpus * update	2024-05-29 16:30:10 +08:00
Fengzhe Zhou	2954913d9b	[Sync] bump version (#1204 )	2024-05-28 23:09:59 +08:00
liushz	ba620c4afe	Update accelerator (#1195 ) * Add Math Evaluation with Judge Model Evaluator * Add Math Evaluation with Judge Model Evaluator * Add Math Evaluation with Judge Model Evaluator * Add Math Evaluation with Judge Model Evaluator * Fix Llama-3 meta template * Fix MATH with JudgeLM Evaluation * Fix MATH with JudgeLM Evaluation * Fix MATH with JudgeLM Evaluation * Fix MATH with JudgeLM Evaluation * Update acclerator * Update MathBench * Update accelerator --------- Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>	2024-05-28 17:17:54 +08:00
Fengzhe Zhou	5de85406ce	[Sync] add OC16 entry (#1171 )	2024-05-17 16:50:58 +08:00
Fengzhe Zhou	8ea2c404d7	[Feat] enable HuggingFacewithChatTemplate with --accelerator via cli (#1163 ) * enable HuggingFacewithChatTemplate with --accelerator via cli * rm vllm_internlm2_chat_7b	2024-05-15 21:51:07 +08:00
liushz	e3c0448bbc	Update accelerator (#1152 ) * Update acclerator * update run --------- Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn> Co-authored-by: Fengzhe Zhou <zfz-960727@163.com>	2024-05-15 14:31:47 +08:00
Fengzhe Zhou	7505b3cadf	[Feature] Add huggingface apply_chat_template (#1098 ) * add TheoremQA with 5-shot * add huggingface_above_v4_33 classes * use num_worker partitioner in cli * update theoremqa * update TheoremQA * add TheoremQA * rename theoremqa -> TheoremQA * update TheoremQA output path * rewrite many model configs * update huggingface * further update * refine configs * update configs * update configs * add configs/eval_llama3_instruct.py * add summarizer multi faceted * update bbh datasets * update configs/models/hf_llama/lmdeploy_llama3_8b_instruct.py * rename class * update readme * update hf above v4.33	2024-05-14 14:50:16 +08:00
Fengzhe Zhou	19d7e630d6	[Sync] Update accelerator (#1122 ) (cherry picked from commit 4beb6d9ab655d8a626971841b7acfd9fae9d438f) Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>	2024-05-09 14:32:31 +08:00
dmitrysarov	cce5b6fbb6	fix output typing, change mutable list to immutable tuple (#989 ) * fix output typing, change mutable list to immutable tuple * import missed type * format --------- Co-authored-by: Leymore <zfz-960727@163.com>	2024-04-26 23:07:34 +08:00
Haodong Duan	3a232db471	[Deperecate] Remove multi-modal related stuff (#1072 ) * Remove MultiModal * update index.rst * update README * remove mmbench codes * update news --------- Co-authored-by: Leymore <zfz-960727@163.com>	2024-04-26 21:20:14 +08:00
bittersweet1999	6ba1c4937d	[Feature] Support Math evaluation via judgemodel (#1094 ) * support openai math evaluation * support openai math evaluation * support openai math evaluation * support math llm judge * support math llm judge	2024-04-26 14:56:23 +08:00
Fengzhe Zhou	8c85edd1cd	[Sync] deprecate old mbpps (#1064 )	2024-04-19 20:49:46 +08:00
Fengzhe Zhou	b39f501563	[Sync] update taco (#1030 )	2024-04-09 17:50:23 +08:00
Mo Li	b50d163265	[Fix] Refactor Needlebench Configs for CLI Testing Support (#1020 ) * add needlebench datasets suffix * fix import * update run.py args for summarizer key and dataset suffix * update utils/run.py	2024-04-07 15:12:56 +08:00
Fengzhe Zhou	3a68083ecc	[Sync] update configs (#734 )	2023-12-25 21:59:16 +08:00
Fengzhe Zhou	6405cd2db5	use example summarizer by default (#508 )	2023-10-27 11:45:29 +08:00
Leymore	fccfcb6f5b	fix summary default (#483 )	2023-10-17 11:32:38 +08:00
chenbohua3	b2926eac8f	[Feature] support customize config path (#423 ) * support customize config path * support customize config path * support customize config path	2023-09-22 19:12:02 +08:00
Yuanhan Zhang	7c2726c23b	[Model] Yhzhang/add mlugowl llamaadapter (#405 ) * refine gitignore * [Feature]: Add minigpt-4 * [Feature]: Add mm local runner * [Feature]: Add instructblip * add otter and llama-adapter * add owl * add llama2-adapter and owl * lint * [Feature]: Add minigpt-4 * [Feature]: Add instructblip * add otter and llama-adapter * add owl * add llama2-adapter and owl * lint * lint * update * lint * lint * add __init__.py * update * update * update * update * [Feature]: Add minigpt-4 * [Feature]: Add mm local runner * [Feature]: Add instructblip * add otter and llama-adapter * add owl * add llama2-adapter and owl * lint * [Feature]: Add minigpt-4 * [Feature]: Add instructblip * add otter and llama-adapter * add owl * add llama2-adapter and owl * lint * lint * update * lint * lint * add __init__.py * update * update * update * update * optimize mmbench dataset args * update * update * run commit hook --------- Co-authored-by: liuyuan <3463423099@qq.com> Co-authored-by: kennymckormick <dhd@pku.edu.cn> Co-authored-by: kennymckormick <dhd.efz@gmail.com>	2023-09-19 14:21:26 +08:00
so2liu	267401bded	[Feat] add custom summarizer argument in CLI run mode 在CLI启动模式中添加自定义Summarizer参数 (#411 ) * feat: add custom summarizer in CLI run mode * feat: search local config by match_cfg_file	2023-09-18 18:11:22 +08:00
Tong Gao	ce65d3393b	[Sync] Use finally to clean up temp files (#337 )	2023-09-04 15:20:16 +08:00
Leymore	e810974068	[Fix] Fix when missing both pad and eos token (#287 ) * fix when missing both pad and eos token * update pad_token_id impl	2023-08-31 16:53:39 +08:00
Tong Gao	9058be07b8	[Feature] Simplify entry script (#204 ) * [Feature] Simply entry script * update	2023-08-25 17:36:30 +08:00

35 Commits