OpenCompass

mirror of https://github.com/open-compass/opencompass.git synced 2025-05-30 16:03:24 +08:00

Linchen Xiao 8e55c9c6ee [Update] Compassbench v1.3 (#1396)	2024-08-12 19:09:19 +08:00
* stash files
* compassbench subjective evaluation added
* evaluation update
* fix lint
* update docs
* Update lint
* changes saved
* changes saved
* CompassBench subjective summarizer added (#1349)
* subjective summarizer added
* fix lint
[Fix] Fix MathBench (#1351)
Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>
[Update] Update model support list (#1353)
* fix pip version
* fix pip version
* update model support subjective summarizer updated knowledge, math objective done (data need update) remove secrets objective changes saved knowledge data added
* secrets removed
* changed added
* summarizer modified
* summarizer modified
* compassbench coding added
* fix lint
* objective summarizer updated
* compass_bench_v1.3 updated
* update files in config folder
* remove unused model
* lcbench modified
* removed model evaluation configs
* remove duplicated sdk implementation
---------
Co-authored-by: zhangsongyang <zhangsongyang@pjlab.org.cn>
groups	Calm dataset (#1385)	2024-08-01 10:03:21 +08:00
agent_bench.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
charm_reason.py	[Sync] Sync with internal codes 2024.06.28 (#1279)	2024-06-28 14:16:34 +08:00
chat_OC15_multi_faceted.py	[Sync] bump version (#1204)	2024-05-28 23:09:59 +08:00
chat_OC15.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
cibench.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
code_passk.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
compassbench_v1_1_objective_public.py	[Sync] Sync with internal codes 2024.06.28 (#1279)	2024-06-28 14:16:34 +08:00
compassbench_v1_1_objective.py	[Sync] Sync with internal codes 2024.06.28 (#1279)	2024-06-28 14:16:34 +08:00
compassbench_v1_3_objective.py	[Update] Compassbench v1.3 (#1396)	2024-08-12 19:09:19 +08:00
compassbench_v1_objective.py	[Sync] update github workflow (#1156)	2024-05-14 22:42:23 +08:00
contamination.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
example.py	Add `en` and `zh` groups to longbench summarizer; Fix longbench overall score (#1216)	2024-07-26 11:50:41 +08:00
infinitebench.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
internlm2_keyset.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
lawbench.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
leaderboard.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
leval.py	[Sync] Add InternLM2 Keyset Evaluation Demo (#807)	2024-01-17 13:48:12 +08:00
longbench.py	[Sync] Add InternLM2 Keyset Evaluation Demo (#807)	2024-01-17 13:48:12 +08:00
longeval_v2.py	[Sync] Sync with internal codes 2023.01.08 (#777)	2024-01-08 14:07:24 +00:00
lveval.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
math_agent.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
math_baseline.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
mathbench_v1.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
mathbench.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
medium.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
mmlu_pro.py	[Sync] Sync with internal codes 2024.06.28 (#1279)	2024-06-28 14:16:34 +08:00
needlebench.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
plugineval.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
small.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
subjective.py	[Sync] update github token (#475)	2023-10-13 06:50:54 -05:00
teval.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00
tiny.py	[Format] Add config lints (#892)	2024-05-14 15:35:58 +08:00