OpenCompass/configs/summarizers
Linchen Xiao 8e55c9c6ee
[Update] Compassbench v1.3 (#1396)
* stash files

* compassbench subjective evaluation added

* evaluation update

* fix lint

* update docs

* Update lint

* changes saved

* changes saved

* CompassBench subjective summarizer added (#1349)

* subjective summarizer added

* fix lint

[Fix] Fix MathBench (#1351)

Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>

[Update] Update model support list (#1353)

* fix pip version

* fix pip version

* update model support

subjective summarizer updated

knowledge, math objective done (data need update)

remove secrets

objective changes saved

knowledge data added

* secrets removed

* changed added

* summarizer modified

* summarizer modified

* compassbench coding added

* fix lint

* objective summarizer updated

* compass_bench_v1.3 updated

* update files in config folder

* remove unused model

* lcbench modified

* removed model evaluation configs

* remove duplicated sdk implementation

---------

Co-authored-by: zhangsongyang <zhangsongyang@pjlab.org.cn>
2024-08-12 19:09:19 +08:00
..
groups Calm dataset (#1385) 2024-08-01 10:03:21 +08:00
agent_bench.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
charm_reason.py [Sync] Sync with internal codes 2024.06.28 (#1279) 2024-06-28 14:16:34 +08:00
chat_OC15_multi_faceted.py [Sync] bump version (#1204) 2024-05-28 23:09:59 +08:00
chat_OC15.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
cibench.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
code_passk.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
compassbench_v1_1_objective_public.py [Sync] Sync with internal codes 2024.06.28 (#1279) 2024-06-28 14:16:34 +08:00
compassbench_v1_1_objective.py [Sync] Sync with internal codes 2024.06.28 (#1279) 2024-06-28 14:16:34 +08:00
compassbench_v1_3_objective.py [Update] Compassbench v1.3 (#1396) 2024-08-12 19:09:19 +08:00
compassbench_v1_objective.py [Sync] update github workflow (#1156) 2024-05-14 22:42:23 +08:00
contamination.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
example.py Add en and zh groups to longbench summarizer; Fix longbench overall score (#1216) 2024-07-26 11:50:41 +08:00
infinitebench.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
internlm2_keyset.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
lawbench.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
leaderboard.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
leval.py [Sync] Add InternLM2 Keyset Evaluation Demo (#807) 2024-01-17 13:48:12 +08:00
longbench.py [Sync] Add InternLM2 Keyset Evaluation Demo (#807) 2024-01-17 13:48:12 +08:00
longeval_v2.py [Sync] Sync with internal codes 2023.01.08 (#777) 2024-01-08 14:07:24 +00:00
lveval.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
math_agent.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
math_baseline.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
mathbench_v1.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
mathbench.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
medium.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
mmlu_pro.py [Sync] Sync with internal codes 2024.06.28 (#1279) 2024-06-28 14:16:34 +08:00
needlebench.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
plugineval.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
small.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
subjective.py [Sync] update github token (#475) 2023-10-13 06:50:54 -05:00
teval.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
tiny.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00