..
legacy
[Sync] update github workflow ( #1156 )
2024-05-14 22:42:23 +08:00
agieval.py
Add release contribution
2023-07-05 03:15:31 +00:00
bbh.py
[Feat] support opencompass
2023-07-04 22:11:33 +08:00
calm.py
Calm dataset ( #1385 )
2024-08-01 10:03:21 +08:00
ceval.py
[Feature] re-implement ceval load dataset ( #446 )
2023-09-27 21:18:48 +08:00
charm_reason.py
[Sync] format ( #1214 )
2024-05-30 00:21:58 +08:00
cibench.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
cmmlu.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
ds1000.py
[Sync] some renaming ( #641 )
2023-11-27 16:06:49 +08:00
flores.py
Update configs ( #9 )
2023-07-06 12:27:41 +08:00
GaokaoBench.py
Update configs ( #9 )
2023-07-06 12:27:41 +08:00
infinitebench.py
[Feature] Add InfiniteBench ( #739 )
2023-12-26 15:36:27 +08:00
jigsaw_multilingual.py
Update configs ( #9 )
2023-07-06 12:27:41 +08:00
lawbench.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
lcbench.py
[Sync] update taco ( #1030 )
2024-04-09 17:50:23 +08:00
leval.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
longbench.py
Add en
and zh
groups to longbench summarizer; Fix longbench overall score ( #1216 )
2024-07-26 11:50:41 +08:00
lveval.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
mathbench_2024.py
[Sync] bump version ( #1204 )
2024-05-28 23:09:59 +08:00
mathbench_agent.py
[Sync] Add InternLM2 Keyset Evaluation Demo ( #807 )
2024-01-17 13:48:12 +08:00
mathbench_v1_2024_lang.py
[Fix] Update option postprocess & mathbench language summarizer ( #1413 )
2024-08-22 14:49:07 +08:00
mathbench_v1_2024.py
Update MathBench summarizer & fix cot setting ( #1282 )
2024-07-01 21:51:17 +08:00
mathbench_v1.py
[Sync] update taco ( #1030 )
2024-04-09 17:50:23 +08:00
mathbench.py
[Sync] Add InternLM2 Keyset Evaluation Demo ( #807 )
2024-01-17 13:48:12 +08:00
mgsm.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
mmlu_pro.py
[Sync] Sync with internal codes 2024.06.28 ( #1279 )
2024-06-28 14:16:34 +08:00
mmlu.py
OpenCompass Public MR
2023-07-05 03:15:21 +00:00
MMLUArabic.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
plugineval.py
[Sync] update taco ( #1030 )
2024-04-09 17:50:23 +08:00
ruler.py
[Feature] Add Ruler datasets ( #1310 )
2024-08-20 11:40:11 +08:00
scibench.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
scicode.py
[Feature] Add SciCode summarizer config ( #1514 )
2024-09-10 16:06:02 +08:00
teval.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
tydiqa.py
[Sync] Fix cmnli, fix vicuna meta template, fix longbench postprocess and other minor fixes ( #625 )
2023-11-23 14:05:59 +08:00
xiezhi.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00