..
groups
[Feature] Add SciCode summarizer config ( #1514 )
2024-09-10 16:06:02 +08:00
agent_bench.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
charm_reason.py
[Sync] Sync with internal codes 2024.06.28 ( #1279 )
2024-06-28 14:16:34 +08:00
chat_OC15_multi_faceted.py
[Sync] bump version ( #1204 )
2024-05-28 23:09:59 +08:00
chat_OC15.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
cibench.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
code_passk.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
compassbench_v1_1_objective_public.py
[Sync] Sync with internal codes 2024.06.28 ( #1279 )
2024-06-28 14:16:34 +08:00
compassbench_v1_1_objective.py
[Sync] Sync with internal codes 2024.06.28 ( #1279 )
2024-06-28 14:16:34 +08:00
compassbench_v1_3_objective.py
[Update] Compassbench v1.3 ( #1396 )
2024-08-12 19:09:19 +08:00
compassbench_v1_objective.py
[Sync] update github workflow ( #1156 )
2024-05-14 22:42:23 +08:00
contamination.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
example.py
Add en
and zh
groups to longbench summarizer; Fix longbench overall score ( #1216 )
2024-07-26 11:50:41 +08:00
infinitebench.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
internlm2_keyset.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
lawbench.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
leaderboard.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
leval.py
[Sync] Add InternLM2 Keyset Evaluation Demo ( #807 )
2024-01-17 13:48:12 +08:00
longbench.py
[Sync] Add InternLM2 Keyset Evaluation Demo ( #807 )
2024-01-17 13:48:12 +08:00
longeval_v2.py
[Sync] Sync with internal codes 2023.01.08 ( #777 )
2024-01-08 14:07:24 +00:00
lveval.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
math_agent.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
math_baseline.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
mathbench_v1.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
mathbench.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
medium.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
mmlu_pro.py
[Sync] Sync with internal codes 2024.06.28 ( #1279 )
2024-06-28 14:16:34 +08:00
needlebench.py
[Feature] Needlebench auto-download update ( #1480 )
2024-09-05 17:22:42 +08:00
plugineval.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
ruler.py
[Feature] Add Ruler datasets ( #1310 )
2024-08-20 11:40:11 +08:00
scicode.py
[Feature] Add SciCode summarizer config ( #1514 )
2024-09-10 16:06:02 +08:00
small.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
subjective.py
[Sync] update github token ( #475 )
2023-10-13 06:50:54 -05:00
teval.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
tiny.py
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00