.. |
groups
|
[Feature] Add AceGPT-MMLUArabic benchmark (#1099)
|
2024-05-08 15:00:26 +08:00 |
agent_bench.py
|
[Sync] Sync Internal (#941)
|
2024-03-04 14:42:36 +08:00 |
cibench.py
|
Update CIBench (#1089)
|
2024-04-26 18:46:02 +08:00 |
code_passk.py
|
[Sync] Sync Internal (#941)
|
2024-03-04 14:42:36 +08:00 |
compass_knowledge.py
|
[Sync] update taco (#1030)
|
2024-04-09 17:50:23 +08:00 |
compass_math.py
|
[Sync] Sync Internal (#941)
|
2024-03-04 14:42:36 +08:00 |
compassbench_v1_language.py
|
[Sync] Sync Internal (#941)
|
2024-03-04 14:42:36 +08:00 |
compassbench_v1_objective.py
|
[Sync] update taco (#1030)
|
2024-04-09 17:50:23 +08:00 |
compassbench_v1_reason.py
|
[Sync] Sync Internal (#941)
|
2024-03-04 14:42:36 +08:00 |
contamination.py
|
[Feature] Contamination analysis for MMLU, Hellaswag, and ARC_c (#699)
|
2024-01-08 15:51:48 +08:00 |
example.py
|
add mgsm datasets (#1081)
|
2024-05-06 15:29:34 +08:00 |
infinitebench.py
|
[Feature] Add InfiniteBench (#739)
|
2023-12-26 15:36:27 +08:00 |
internlm2_keyset.py
|
[Sync] Add InternLM2 Keyset Evaluation Demo (#807)
|
2024-01-17 13:48:12 +08:00 |
lawbench.py
|
[Feature] Add lawbench (#460)
|
2023-10-13 06:51:36 -05:00 |
leaderboard.py
|
[Sync] Merge branch 'dev' into zfz/update-keyset-demo (#876)
|
2024-02-05 23:29:10 +08:00 |
leval.py
|
[Sync] Add InternLM2 Keyset Evaluation Demo (#807)
|
2024-01-17 13:48:12 +08:00 |
longbench.py
|
[Sync] Add InternLM2 Keyset Evaluation Demo (#807)
|
2024-01-17 13:48:12 +08:00 |
longeval_v2.py
|
[Sync] Sync with internal codes 2023.01.08 (#777)
|
2024-01-08 14:07:24 +00:00 |
lveval.py
|
[Feature] add lveval benchmark (#914)
|
2024-03-04 11:22:03 +08:00 |
math_agent.py
|
[Feat] Update math/agent (#716)
|
2023-12-19 21:20:42 +08:00 |
math_baseline.py
|
[Feat] Update math/agent (#716)
|
2023-12-19 21:20:42 +08:00 |
mathbench_v1.py
|
[Sync] update taco (#1030)
|
2024-04-09 17:50:23 +08:00 |
mathbench.py
|
Mathbench update postprocess (#600)
|
2023-11-20 16:48:55 +08:00 |
medium.py
|
[Feature] Use dataset in local path (#570)
|
2023-11-13 13:00:37 +08:00 |
needlebench.py
|
[Fix] Fix NeedleBench Summarizer Typo (#1125)
|
2024-05-08 20:00:15 +08:00 |
plugineval.py
|
[Sync] Merge branch 'dev' into zfz/update-keyset-demo (#876)
|
2024-02-05 23:29:10 +08:00 |
small.py
|
[Feature] Use dataset in local path (#570)
|
2023-11-13 13:00:37 +08:00 |
subjective.py
|
[Sync] update github token (#475)
|
2023-10-13 06:50:54 -05:00 |
teval.py
|
[Sync] Merge branch 'dev' into zfz/update-keyset-demo (#876)
|
2024-02-05 23:29:10 +08:00 |
tiny.py
|
[Sync] update 20240308 (#953)
|
2024-03-11 22:34:19 +08:00 |