.. |
agieval.py
|
Add release contribution
|
2023-07-05 03:15:31 +00:00 |
bbh.py
|
[Feat] support opencompass
|
2023-07-04 22:11:33 +08:00 |
ceval.py
|
[Feature] re-implement ceval load dataset (#446)
|
2023-09-27 21:18:48 +08:00 |
cibench.py
|
Update CIBench (#1089)
|
2024-04-26 18:46:02 +08:00 |
cmmlu.py
|
[Feature] Add qwen & qwen-chat support (#286)
|
2023-08-31 11:29:05 +08:00 |
ds1000.py
|
[Sync] some renaming (#641)
|
2023-11-27 16:06:49 +08:00 |
flores.py
|
Update configs (#9)
|
2023-07-06 12:27:41 +08:00 |
GaokaoBench.py
|
Update configs (#9)
|
2023-07-06 12:27:41 +08:00 |
infinitebench.py
|
[Feature] Add InfiniteBench (#739)
|
2023-12-26 15:36:27 +08:00 |
jigsaw_multilingual.py
|
Update configs (#9)
|
2023-07-06 12:27:41 +08:00 |
lawbench.py
|
[Feature] Add lawbench (#460)
|
2023-10-13 06:51:36 -05:00 |
lcbench.py
|
[Sync] update taco (#1030)
|
2024-04-09 17:50:23 +08:00 |
leval.py
|
[Sync] Add InternLM2 Keyset Evaluation Demo (#807)
|
2024-01-17 13:48:12 +08:00 |
longbench.py
|
[Sync] Add InternLM2 Keyset Evaluation Demo (#807)
|
2024-01-17 13:48:12 +08:00 |
lveval.py
|
[Feature] add lveval benchmark (#914)
|
2024-03-04 11:22:03 +08:00 |
mathbench_agent.py
|
[Sync] Add InternLM2 Keyset Evaluation Demo (#807)
|
2024-01-17 13:48:12 +08:00 |
mathbench_v1.py
|
[Sync] update taco (#1030)
|
2024-04-09 17:50:23 +08:00 |
mathbench.py
|
[Sync] Add InternLM2 Keyset Evaluation Demo (#807)
|
2024-01-17 13:48:12 +08:00 |
mmlu.py
|
OpenCompass Public MR
|
2023-07-05 03:15:21 +00:00 |
plugineval.py
|
[Sync] update taco (#1030)
|
2024-04-09 17:50:23 +08:00 |
scibench.py
|
add evaluation of scibench (#393)
|
2023-09-22 17:42:08 +08:00 |
teval.py
|
[Sync] Merge branch 'dev' into zfz/update-keyset-demo (#876)
|
2024-02-05 23:29:10 +08:00 |
tydiqa.py
|
[Sync] Fix cmnli, fix vicuna meta template, fix longbench postprocess and other minor fixes (#625)
|
2023-11-23 14:05:59 +08:00 |
xiezhi.py
|
[Feature] Add qwen & qwen-chat support (#286)
|
2023-08-31 11:29:05 +08:00 |