adv_glue
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
agieval
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
anli
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
anthropics_evals
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
apps
[Sync] Sync with internal codes 2024.06.28 ( #1279 )
2024-06-28 14:16:34 +08:00
ARC_c
[Feature] Fullbench v0.1 language update ( #1463 )
2024-08-28 14:01:05 +08:00
ARC_e
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
bbh
[Feature] Support import configs/models/summarizers from whl ( #1376 )
2024-08-01 00:42:48 +08:00
calm
Calm dataset ( #1385 )
2024-08-01 10:03:21 +08:00
ceval
[Feature] Support import configs/models/summarizers from whl ( #1376 )
2024-08-01 00:42:48 +08:00
CHARM
[Feature] Update CHARM Memorization ( #1230 )
2024-07-26 18:42:30 +08:00
ChemBench
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
CIBench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
civilcomments
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
clozeTest_maxmin
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
CLUE_afqmc
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
CLUE_C3
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
CLUE_cmnli
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
CLUE_CMRC
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
CLUE_DRCD
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
CLUE_ocnli
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
cmb
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
cmmlu
[Feature] Support import configs/models/summarizers from whl ( #1376 )
2024-08-01 00:42:48 +08:00
collections
[Sync] Sync with internal codes 2024.06.28 ( #1279 )
2024-06-28 14:16:34 +08:00
commonsenseqa
[Bug] Commonsenseqa dataset fix ( #1425 )
2024-08-16 15:54:07 +08:00
commonsenseqa_cn
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
compassbench_20_v1_1
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
compassbench_20_v1_1_public
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
compassbench_v1_3
[Update] Compassbench v1.3 ( #1396 )
2024-08-12 19:09:19 +08:00
contamination
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
crowspairs
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
crowspairs_cn
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
cvalues
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
demo
[Doc] quick start swap tabs ( #1263 )
2024-07-05 23:51:42 +08:00
drop
[Sync] Sync with internal codes 2024.06.28 ( #1279 )
2024-06-28 14:16:34 +08:00
ds1000
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
FewCLUE_bustm
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
FewCLUE_chid
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
FewCLUE_cluewsc
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
FewCLUE_csl
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
FewCLUE_eprstmt
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
FewCLUE_ocnli_fc
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
FewCLUE_tnews
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
FinanceIQ
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
flames
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
flores
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
game24
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
GaokaoBench
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
GLUE_CoLA
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
GLUE_MRPC
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
GLUE_QQP
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
govrepcrs
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
gpqa
[Feature] Update pip install ( #1324 )
2024-07-29 18:32:50 +08:00
gsm8k
[Feature] Add model postprocess function ( #1484 )
2024-09-05 21:10:29 +08:00
gsm8k_contamination
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
gsm_hard
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
hellaswag
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
humaneval
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
humaneval_cn
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
humaneval_multi
[feat] support multipl-e ( #846 )
2024-02-06 23:30:28 +08:00
humaneval_plus
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
humanevalx
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
hungarian_exam
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
IFEval
[Doc] Update running command in README ( #1206 )
2024-05-30 00:06:39 +08:00
inference_ppl
[Feature] Support inference ppl datasets ( #1315 )
2024-07-22 17:59:30 +08:00
infinitebench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
iwslt2017
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
jigsawmultilingual
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
kaoshi
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
lambada
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
lawbench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
LCBench
[Feature] Support import configs/models/summarizers from whl ( #1376 )
2024-08-01 00:42:48 +08:00
lcsts
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
leval
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
llm_compression
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
longbench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
lveval
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
mastermath2024v1
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
math
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
math401
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
MathBench
[Fix] Fix MathBench ( #1351 )
2024-07-23 13:35:38 +08:00
mbpp
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
mbpp_cn
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
mbpp_plus
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
MedBench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
mgsm
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
mmlu
[Feature] Add model postprocess function ( #1484 )
2024-09-05 21:10:29 +08:00
mmlu_pro
[Feature] Mmlu-pro auto-download ( #1464 )
2024-08-30 10:03:40 +08:00
MMLUArabic
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
narrativeqa
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
needlebench
[Feature] Needlebench auto-download update ( #1480 )
2024-09-05 17:22:42 +08:00
NPHardEval
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
nq
[Feature] Add model postprocess function ( #1484 )
2024-09-05 21:10:29 +08:00
nq_cn
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
obqa
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
OpenFinData
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
piqa
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
PJExam
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
promptbench
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
py150
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
qabench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
qasper
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
qaspercut
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
QuALITY
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
race
[Feature] Fullbench v0.1 language update ( #1463 )
2024-08-28 14:01:05 +08:00
realtoxicprompts
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
rolebench
[Feature] Add abbr for rolebench dataset ( #1431 )
2024-08-20 11:22:48 +08:00
ruler
[Feature] Add Ruler datasets ( #1310 )
2024-08-20 11:40:11 +08:00
s3eval
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
safety
Align prompt files with their hash ( #1 )
2023-07-05 18:28:58 +08:00
scibench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
scicode
[Feature] Add support for SciCode ( #1417 )
2024-08-22 13:42:25 +08:00
siqa
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
squad20
[Feature] Add Xiezhi SQuAD2.0 ANLI ( #101 )
2023-08-10 14:04:18 +08:00
storycloze
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
strategyqa
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
subjective
[Update] Support auto-download of FOFO/MT-Bench-101 ( #1423 )
2024-08-16 11:57:41 +08:00
summedits
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
summscreen
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
SuperGLUE_AX_b
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
SuperGLUE_AX_g
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
SuperGLUE_BoolQ
[Feature] Fullbench v0.1 language update ( #1463 )
2024-08-28 14:01:05 +08:00
SuperGLUE_CB
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
SuperGLUE_COPA
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
SuperGLUE_MultiRC
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
SuperGLUE_ReCoRD
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
SuperGLUE_RTE
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
SuperGLUE_WiC
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
SuperGLUE_WSC
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
SVAMP
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
TabMWP
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
taco
[Sync] Sync with internal codes 2024.06.28 ( #1279 )
2024-06-28 14:16:34 +08:00
teval
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
TheoremQA
[Doc] Update running command in README ( #1206 )
2024-05-30 00:06:39 +08:00
triviaqa
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
triviaqarc
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
truthfulqa
[Fix] Update SciCode and Gemma model ( #1449 )
2024-08-23 10:42:27 +08:00
tydiqa
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
wikibench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
wikitext
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
winograd
[Feature] Support import configs/models/summarizers from whl ( #1376 )
2024-08-01 00:42:48 +08:00
winogrande
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00
XCOPA
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
xiezhi
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
XLSum
Update configs ( #9 )
2023-07-06 12:27:41 +08:00
Xsum
[Feature] Support ModelScope datasets ( #1289 )
2024-07-29 13:48:32 +08:00