..
adv_glue
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
agieval
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
anli
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
anthropics_evals
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
apps
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
ARC_c
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
ARC_e
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
bbh
[Sync] update github workflow ( #1156 )
2024-05-14 22:42:23 +08:00
ceval
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
ChemBench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
CIBench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
civilcomments
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
clozeTest_maxmin
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
CLUE_afqmc
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
CLUE_C3
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
CLUE_cmnli
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
CLUE_CMRC
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
CLUE_DRCD
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
CLUE_ocnli
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
cmb
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
cmmlu
[Sync] update evaluator ( #1175 )
2024-05-21 14:22:46 +08:00
collections
[Sync] update github workflow ( #1156 )
2024-05-14 22:42:23 +08:00
commonsenseqa
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
commonsenseqa_cn
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
contamination
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
crowspairs
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
crowspairs_cn
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
cvalues
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
drop
[Feature] update drop dataset from openai simple eval ( #1092 )
2024-05-06 13:37:08 +08:00
ds1000
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
FewCLUE_bustm
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
FewCLUE_chid
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
FewCLUE_cluewsc
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
FewCLUE_csl
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
FewCLUE_eprstmt
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
FewCLUE_ocnli_fc
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
FewCLUE_tnews
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
FinanceIQ
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
flames
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
flores
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
game24
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
GaokaoBench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
GLUE_CoLA
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
GLUE_MRPC
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
GLUE_QQP
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
govrepcrs
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
gpqa
[Feature] Add gpqa prompt from simple_evals, openai ( #1080 )
2024-04-26 20:13:00 +08:00
gsm8k
[Sync] update github workflow ( #1156 )
2024-05-14 22:42:23 +08:00
gsm8k_contamination
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
gsm_hard
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
hellaswag
[Sync] update evaluator ( #1175 )
2024-05-21 14:22:46 +08:00
humaneval
Add humaneval prompt from simple_evals, openai ( #1076 )
2024-04-24 17:40:50 +08:00
humaneval_cn
[Feat] update code config ( #749 )
2023-12-29 18:46:34 +08:00
humaneval_multi
[feat] support multipl-e ( #846 )
2024-02-06 23:30:28 +08:00
humaneval_plus
[Fix] fix a bug of humanevalplus config ( #944 )
2024-03-05 11:37:17 +08:00
humanevalx
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
hungarian_exam
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
IFEval
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
infinitebench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
iwslt2017
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
jigsawmultilingual
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
kaoshi
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
lambada
[Feature] Use dataset in local path ( #570 )
2023-11-13 13:00:37 +08:00
lawbench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
lcsts
[Fix] Use jieba rouge in lcsts ( #459 )
2023-10-09 10:10:33 +08:00
leval
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
llm_compression
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
longbench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
lveval
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
mastermath2024v1
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
math
[Sync] update github workflow ( #1156 )
2024-05-14 22:42:23 +08:00
math401
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
MathBench
Update MathBench ( #1176 )
2024-05-21 14:45:43 +08:00
mbpp
[Sync] add OC16 entry ( #1171 )
2024-05-17 16:50:58 +08:00
mbpp_cn
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
mbpp_plus
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
MedBench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
mgsm
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
mmlu
[Sync] update evaluator ( #1175 )
2024-05-21 14:22:46 +08:00
MMLUArabic
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
narrativeqa
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
needlebench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
NPHardEval
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
nq
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
nq_cn
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
obqa
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
OpenFinData
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
piqa
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
PJExam
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
promptbench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
py150
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
qabench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
qasper
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
qaspercut
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
QuALITY
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
race
[Sync] update evaluator ( #1175 )
2024-05-21 14:22:46 +08:00
realtoxicprompts
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
rolebench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
s3eval
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
safety
Align prompt files with their hash ( #1 )
2023-07-05 18:28:58 +08:00
scibench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
siqa
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
squad20
[Feature] Add Xiezhi SQuAD2.0 ANLI ( #101 )
2023-08-10 14:04:18 +08:00
storycloze
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
strategyqa
[Feature] Use dataset in local path ( #570 )
2023-11-13 13:00:37 +08:00
subjective
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
summedits
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
summscreen
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
SuperGLUE_AX_b
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
SuperGLUE_AX_g
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
SuperGLUE_BoolQ
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
SuperGLUE_CB
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
SuperGLUE_COPA
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
SuperGLUE_MultiRC
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
SuperGLUE_ReCoRD
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
SuperGLUE_RTE
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
SuperGLUE_WiC
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
SuperGLUE_WSC
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
SVAMP
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
TabMWP
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
taco
[Sync] add OC16 entry ( #1171 )
2024-05-17 16:50:58 +08:00
teval
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
TheoremQA
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
triviaqa
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
triviaqarc
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
truthfulqa
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
tydiqa
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
wikibench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
wikitext
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
winograd
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
winogrande
[Sync] update evaluator ( #1175 )
2024-05-21 14:22:46 +08:00
XCOPA
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
xiezhi
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
XLSum
Update configs ( #9 )
2023-07-06 12:27:41 +08:00
Xsum
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00
z_bench
[Format] Add config lints ( #892 )
2024-05-14 15:35:58 +08:00