OpenCompass/configs/datasets
Hari Seldon 14b4b735cb
[Feature] Add support for SciCode (#1417)
* add SciCode

* add SciCode

* add SciCode

* add SciCode

* add SciCode

* add SciCode

* add SciCode

* add SciCode w/ bg

* add scicode

* Update README.md

* Update README.md

* Delete configs/eval_SciCode.py

* rename

* 1

* rename

* Update README.md

* Update scicode.py

* Update scicode.py

* fix some bugs

* Update

* Update

---------

Co-authored-by: root <HariSeldon0>
Co-authored-by: tonysy <sy.zhangbuaa@gmail.com>
2024-08-22 13:42:25 +08:00
..
adv_glue [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
agieval [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
anli [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
anthropics_evals [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
apps [Sync] Sync with internal codes 2024.06.28 (#1279) 2024-06-28 14:16:34 +08:00
ARC_c [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
ARC_e [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
bbh [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
calm Calm dataset (#1385) 2024-08-01 10:03:21 +08:00
ceval [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
CHARM [Feature] Update CHARM Memeorziation (#1230) 2024-07-26 18:42:30 +08:00
ChemBench [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
CIBench [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
civilcomments [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
clozeTest_maxmin [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
CLUE_afqmc [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
CLUE_C3 [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
CLUE_cmnli [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
CLUE_CMRC [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
CLUE_DRCD [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
CLUE_ocnli [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
cmb [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
cmmlu [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
collections [Sync] Sync with internal codes 2024.06.28 (#1279) 2024-06-28 14:16:34 +08:00
commonsenseqa [Bug] Commonsenseqa dataset fix (#1425) 2024-08-16 15:54:07 +08:00
commonsenseqa_cn [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
compassbench_20_v1_1 [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
compassbench_20_v1_1_public [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
compassbench_v1_3 [Update] Compassbench v1.3 (#1396) 2024-08-12 19:09:19 +08:00
contamination [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
crowspairs [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
crowspairs_cn [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
cvalues [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
demo [Doc] quick start swap tabs (#1263) 2024-07-05 23:51:42 +08:00
drop [Sync] Sync with internal codes 2024.06.28 (#1279) 2024-06-28 14:16:34 +08:00
ds1000 [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
FewCLUE_bustm [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
FewCLUE_chid [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
FewCLUE_cluewsc [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
FewCLUE_csl [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
FewCLUE_eprstmt [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
FewCLUE_ocnli_fc [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
FewCLUE_tnews [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
FinanceIQ [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
flames [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
flores [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
game24 [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
GaokaoBench [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
GLUE_CoLA [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
GLUE_MRPC [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
GLUE_QQP [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
govrepcrs [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
gpqa [Feature] Update pip install (#1324) 2024-07-29 18:32:50 +08:00
gsm8k [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
gsm8k_contamination [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
gsm_hard [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
hellaswag [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
humaneval [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
humaneval_cn [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
humaneval_multi [feat] support multipl-e (#846) 2024-02-06 23:30:28 +08:00
humaneval_plus [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
humanevalx [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
hungarian_exam [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
IFEval [Doc] Update running command in README (#1206) 2024-05-30 00:06:39 +08:00
inference_ppl [Feature] Support inference ppl datasets (#1315) 2024-07-22 17:59:30 +08:00
infinitebench [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
iwslt2017 [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
jigsawmultilingual [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
kaoshi [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
lambada [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
lawbench [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
LCBench [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
lcsts [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
leval [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
llm_compression [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
longbench [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
lveval [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
mastermath2024v1 [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
math [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
math401 [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
MathBench [Fix] Fix MathBench (#1351) 2024-07-23 13:35:38 +08:00
mbpp [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
mbpp_cn [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
mbpp_plus [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
MedBench [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
mgsm [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
mmlu [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
mmlu_pro [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
MMLUArabic [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
narrativeqa [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
needlebench [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
NPHardEval [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
nq [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
nq_cn [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
obqa [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
OpenFinData [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
piqa [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
PJExam [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
promptbench [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
py150 [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
qabench [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
qasper [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
qaspercut [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
QuALITY [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
race [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
realtoxicprompts [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
rolebench [Feature] Add abbr for rolebench dataset (#1431) 2024-08-20 11:22:48 +08:00
ruler [Feature] Add Ruler datasets (#1310) 2024-08-20 11:40:11 +08:00
s3eval [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
safety Align prompt files with their hash (#1) 2023-07-05 18:28:58 +08:00
scibench [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
scicode [Feature] Add support for SciCode (#1417) 2024-08-22 13:42:25 +08:00
siqa [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
squad20 [Feature] Add Xiezhi SQuAD2.0 ANLI (#101) 2023-08-10 14:04:18 +08:00
storycloze [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
strategyqa [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
subjective [Update] Support auto-download of FOFO/MT-Bench-101 (#1423) 2024-08-16 11:57:41 +08:00
summedits [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
summscreen [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
SuperGLUE_AX_b [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
SuperGLUE_AX_g [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
SuperGLUE_BoolQ [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
SuperGLUE_CB [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
SuperGLUE_COPA [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
SuperGLUE_MultiRC [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
SuperGLUE_ReCoRD [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
SuperGLUE_RTE [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
SuperGLUE_WiC [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
SuperGLUE_WSC [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
SVAMP [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
TabMWP [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
taco [Sync] Sync with internal codes 2024.06.28 (#1279) 2024-06-28 14:16:34 +08:00
teval [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
TheoremQA [Doc] Update running command in README (#1206) 2024-05-30 00:06:39 +08:00
triviaqa [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
triviaqarc [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
truthfulqa [Bug] Commonsenseqa dataset fix (#1425) 2024-08-16 15:54:07 +08:00
tydiqa [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
wikibench [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
wikitext [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
winograd [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
winogrande [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00
XCOPA [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
xiezhi [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
XLSum Update configs (#9) 2023-07-06 12:27:41 +08:00
Xsum [Feature] Support ModelScope datasets (#1289) 2024-07-29 13:48:32 +08:00