OpenCompass/opencompass/configs/datasets
Linchen Xiao 8e55c9c6ee
[Update] Compassbench v1.3 (#1396)
* stash files

* compassbench subjective evaluation added

* evaluation update

* fix lint

* update docs

* Update lint

* changes saved

* changes saved

* CompassBench subjective summarizer added (#1349)

* subjective summarizer added

* fix lint

[Fix] Fix MathBench (#1351)

Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>

[Update] Update model support list (#1353)

* fix pip version

* fix pip version

* update model support

subjective summarizer updated

knowledge, math objective done (data need update)

remove secrets

objective changes saved

knowledge data added

* secrets removed

* changed added

* summarizer modified

* summarizer modified

* compassbench coding added

* fix lint

* objective summarizer updated

* compass_bench_v1.3 updated

* update files in config folder

* remove unused model

* lcbench modified

* removed model evaluation configs

* remove duplicated sdk implementation

---------

Co-authored-by: zhangsongyang <zhangsongyang@pjlab.org.cn>
2024-08-12 19:09:19 +08:00
..
adv_glue [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
agieval [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
anli [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
anthropics_evals [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
apps [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
ARC_c [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
ARC_e [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
bbh [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
calm [Feature] Support OpenAI ChatCompletion (#1389) 2024-08-01 19:10:13 +08:00
ceval [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
CHARM [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
ChemBench [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
CIBench [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
civilcomments [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
clozeTest_maxmin [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
CLUE_afqmc [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
CLUE_C3 [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
CLUE_cmnli [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
CLUE_CMRC [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
CLUE_DRCD [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
CLUE_ocnli [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
cmb [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
cmmlu [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
collections [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
commonsenseqa [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
commonsenseqa_cn [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
compassbench_20_v1_1 [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
compassbench_20_v1_1_public [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
compassbench_v1_3 [Update] Compassbench v1.3 (#1396) 2024-08-12 19:09:19 +08:00
contamination [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
crowspairs [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
crowspairs_cn [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
cvalues [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
demo [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
drop [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
ds1000 [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
FewCLUE_bustm [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
FewCLUE_chid [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
FewCLUE_cluewsc [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
FewCLUE_csl [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
FewCLUE_eprstmt [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
FewCLUE_ocnli_fc [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
FewCLUE_tnews [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
FinanceIQ [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
flames [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
flores [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
game24 [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
GaokaoBench [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
GLUE_CoLA [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
GLUE_MRPC [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
GLUE_QQP [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
govrepcrs [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
gpqa [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
gsm8k [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
gsm8k_contamination [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
gsm_hard [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
hellaswag [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
humaneval [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
humaneval_cn [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
humaneval_multi [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
humaneval_plus [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
humanevalx [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
hungarian_exam [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
IFEval [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
inference_ppl [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
infinitebench [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
iwslt2017 [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
jigsawmultilingual [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
kaoshi [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
lambada [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
lawbench [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
LCBench [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
lcsts [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
leval [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
llm_compression [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
longbench [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
lveval [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
mastermath2024v1 [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
math [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
math401 [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
MathBench [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
mbpp [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
mbpp_cn [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
mbpp_plus [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
MedBench [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
mgsm [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
mmlu [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
mmlu_pro [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
MMLUArabic [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
narrativeqa [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
needlebench [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
NPHardEval [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
nq [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
nq_cn [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
obqa [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
OpenFinData [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
piqa [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
PJExam [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
promptbench [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
py150 [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
qabench [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
qasper [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
qaspercut [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
QuALITY [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
race [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
realtoxicprompts [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
rolebench [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
s3eval [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
safety [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
scibench [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
siqa [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
squad20 [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
storycloze [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
strategyqa [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
subjective [Update] Compassbench v1.3 (#1396) 2024-08-12 19:09:19 +08:00
summedits [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
summscreen [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
SuperGLUE_AX_b [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
SuperGLUE_AX_g [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
SuperGLUE_BoolQ [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
SuperGLUE_CB [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
SuperGLUE_COPA [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
SuperGLUE_MultiRC [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
SuperGLUE_ReCoRD [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
SuperGLUE_RTE [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
SuperGLUE_WiC [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
SuperGLUE_WSC [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
SVAMP [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
TabMWP [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
taco [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
teval [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
TheoremQA [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
triviaqa [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
triviaqarc [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
truthfulqa [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
tydiqa [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
wikibench [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
wikitext [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
winograd [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
winogrande [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
XCOPA [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
xiezhi [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
XLSum [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
Xsum [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00