agieval | [Fix] Fix AGIEval multiple choice (#137) | 2023-08-10 11:38:24 +08:00
__init__.py | [Feature] Add Xiezhi SQuAD2.0 ANLI (#101) | 2023-08-10 14:04:18 +08:00
afqmcd.py | Add release contribution | 2023-07-05 03:15:31 +00:00
anli.py | [Feature] Add Xiezhi SQuAD2.0 ANLI (#101) | 2023-08-10 14:04:18 +08:00
arc.py | Add Release Contraibution | 2023-07-05 02:22:40 +00:00
ax.py | Add release contribution | 2023-07-05 03:15:31 +00:00
base.py | Add release contribution | 2023-07-05 03:15:31 +00:00
bbh.py | Add release contribution | 2023-07-05 03:15:31 +00:00
boolq.py | Add release contribution | 2023-07-05 03:15:31 +00:00
bustum.py | Add release contribution | 2023-07-05 03:15:31 +00:00
c3.py | initial commit | 2023-07-04 21:34:55 +08:00
cb.py | [Feat] support opencompass | 2023-07-04 22:11:33 +08:00
ceval.py | [Enhancement] Test linting in CI and fix existing linting errors (#69) | 2023-07-17 15:59:10 +08:00
chid.py | [Feat] support opencompass | 2023-07-04 22:11:33 +08:00
civilcomments.py | [Feat] support opencompass | 2023-07-04 22:11:33 +08:00
cluewsc.py | initial commit | 2023-07-04 21:34:55 +08:00
cmmlu.py | [Feature] Add CMMLU dataset (#91) | 2023-07-25 10:14:27 +08:00
cmnli.py | update datasets | 2023-07-05 01:45:26 +00:00
cmrc.py | Update configs (#9) | 2023-07-06 12:27:41 +08:00
commonsenseqa.py | [Feat] support opencompass | 2023-07-04 22:11:33 +08:00
copa.py | initial commit | 2023-07-04 21:34:55 +08:00
crowspairs.py | [Refactor] Update crows-pairs evaluation (#98) | 2023-07-26 11:21:32 +08:00
csl.py | Support a batch of datasets. | 2023-07-05 01:30:27 +00:00
cvalues.py | [Feat] Support CValues Responsibility dataset (#78) | 2023-07-18 18:45:15 +08:00
drcd.py | Update configs (#9) | 2023-07-06 12:27:41 +08:00
drop.py | Support a batch of datasets. | 2023-07-05 01:30:27 +00:00
eprstmt.py | [Feat] support opencompass | 2023-07-04 22:11:33 +08:00
flores.py | [Enhancement] Test linting in CI and fix existing linting errors (#69) | 2023-07-17 15:59:10 +08:00
GaokaoBench.py | [Feature] Add logger info and remove dataset bugs (#61) | 2023-07-17 14:26:30 +08:00
govrepcrs.py | Add release contribution | 2023-07-05 03:15:31 +00:00
gsm8k.py | Support a batch of datasets. | 2023-07-05 01:30:27 +00:00
hellaswag.py | initial commit | 2023-07-04 21:34:55 +08:00
huggingface.py | [Feat] support opencompass | 2023-07-04 22:11:33 +08:00
humaneval.py | [Enhancement] Add humaneval postprocessor for GPT models & eval config for GPT4, enhance the original humaneval postprocessor (#129) | 2023-08-10 16:31:12 +08:00
iwslt2017.py | Add release contribution | 2023-07-05 03:15:31 +00:00
jigsawmultilingual.py | initial commit | 2023-07-04 21:34:55 +08:00
lambada.py | initial commit | 2023-07-04 21:34:55 +08:00
lcsts.py | Support a batch of datasets. | 2023-07-05 01:30:27 +00:00
math.py | Update configs (#9) | 2023-07-06 12:27:41 +08:00
mbpp.py | Update configs (#9) | 2023-07-06 12:27:41 +08:00
mmlu.py | [Feature] Add logger info and remove dataset bugs (#61) | 2023-07-17 14:26:30 +08:00
multirc.py | initial commit | 2023-07-04 21:34:55 +08:00
narrativeqa.py | Add release contribution | 2023-07-05 03:15:31 +00:00
natural_question.py | Update configs (#9) | 2023-07-06 12:27:41 +08:00
obqa.py | Support a batch of datasets. | 2023-07-05 01:30:27 +00:00
piqa.py | [Feat] support opencompass | 2023-07-04 22:11:33 +08:00
qasper.py | Add Release Contraibution | 2023-07-05 02:22:40 +00:00
qaspercut.py | Add Release Contraibution | 2023-07-05 02:22:40 +00:00
race.py | initial commit | 2023-07-04 21:34:55 +08:00
realtoxicprompts.py | Update configs (#9) | 2023-07-06 12:27:41 +08:00
record.py | initial commit | 2023-07-04 21:34:55 +08:00
safety.py | Add Release Contraibution | 2023-07-05 02:22:40 +00:00
siqa.py | [Feature] Evaluating acc based on minimum edit distance, update SIQA (#130) | 2023-08-01 14:24:27 +08:00
squad20.py | [Feature] Add Xiezhi SQuAD2.0 ANLI (#101) | 2023-08-10 14:04:18 +08:00
storycloze.py | initial commit | 2023-07-04 21:34:55 +08:00
strategyqa.py | Update configs (#9) | 2023-07-06 12:27:41 +08:00
summedits.py | [Enhancement] Test linting in CI and fix existing linting errors (#69) | 2023-07-17 15:59:10 +08:00
summscreen.py | Support a batch of datasets. | 2023-07-05 01:30:27 +00:00
TheoremQA.py | Update configs (#9) | 2023-07-06 12:27:41 +08:00
tnews.py | initial commit | 2023-07-04 21:34:55 +08:00
triviaqa.py | Update configs (#9) | 2023-07-06 12:27:41 +08:00
triviaqarc.py | Add Release Contraibution | 2023-07-05 02:22:40 +00:00
truthfulqa.py | initial commit | 2023-07-04 21:34:55 +08:00
tydiqa.py | [Feature] Add tydiqa-goldp (#75) | 2023-07-18 14:54:35 +08:00
wic.py | Support a batch of datasets. | 2023-07-05 01:30:27 +00:00
winograd.py | initial commit | 2023-07-04 21:34:55 +08:00
winogrande.py | Add Release Contraibution | 2023-07-05 02:22:40 +00:00
wsc.py | Update configs (#9) | 2023-07-06 12:27:41 +08:00
xcopa.py | Add Release Contraibution | 2023-07-05 02:22:40 +00:00
xiezhi.py | [Feature] Add Xiezhi SQuAD2.0 ANLI (#101) | 2023-08-10 14:04:18 +08:00
xlsum.py | update datasets | 2023-07-05 01:45:26 +00:00
xsum.py | update datasets | 2023-07-05 01:45:26 +00:00