OpenCompass/configs/summarizers/groups
Linchen Xiao a4b54048ae
[Feature] Add Ruler datasets (#1310)
* [Feature] Add Ruler datasets

* pre-commit fixed

* Add model specific tokenizer to dataset

* pre-commit modified

* remove unused import

* fix linting

* add trust_remote to tokenizer load

* lint fix

* comments resolved

* fix lint

* Add readme

* Fix lint

* ruler refactorize

* fix lint

* lint fix

* updated

* lint fix

* fix wonderwords import issue

* prompt modified

* update

* readme updated

* update

* ruler dataset added

* Update

---------

Co-authored-by: tonysy <sy.zhangbuaa@gmail.com>
2024-08-20 11:40:11 +08:00
..
legacy [Sync] update github workflow (#1156) 2024-05-14 22:42:23 +08:00
agieval.py Add release contribution 2023-07-05 03:15:31 +00:00
bbh.py [Feat] support opencompass 2023-07-04 22:11:33 +08:00
calm.py Calm dataset (#1385) 2024-08-01 10:03:21 +08:00
ceval.py [Feature] re-implement ceval load dataset (#446) 2023-09-27 21:18:48 +08:00
charm_reason.py [Sync] format (#1214) 2024-05-30 00:21:58 +08:00
cibench.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
cmmlu.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
ds1000.py [Sync] some renaming (#641) 2023-11-27 16:06:49 +08:00
flores.py Update configs (#9) 2023-07-06 12:27:41 +08:00
GaokaoBench.py Update configs (#9) 2023-07-06 12:27:41 +08:00
infinitebench.py [Feature] Add InfiniteBench (#739) 2023-12-26 15:36:27 +08:00
jigsaw_multilingual.py Update configs (#9) 2023-07-06 12:27:41 +08:00
lawbench.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
lcbench.py [Sync] update taco (#1030) 2024-04-09 17:50:23 +08:00
leval.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
longbench.py Add en and zh groups to longbench summarizer; Fix longbench overall score (#1216) 2024-07-26 11:50:41 +08:00
lveval.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
mathbench_2024.py [Sync] bump version (#1204) 2024-05-28 23:09:59 +08:00
mathbench_agent.py [Sync] Add InternLM2 Keyset Evaluation Demo (#807) 2024-01-17 13:48:12 +08:00
mathbench_v1_2024_lang.py Update MathBench summarizer & fix cot setting (#1282) 2024-07-01 21:51:17 +08:00
mathbench_v1_2024.py Update MathBench summarizer & fix cot setting (#1282) 2024-07-01 21:51:17 +08:00
mathbench_v1.py [Sync] update taco (#1030) 2024-04-09 17:50:23 +08:00
mathbench.py [Sync] Add InternLM2 Keyset Evaluation Demo (#807) 2024-01-17 13:48:12 +08:00
mgsm.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
mmlu_pro.py [Sync] Sync with internal codes 2024.06.28 (#1279) 2024-06-28 14:16:34 +08:00
mmlu.py OpenCompass Public MR 2023-07-05 03:15:21 +00:00
MMLUArabic.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
plugineval.py [Sync] update taco (#1030) 2024-04-09 17:50:23 +08:00
ruler.py [Feature] Add Ruler datasets (#1310) 2024-08-20 11:40:11 +08:00
scibench.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
teval.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00
tydiqa.py [Sync] Fix cmnli, fix vicuna meta template, fix longbench postprocess and other minor fixes (#625) 2023-11-23 14:05:59 +08:00
xiezhi.py [Format] Add config lints (#892) 2024-05-14 15:35:58 +08:00