OpenCompass/configs/summarizers/groups
klein e4830a6926
Update CIBench (#1089)
* modify the requirements/runtime.txt: numpy==1.23.4 --> numpy>=1.23.4

* update cibench: dataset and evluation

* cibench summarizer bug

* update cibench

* move extract_code import

---------

Co-authored-by: zhangchuyu@pjlab.org.cn <zhangchuyu@pjlab.org.cn>
Co-authored-by: Leymore <zfz-960727@163.com>
2024-04-26 18:46:02 +08:00
..
agieval.py Add release contribution 2023-07-05 03:15:31 +00:00
bbh.py [Feat] support opencompass 2023-07-04 22:11:33 +08:00
ceval.py [Feature] re-implement ceval load dataset (#446) 2023-09-27 21:18:48 +08:00
cibench.py Update CIBench (#1089) 2024-04-26 18:46:02 +08:00
cmmlu.py [Feature] Add qwen & qwen-chat support (#286) 2023-08-31 11:29:05 +08:00
ds1000.py [Sync] some renaming (#641) 2023-11-27 16:06:49 +08:00
flores.py Update configs (#9) 2023-07-06 12:27:41 +08:00
GaokaoBench.py Update configs (#9) 2023-07-06 12:27:41 +08:00
infinitebench.py [Feature] Add InfiniteBench (#739) 2023-12-26 15:36:27 +08:00
jigsaw_multilingual.py Update configs (#9) 2023-07-06 12:27:41 +08:00
lawbench.py [Feature] Add lawbench (#460) 2023-10-13 06:51:36 -05:00
lcbench.py [Sync] update taco (#1030) 2024-04-09 17:50:23 +08:00
leval.py [Sync] Add InternLM2 Keyset Evaluation Demo (#807) 2024-01-17 13:48:12 +08:00
longbench.py [Sync] Add InternLM2 Keyset Evaluation Demo (#807) 2024-01-17 13:48:12 +08:00
lveval.py [Feature] add lveval benchmark (#914) 2024-03-04 11:22:03 +08:00
mathbench_agent.py [Sync] Add InternLM2 Keyset Evaluation Demo (#807) 2024-01-17 13:48:12 +08:00
mathbench_v1.py [Sync] update taco (#1030) 2024-04-09 17:50:23 +08:00
mathbench.py [Sync] Add InternLM2 Keyset Evaluation Demo (#807) 2024-01-17 13:48:12 +08:00
mmlu.py OpenCompass Public MR 2023-07-05 03:15:21 +00:00
plugineval.py [Sync] update taco (#1030) 2024-04-09 17:50:23 +08:00
scibench.py add evaluation of scibench (#393) 2023-09-22 17:42:08 +08:00
teval.py [Sync] Merge branch 'dev' into zfz/update-keyset-demo (#876) 2024-02-05 23:29:10 +08:00
tydiqa.py [Sync] Fix cmnli, fix vicuna meta template, fix longbench postprocess and other minor fixes (#625) 2023-11-23 14:05:59 +08:00
xiezhi.py [Feature] Add qwen & qwen-chat support (#286) 2023-08-31 11:29:05 +08:00