OpenCompass/opencompass
bittersweet1999 1c95790fdd
New subjective judgement (#660)
* TabMWP

* TabMWP

* fixed

* fixed

* fixed

* done

* done

* done

* add new subjective judgement

* add new subjective judgement

* add new subjective judgement

* add new subjective judgement

* add new subjective judgement

* modified to a more general way

* modified to a more general way

* final

* final

* add summarizer

* add new summarize

* fixed

* fixed

* fixed

---------

Co-authored-by: caomaosong <caomaosong@pjlab.org.cn>
2023-12-06 13:28:33 +08:00
..
datasets New subjective judgement (#660) 2023-12-06 13:28:33 +08:00
lagent [Feature] Support chat style inferencer. (#643) 2023-11-30 14:00:06 +08:00
metrics [Feat] Support multi-modal evaluation on MME benchmark. (#197) 2023-08-21 15:53:20 +08:00
models [Feat] update gsm8k and math agent config (#652) 2023-12-01 15:08:38 +08:00
multimodal [Feature]: To be compatible with the latest version of MiniGPT-4 (#539) 2023-11-04 09:50:36 +08:00
openicl [Feature] Support chat style inferencer. (#643) 2023-11-30 14:00:06 +08:00
partitioners [Sync] update (#517) 2023-10-27 20:31:22 +08:00
runners [Sync] Fix cmnli, fix vicuna meta template, fix longbench postprocess and other minor fixes (#625) 2023-11-23 14:05:59 +08:00
summarizers New subjective judgement (#660) 2023-12-06 13:28:33 +08:00
tasks [Bug] fix icl eval with nested list (#632) 2023-11-24 13:43:26 +08:00
utils [Feat] support zhipu post process (#642) 2023-11-27 19:57:36 +08:00
__init__.py [Sync] Bump version to 0.1.9 (#644) 2023-11-28 11:42:43 +08:00
registry.py [Sync] update github token (#475) 2023-10-13 06:50:54 -05:00