OpenCompass/opencompass
liushz 2737249f31
[Feature] Add mathbench dataset and circular evaluator (#408)
* add_mathbench

* update mathbench

* support non circular eval dataset

---------

Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>
Co-authored-by: yingfhu <yingfhu@gmail.com>
2023-10-18 04:08:31 -05:00
Name | Last commit | Date
datasets | [Feature] Add mathbench dataset and circular evaluator (#408) | 2023-10-18 04:08:31 -05:00
metrics | [Feat] Support multi-modal evaluation on MME benchmark. (#197) | 2023-08-21 15:53:20 +08:00
models | Integrate turbomind inference via its RPC API instead of its python API (#414) | 2023-10-07 10:27:48 +08:00
multimodal | [Fix] Fix performance issue of visualglm. (#424) | 2023-09-21 19:54:23 +08:00
openicl | [Feature] Add mathbench dataset and circular evaluator (#408) | 2023-10-18 04:08:31 -05:00
partitioners | [Sync] update github token (#475) | 2023-10-13 06:50:54 -05:00
runners | [Sync] Initial support of subjective evaluation (#421) | 2023-09-22 15:42:31 +08:00
summarizers | [Sync] update github token (#475) | 2023-10-13 06:50:54 -05:00
tasks | [Fix] Split if and only if complete eos string shows up (#477) | 2023-10-13 06:52:20 -05:00
utils | fix summary default (#483) | 2023-10-17 11:32:38 +08:00
__init__.py | Bump version to 0.1.6 (#478) | 2023-10-13 06:54:51 -05:00
registry.py | [Sync] update github token (#475) | 2023-10-13 06:50:54 -05:00