OpenCompass/opencompass
Linchen Xiao 8e55c9c6ee
[Update] Compassbench v1.3 (#1396)
* stash files

* compassbench subjective evaluation added

* evaluation update

* fix lint

* update docs

* Update lint

* changes saved

* changes saved

* CompassBench subjective summarizer added (#1349)

* subjective summarizer added

* fix lint

[Fix] Fix MathBench (#1351)

Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>

[Update] Update model support list (#1353)

* fix pip version

* fix pip version

* update model support

subjective summarizer updated

knowledge, math objective done (data need update)

remove secrets

objective changes saved

knowledge data added

* secrets removed

* changed added

* summarizer modified

* summarizer modified

* compassbench coding added

* fix lint

* objective summarizer updated

* compass_bench_v1.3 updated

* update files in config folder

* remove unused model

* lcbench modified

* removed model evaluation configs

* remove duplicated sdk implementation

---------

Co-authored-by: zhangsongyang <zhangsongyang@pjlab.org.cn>
2024-08-12 19:09:19 +08:00
cli [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
configs [Update] Compassbench v1.3 (#1396) 2024-08-12 19:09:19 +08:00
datasets [Update] Compassbench v1.3 (#1396) 2024-08-12 19:09:19 +08:00
lagent Update CIBench (#1089) 2024-04-26 18:46:02 +08:00
metrics [Feat] Support multi-modal evaluation on MME benchmark. (#197) 2023-08-21 15:53:20 +08:00
models [Update] Compassbench v1.3 (#1396) 2024-08-12 19:09:19 +08:00
openicl [Feature] Support inference ppl datasets (#1315) 2024-07-22 17:59:30 +08:00
partitioners [Feature] Support import configs/models/summarizers from whl (#1376) 2024-08-01 00:42:48 +08:00
runners [Doc] Update README (#1404) 2024-08-08 16:18:33 +08:00
summarizers [Update] Compassbench v1.3 (#1396) 2024-08-12 19:09:19 +08:00
tasks [Fix] Fix Slurm ENV (#1392) 2024-08-06 01:35:20 +08:00
utils [Update] Compassbench v1.3 (#1396) 2024-08-12 19:09:19 +08:00
__init__.py [Bump] Bump version for v0.3.0 (#1398) 2024-08-07 01:25:24 +08:00
registry.py [Deprecate] Remove multi-modal related stuff (#1072) 2024-04-26 21:20:14 +08:00