OpenCompass/opencompass
Mo Li 33f8df1ca3
[Update] Change NeedleInAHaystackDataset to dynamic dataset loading (#754)
* Add NeedleInAHaystack Test

* Apply pre-commit formatting

* Update configs/eval_hf_internlm_chat_20b_cdme.py

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

* add needle in haystack test

* update needle in haystack test

* update plot function in tools_needleinahaystack.py

* optimizing needleinahaystack dataset generation strategy

* modify minor formatting issues

* add English version support

* change NeedleInAHaystackDataset to dynamic loading

* change NeedleInAHaystackDataset to dynamic loading

* fix needleinahaystack test eval bug

* fix needleinahaystack config bug

---------

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
2024-01-02 17:22:56 +08:00
..
datasets [Update] Change NeedleInAHaystackDataset to dynamic dataset loading (#754) 2024-01-02 17:22:56 +08:00
lagent [Feat] update python action and slurm (#694) 2023-12-13 10:41:10 +08:00
metrics [Feat] Support multi-modal evaluation on MME benchmark. (#197) 2023-08-21 15:53:20 +08:00
models [Feat] update code config (#749) 2023-12-29 18:46:34 +08:00
multimodal [Feature]: To be compatible with the latest version of MiniGPT-4 (#539) 2023-11-04 09:50:36 +08:00
openicl [Sync] update configs (#734) 2023-12-25 21:59:16 +08:00
partitioners [Fix] Fix subjective alignbench (#730) 2023-12-23 20:06:53 +08:00
runners [Fix] Update alignmentbench (#704) 2023-12-14 18:24:21 +08:00
summarizers add creationbench (#753) 2023-12-29 10:03:44 +00:00
tasks [Fix] SubSizePartition fix (#746) 2023-12-28 11:46:46 +08:00
utils [Sync] update configs (#734) 2023-12-25 21:59:16 +08:00
__init__.py [Sync] format (#690) 2023-12-12 14:03:45 +08:00
registry.py [Sync] update github token (#475) 2023-10-13 06:50:54 -05:00