OpenCompass/tools
Mo Li 33f8df1ca3
[Update] Change NeedleInAHaystackDataset to dynamic dataset loading (#754)
* Add NeedleInAHaystack Test

* Apply pre-commit formatting

* Update configs/eval_hf_internlm_chat_20b_cdme.py

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

* add needle in haystack test

* update needle in haystack test

* update plot function in tools_needleinahaystack.py

* optimizing needleinahaystack dataset generation strategy

* modify minor formatting issues

* add English version support

* change NeedleInAHaystackDataset to dynamic loading

* change NeedleInAHaystackDataset to dynamic loading

* fix needleinahaystack test eval bug

* fix needleinahaystack config bug

---------

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
2024-01-02 17:22:56 +08:00
..
case_analyzer.py [Docs] update descriptions for tools (#270) 2023-08-25 16:00:26 +08:00
collect_code_preds.py [Feat] support wizardcoder series (#344) 2023-09-06 17:52:35 +08:00
convert_alignmentbench.py Update merge script (#733) 2023-12-25 16:45:22 +08:00
eval_mmbench.py [Script] Add scripts to evaluate MMBench (#161) 2023-08-07 16:53:36 +08:00
list_configs.py [Feature] Simplify entry script (#204) 2023-08-25 17:36:30 +08:00
prediction_merger.py [Docs] update descriptions for tools (#270) 2023-08-25 16:00:26 +08:00
prompt_viewer.py [Sync] some renaming (#641) 2023-11-27 16:06:49 +08:00
test_api_model.py initial commit 2023-07-04 21:34:55 +08:00
tools_needleinahaystack.py [Update] Change NeedleInAHaystackDataset to dynamic dataset loading (#754) 2024-01-02 17:22:56 +08:00
update_dataset_suffix.py [Refactor] Move fix_id_list to Retriever (#442) 2023-10-07 12:53:41 +08:00
viz_multi_model.py [Feature] Add multi model viz (#509) 2023-10-30 12:11:33 +08:00