OpenCompass/tools
Mo Li 8142f399a8
[Feature] Upgrade the needle-in-a-haystack experiment to Needlebench (#913)
* add needlebench

* simplify needlebench 32k, 128k, 200k for eval

* update act prompt

* fix bug in needlebench summarizer

* add needlebench intro, fix summarizer

* lint summarizer

* fix linting error

* move readme.md

* update readme for needlebench

* update docs of needlebench

* simplify needlebench summarizers
2024-03-04 11:10:52 +08:00
..
case_analyzer.py [Docs] update descriptions for tools (#270) 2023-08-25 16:00:26 +08:00
collect_code_preds.py [Feat] support wizardcoder series (#344) 2023-09-06 17:52:35 +08:00
convert_alignmentbench.py [Fix] Fix small bug in alignbench (#764) 2024-01-03 07:44:53 +00:00
eval_mmbench.py [Script] Add scripts to evaluate MMBench (#161) 2023-08-07 16:53:36 +08:00
list_configs.py [Feature] Simplify entry script (#204) 2023-08-25 17:36:30 +08:00
prediction_merger.py [Docs] update descriptions for tools (#270) 2023-08-25 16:00:26 +08:00
prompt_viewer.py [Sync] Merge branch 'dev' into zfz/update-keyset-demo (#876) 2024-02-05 23:29:10 +08:00
test_api_model.py initial commit 2023-07-04 21:34:55 +08:00
update_dataset_suffix.py [Sync] Sync with internal codes 2023.01.08 (#777) 2024-01-08 14:07:24 +00:00
viz_multi_model.py [Feature] Add multi model viz (#509) 2023-10-30 12:11:33 +08:00