OpenCompass/tools
Songyang Zhang 0d8df541bc
[Update] Update O1-style Benchmark and Prompts (#1742)
* Update JuderBench

* Support O1-style Prompts

* Update Code

* Update OpenAI

* Update BigCodeBench

* Update BigCodeBench

* Update BigCodeBench

* Update BigCodeBench

* Update BigCodeBench

* Update

* Update

* Update

* Update
2024-12-09 13:48:56 +08:00
..
case_analyzer.py [Docs] update descriptions for tools (#270) 2023-08-25 16:00:26 +08:00
collect_code_preds.py [Feat] support wizardcoder series (#344) 2023-09-06 17:52:35 +08:00
compare_configs.py [Feature] Support OpenAI ChatCompletion (#1389) 2024-08-01 19:10:13 +08:00
convert_alignmentbench.py [Sync] Sync with internal codes 2024.06.28 (#1279) 2024-06-28 14:16:34 +08:00
list_configs.py [Feature] Simplify entry script (#204) 2023-08-25 17:36:30 +08:00
prediction_merger.py [Sync] Sync with internal codes 2024.06.28 (#1279) 2024-06-28 14:16:34 +08:00
prompt_viewer.py [Feature] Add huggingface apply_chat_template (#1098) 2024-05-14 14:50:16 +08:00
test_api_model.py initial commit 2023-07-04 21:34:55 +08:00
update_dataset_suffix.py [Update] Update O1-style Benchmark and Prompts (#1742) 2024-12-09 13:48:56 +08:00
viz_multi_model.py [Feature] Add multi model viz (#509) 2023-10-30 12:11:33 +08:00