OpenCompass/tools
Songyang Zhang fd6fbf01a2
[Update] Support AIME-24 Evaluation for DeepSeek-R1 series (#1888)
* Update

* Update

* Update

* Update
2025-02-25 20:34:41 +08:00
..
case_analyzer.py [Docs] update descriptions for tools (#270) 2023-08-25 16:00:26 +08:00
collect_code_preds.py [Feat] support wizardcoder series (#344) 2023-09-06 17:52:35 +08:00
compare_configs.py [Feature] Support OpenAI ChatCompletion (#1389) 2024-08-01 19:10:13 +08:00
convert_alignmentbench.py [Sync] Sync with internal codes 2024.06.28 (#1279) 2024-06-28 14:16:34 +08:00
list_configs.py [Update] Support AIME-24 Evaluation for DeepSeek-R1 series (#1888) 2025-02-25 20:34:41 +08:00
prediction_merger.py [Sync] Sync with internal codes 2024.06.28 (#1279) 2024-06-28 14:16:34 +08:00
prompt_viewer.py [Feature] Add huggingface apply_chat_template (#1098) 2024-05-14 14:50:16 +08:00
test_api_model.py initial commit 2023-07-04 21:34:55 +08:00
update_dataset_suffix.py [Update] Update O1-style Benchmark and Prompts (#1742) 2024-12-09 13:48:56 +08:00
viz_multi_model.py [Feature] Add multi model viz (#509) 2023-10-30 12:11:33 +08:00