OpenCompass/opencompass/configs/datasets/subjective/multiround
bittersweet1999 f407930475
[Feature] Support subjective evaluation for reasoning model (#1868)
* fix pip version

* fix pip version

* add subeval for reasoning model

* add subeval for reasoning model

* update configs

* update config

* update config

* update config

* update files
2025-02-20 12:19:46 +08:00
..
mtbench101_judge_new.py [Feature] Support subjective evaluation for reasoning model (#1868) 2025-02-20 12:19:46 +08:00
mtbench101_judge.py [Feature] Support subjective evaluation for reasoning model (#1868) 2025-02-20 12:19:46 +08:00
mtbench_single_judge_diff_temp_new.py [Fix] Compatible with old versions (#1616) 2024-10-21 10:16:29 +08:00
mtbench_single_judge_diff_temp.py [Fix] Compatible with old versions (#1616) 2024-10-21 10:16:29 +08:00