File | Last commit | Date
__init__.py | [Feature] support compassbench Checklist evaluation (#1339) | 2024-07-19 16:40:44 +08:00
alignbench.py | [Fix] Fix Slurm ENV (#1392) | 2024-08-06 01:35:20 +08:00
arena_hard.py | [Fix] Fix Slurm ENV (#1392) | 2024-08-06 01:35:20 +08:00
compass_arena.py | [Refactor] Reorganize subjective eval (#1284) | 2024-07-05 22:11:37 +08:00
compassbench_checklist.py | [Feature] support compassbench Checklist evaluation (#1339) | 2024-07-19 16:40:44 +08:00
compassbench_control_length_bias.py | [Refactor] Reorganize subjective eval (#1284) | 2024-07-05 22:11:37 +08:00
compassbench.py | [Refactor] Reorganize subjective eval (#1284) | 2024-07-05 22:11:37 +08:00
corev2.py | reorganize subject files (#801) | 2024-01-16 18:03:11 +08:00
creationbench.py | reorganize subject files (#801) | 2024-01-16 18:03:11 +08:00
fofo.py | [Refactor] Reorganize subjective eval (#1284) | 2024-07-05 22:11:37 +08:00
information_retrival.py | reorganize subject files (#801) | 2024-01-16 18:03:11 +08:00
mtbench101.py | [Sync] bump version 0.2.6+local (#1294) | 2024-07-06 00:44:06 +08:00
mtbench.py | [Fix] Fix Slurm ENV (#1392) | 2024-08-06 01:35:20 +08:00
multiround.py | reorganize subject files (#801) | 2024-01-16 18:03:11 +08:00
subjective_cmp.py | [Fix] Fix Slurm ENV (#1392) | 2024-08-06 01:35:20 +08:00
wildbench.py | [Fix] minor update wildbench (#1335) | 2024-07-26 11:19:04 +08:00