mirror of
https://github.com/open-compass/opencompass.git
synced 2025-05-30 16:03:24 +08:00
![]() * fix lint issues * updated gitignore * changed infer_order from random to double for the pairwise_judge.py (not changing for pairwise_bt_judge.py * added return statement to CompassArenaBradleyTerrySummarizer to return overall score for each judger model |
||
---|---|---|
.. | ||
alignbench | ||
alpaca_eval | ||
arena_hard | ||
compass_arena_subjective_bench | ||
compassarena | ||
compassbench | ||
flames | ||
fofo | ||
followbench | ||
hellobench | ||
judgerbench | ||
multiround | ||
wildbench |