mirror of
https://github.com/open-compass/opencompass.git
synced 2025-05-30 16:03:24 +08:00
![]() * stash files * compassbench subjective evaluation added * evaluation update * fix lint * update docs * Update lint * changes saved * changes saved * CompassBench subjective summarizer added (#1349) * subjective summarizer added * fix lint [Fix] Fix MathBench (#1351) Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn> [Update] Update model support list (#1353) * fix pip version * fix pip version * update model support subjective summarizer updated knowledge, math objective done (data need update) remove secrets objective changes saved knowledge data added * secrets removed * changed added * summarizer modified * summarizer modified * compassbench coding added * fix lint * objective summarizer updated * compass_bench_v1.3 updated * update files in config folder * remove unused model * lcbench modified * removed model evaluation configs * remove duplicated sdk implementation --------- Co-authored-by: zhangsongyang <zhangsongyang@pjlab.org.cn> |
||
---|---|---|
.. | ||
groups | ||
agent_bench.py | ||
charm_reason.py | ||
chat_OC15_multi_faceted.py | ||
chat_OC15.py | ||
cibench.py | ||
code_passk.py | ||
compassbench_v1_1_objective_public.py | ||
compassbench_v1_1_objective.py | ||
compassbench_v1_3_objective.py | ||
compassbench_v1_objective.py | ||
contamination.py | ||
example.py | ||
infinitebench.py | ||
internlm2_keyset.py | ||
lawbench.py | ||
leaderboard.py | ||
leval.py | ||
longbench.py | ||
longeval_v2.py | ||
lveval.py | ||
math_agent.py | ||
math_baseline.py | ||
mathbench_v1.py | ||
mathbench.py | ||
medium.py | ||
mmlu_pro.py | ||
needlebench.py | ||
plugineval.py | ||
small.py | ||
subjective.py | ||
teval.py | ||
tiny.py |