.. |
groups
|
[Sync] Sync with internal codes 2023.01.08 (#777)
|
2024-01-08 14:07:24 +00:00 |
agent_bench.py
|
[Sync] Sync with internal codes 2023.01.08 (#777)
|
2024-01-08 14:07:24 +00:00 |
cibench.py
|
[Sync] Sync with internal codes 2023.01.08 (#777)
|
2024-01-08 14:07:24 +00:00 |
code_passk.py
|
[Sync] Sync with internal codes 2023.01.08 (#777)
|
2024-01-08 14:07:24 +00:00 |
compass_knowledge.py
|
[Sync] Sync with internal codes 2023.01.08 (#777)
|
2024-01-08 14:07:24 +00:00 |
compass_math.py
|
[Sync] Sync with internal codes 2023.01.08 (#777)
|
2024-01-08 14:07:24 +00:00 |
compassbench_v1_language.py
|
[Sync] Sync with internal codes 2023.01.08 (#777)
|
2024-01-08 14:07:24 +00:00 |
compassbench_v1_reason.py
|
[Sync] Sync with internal codes 2023.01.08 (#777)
|
2024-01-08 14:07:24 +00:00 |
contamination.py
|
[Feature] Contamination analysis for MMLU, Hellaswag, and ARC_c (#699)
|
2024-01-08 15:51:48 +08:00 |
example.py
|
[Sync] update github token (#475)
|
2023-10-13 06:50:54 -05:00 |
infinitebench.py
|
[Feature] Add InfiniteBench (#739)
|
2023-12-26 15:36:27 +08:00 |
lawbench.py
|
[Feature] Add lawbench (#460)
|
2023-10-13 06:51:36 -05:00 |
leaderboard.py
|
[Sync] update github token (#475)
|
2023-10-13 06:50:54 -05:00 |
leval.py
|
[Sync] update github token (#475)
|
2023-10-13 06:50:54 -05:00 |
longbench.py
|
[Sync] update github token (#475)
|
2023-10-13 06:50:54 -05:00 |
longeval_v2.py
|
[Sync] Sync with internal codes 2023.01.08 (#777)
|
2024-01-08 14:07:24 +00:00 |
math_agent.py
|
[Feat] Update math/agent (#716)
|
2023-12-19 21:20:42 +08:00 |
math_baseline.py
|
[Feat] Update math/agent (#716)
|
2023-12-19 21:20:42 +08:00 |
mathbench.py
|
Mathbench update postprocess (#600)
|
2023-11-20 16:48:55 +08:00 |
medium.py
|
[Feature] Use dataset in local path (#570)
|
2023-11-13 13:00:37 +08:00 |
small.py
|
[Feature] Use dataset in local path (#570)
|
2023-11-13 13:00:37 +08:00 |
subjective.py
|
[Sync] update github token (#475)
|
2023-10-13 06:50:54 -05:00 |