OpenCompass/configs/summarizers
liyucheng09 0b2863039e
[Feature] Contamination analysis for MMLU, Hellaswag, and ARC_c (#699)
* Contamination analysis for ARC_c, mmlu, and Hellaswag

* update `eval_contamination.py`

* update `contamination.py` summarizer

* fix `eval_contamination.py`

* add mmlu groups for contamination analysis
2024-01-08 15:51:48 +08:00
..
groups [Feature] Add InfiniteBench (#739) 2023-12-26 15:36:27 +08:00
contamination.py [Feature] Contamination analysis for MMLU, Hellaswag, and ARC_c (#699) 2024-01-08 15:51:48 +08:00
example.py [Sync] update github token (#475) 2023-10-13 06:50:54 -05:00
infinitebench.py [Feature] Add InfiniteBench (#739) 2023-12-26 15:36:27 +08:00
lawbench.py [Feature] Add lawbench (#460) 2023-10-13 06:51:36 -05:00
leaderboard.py [Sync] update github token (#475) 2023-10-13 06:50:54 -05:00
leval.py [Sync] update github token (#475) 2023-10-13 06:50:54 -05:00
longbench.py [Sync] update github token (#475) 2023-10-13 06:50:54 -05:00
math_agent.py [Feat] Update math/agent (#716) 2023-12-19 21:20:42 +08:00
math_baseline.py [Feat] Update math/agent (#716) 2023-12-19 21:20:42 +08:00
mathbench.py Mathbench update postprocess (#600) 2023-11-20 16:48:55 +08:00
medium.py [Feature] Use dataset in local path (#570) 2023-11-13 13:00:37 +08:00
small.py [Feature] Use dataset in local path (#570) 2023-11-13 13:00:37 +08:00
subjective.py [Sync] update github token (#475) 2023-10-13 06:50:54 -05:00