Mirror of https://github.com/open-compass/opencompass.git (synced 2025-05-30 16:03:24 +08:00)

Description
A math dataset composed of problems from AIME2024 (the 2024 American Invitational Mathematics Examination).
Performance
| Qwen2.5-Math-72B-Instruct | Qwen2.5-Math-7B-Instruct | Qwen2-Math-7B-Instruct | Qwen2-Math-1.5B-Instruct | internlm2-math-7b |
|---|---|---|---|---|
| 20.00 | 16.67 | 16.67 | 13.33 | 3.33 |

| Qwen2.5-72B-Instruct | Qwen2.5-7B-Instruct | internlm2_5-7b-chat |
|---|---|---|
| 31.25 | 26.44 | 9.13 |