OpenCompass/opencompass/configs/datasets/aime2024/README.md
Junnan Liu 645c5f3b2c
[Datasets] Add datasets CMO&AIME (#1610)
* add datasets cmo&aime
* delete unused modules
* modify prompt
* update __init__
* update data load and add README
* update data load
* update performance
* update md5
* remove indents
* add indent
* fix log for debug mode
2024-10-28 18:08:02 +08:00

### Description
A math dataset composed of problems from AIME 2024 (the 2024 American Invitational Mathematics Examination).
### Performance
| Qwen2.5-Math-72B-Instruct | Qwen2.5-Math-7B-Instruct | Qwen2-Math-7B-Instruct | Qwen2-Math-1.5B-Instruct | internlm2-math-7b |
| ----------- | ----------- | ----------- | ----------- | ----------- |
| 20.00 | 16.67 | 16.67 | 13.33 | 3.33 |

| Qwen2.5-72B-Instruct | Qwen2.5-7B-Instruct | internlm2_5-7b-chat |
| ----------- | ----------- | ----------- |
| 31.25 | 26.44 | 9.13 |