mirror of
https://github.com/open-compass/opencompass.git
synced 2025-05-30 16:03:24 +08:00
Add olymmath dataset
This commit is contained in:
parent
a75b8fa98e
commit
006a70748f
@ -1,9 +1,8 @@
|
||||
# OlymMATH
|
||||
[GitHub Link](https://github.com/RUCAIBox/OlymMATH)
|
||||
|
||||
This is a implementation of HLE dataset, which evaluates 2370 text-based questions without images. The default setting is to use LLM as a judge.
|
||||
|
||||
Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models
|
||||
Dataset OlymMATH, please refer to the paper:
|
||||
Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models by Haoxiang Sun, Yingqian Min, Zhipeng Chen, Wayne Xin Zhao, Zheng Liu, Zhongyuan Wang, Lei Fang, and Ji-Rong Wen.
|
||||
|
||||
|
||||
## How to eval OlymMATH with model judge
|
||||
|
Loading…
Reference in New Issue
Block a user