OpenCompass/opencompass/configs/datasets/SimpleQA/README.md
liushz c437135fad
[Feature] Add Openai Simpleqa dataset (#1720)
* Add Openai SimpleQA dataset

* Add Openai SimpleQA dataset

* Add Openai SimpleQA dataset

* Update eval_simpleqa.py

---------

Co-authored-by: Linchen Xiao <xxllcc1993@gmail.com>
2024-11-28 19:16:07 +08:00

11 lines
364 B
Markdown

# OpenCompass SimpleQA dataset config for evaluation
## 1. Introduction
SimpleQA is a benchmark that evaluates the ability of language models to answer short, fact-seeking questions by OpenAI.
The original site is https://github.com/openai/simple-evals.
## 2. How to use
Please refer to the demo evaluation script `/opencompass/configs/mine/simpleqa_eval.py`.