OpenCompass/opencompass/configs/datasets/SimpleQA/README.md

# OpenCompass SimpleQA dataset config for evaluation

## 1. Introduction

SimpleQA is a benchmark that evaluates the ability of language models to answer short, fact-seeking questions by OpenAI.
The original site is https://github.com/openai/simple-evals.

## 2. How to use

Please refer to the demo evaluation script `/opencompass/configs/mine/simpleqa_eval.py`.
[Feature] Add Openai Simpleqa dataset (#1720) * Add Openai SimpleQA dataset * Add Openai SimpleQA dataset * Add Openai SimpleQA dataset * Update eval_simpleqa.py --------- Co-authored-by: Linchen Xiao <xxllcc1993@gmail.com> 2024-11-28 19:16:07 +08:00			`# OpenCompass SimpleQA dataset config for evaluation`

			`## 1. Introduction`

			`SimpleQA is a benchmark that evaluates the ability of language models to answer short, fact-seeking questions by OpenAI.`
			`The original site is https://github.com/openai/simple-evals.`

			`## 2. How to use`

			Please refer to the demo evaluation script `/opencompass/configs/mine/simpleqa_eval.py`.