👋 join us on Discord and WeChat
> \[!IMPORTANT\] > > **Star Us**, You will receive all release notifications from GitHub without any delay ~ ⭐️Language | Knowledge | Reasoning | Examination |
Word Definition- WiC - SummEditsIdiom Learning- CHIDSemantic Similarity- AFQMC - BUSTMCoreference Resolution- CLUEWSC - WSC - WinoGrandeTranslation- Flores - IWSLT2017Multi-language Question Answering- TyDi-QA - XCOPAMulti-language Summary- XLSum |
Knowledge Question Answering- BoolQ - CommonSenseQA - NaturalQuestions - TriviaQA |
Textual Entailment- CMNLI - OCNLI - OCNLI_FC - AX-b - AX-g - CB - RTE - ANLICommonsense Reasoning- StoryCloze - COPA - ReCoRD - HellaSwag - PIQA - SIQAMathematical Reasoning- MATH - GSM8KTheorem Application- TheoremQA - StrategyQA - SciBenchComprehensive Reasoning- BBH |
Junior High, High School, University, Professional Examinations- C-Eval - AGIEval - MMLU - GAOKAO-Bench - CMMLU - ARC - XiezhiMedical Examinations- CMB |
Understanding | Long Context | Safety | Code |
Reading Comprehension- C3 - CMRC - DRCD - MultiRC - RACE - DROP - OpenBookQA - SQuAD2.0Content Summary- CSL - LCSTS - XSum - SummScreenContent Analysis- EPRSTMT - LAMBADA - TNEWS |
Long Context Understanding- LEval - LongBench - GovReports - NarrativeQA - Qasper |
Safety- CivilComments - CrowsPairs - CValues - JigsawMultilingual - TruthfulQARobustness- AdvGLUE |
Code- HumanEval - HumanEvalX - MBPP - APPs - DS1000 |
Open-source Models | API Models |
- [Alpaca](https://github.com/tatsu-lab/stanford_alpaca) - [Baichuan](https://github.com/baichuan-inc) - [BlueLM](https://github.com/vivo-ai-lab/BlueLM) - [ChatGLM2](https://github.com/THUDM/ChatGLM2-6B) - [ChatGLM3](https://github.com/THUDM/ChatGLM3-6B) - [Gemma](https://huggingface.co/google/gemma-7b) - [InternLM](https://github.com/InternLM/InternLM) - [LLaMA](https://github.com/facebookresearch/llama) - [LLaMA3](https://github.com/meta-llama/llama3) - [Qwen](https://github.com/QwenLM/Qwen) - [TigerBot](https://github.com/TigerResearch/TigerBot) - [Vicuna](https://github.com/lm-sys/FastChat) - [WizardLM](https://github.com/nlpxucan/WizardLM) - [Yi](https://github.com/01-ai/Yi) - …… | - OpenAI - Gemini - Claude - ZhipuAI(ChatGLM) - Baichuan - ByteDance(YunQue) - Huawei(PanGu) - 360 - Baidu(ERNIEBot) - MiniMax(ABAB-Chat) - SenseTime(nova) - Xunfei(Spark) - …… |
|
---|