diff --git a/README.md b/README.md new file mode 100644 index 00000000..f8766154 --- /dev/null +++ b/README.md @@ -0,0 +1,313 @@ +
+ Language + | ++ Knowledge + | ++ Reasoning + | ++ Exam + | ++ Understanding + | +
+
+
+
+Word Definition+ +- WiC +- SummEdits + +
+
+
+Idiom Learning+ +- CHID + +
+
+
+Semantic Similarity+ +- AFQMC +- BUSTM + +
+
+
+Coreference Resolution+ +- CLUEWSC +- WSC +- WinoGrande + +
+
+ Translation+ +- Flores + + |
+
+
+
+
+Knowledge Question Answering+ +- BoolQ +- CommonSenseQA +- NaturalQuestion +- TrivialQA + +
+
+ Multi-language Question Answering+ +- TyDi-QA + + |
+
+
+
+
+Textual Entailment+ +- CMNLI +- OCNLI +- OCNLI_FC +- AX-b +- AX-g +- CB +- RTE + +
+
+
+Commonsense Reasoning+ +- StoryCloze +- StoryCloze-CN (coming soon) +- COPA +- ReCoRD +- HellaSwag +- PIQA +- SIQA + +
+
+
+Mathematical Reasoning+ +- MATH +- GSM8K + +
+
+
+Theorem Application+ +- TheoremQA + +
+
+
+Code+ +- HumanEval +- MBPP + +
+
+ Comprehensive Reasoning+ +- BBH + + |
+
+
+
+ Junior High, High School, University, Professional Examinations+ +- GAOKAO-2023 +- CEval +- AGIEval +- MMLU +- GAOKAO-Bench +- MMLU-CN (coming soon) +- ARC + + |
+
+
+
+
+Reading Comprehension+ +- C3 +- CMRC +- DRCD +- MultiRC +- RACE + +
+
+
+Content Summary+ +- CSL +- LCSTS +- XSum + +
+
+ Content Analysis+ +- EPRSTMT +- LAMBADA +- TNEWS + + |
+
+ HuggingFace Models + | ++ API Models + | ++ Custom Models + | +
+ +- InternLM +- LLaMA +- Vicuna +- Alpaca +- Baichuan +- WizardLM +- ChatGLM-6B +- ChatGLM2-6B +- MPT +- Falcon +- TigerBot +- MOSS +- …… + + | ++ +- OpenAI +- Claude (coming soon) +- PaLM (coming soon) +- …… + + | ++ +- GLM +- …… + + | +
+ 语言 + | ++ 知识 + | ++ 推理 + | ++ 考试 + | ++ 理解 + | +
+
+
+
+字词释义+ +- WiC +- SummEdits + +
+
+
+成语习语+ +- CHID + +
+
+
+语义相似度+ +- AFQMC +- BUSTM + +
+
+
+指代消解+ +- CLUEWSC +- WSC +- WinoGrande + +
+
+ 翻译+ +- Flores + + |
+
+
+
+
+知识问答+ +- BoolQ +- CommonSenseQA +- NaturalQuestion +- TrivialQA + +
+
+ 多语种问答+ +- TyDi-QA + + |
+
+
+
+
+文本蕴含+ +- CMNLI +- OCNLI +- OCNLI_FC +- AX-b +- AX-g +- CB +- RTE + +
+
+
+常识推理+ +- StoryCloze +- StoryCloze-CN(即将上线) +- COPA +- ReCoRD +- HellaSwag +- PIQA +- SIQA + +
+
+
+数学推理+ +- MATH +- GSM8K + +
+
+
+定理应用+ +- TheoremQA + +
+
+
+代码+ +- HumanEval +- MBPP + +
+
+ 综合推理+ +- BBH + + |
+
+
+
+ 初中/高中/大学/职业考试+ +- GAOKAO-2023 +- CEval +- AGIEval +- MMLU +- GAOKAO-Bench +- MMLU-CN (即将上线) +- ARC + + |
+
+
+
+
+阅读理解+ +- C3 +- CMRC +- DRCD +- MultiRC +- RACE + +
+
+
+内容总结+ +- CSL +- LCSTS +- XSum + +
+
+ 内容分析+ +- EPRSTMT +- LAMBADA +- TNEWS + + |
+
+ HuggingFace 模型 + | ++ API 模型 + | ++ 自定义模型 + | +
+ +- LLaMA +- Vicuna +- Alpaca +- Baichuan +- WizardLM +- ChatGLM-6B +- ChatGLM2-6B +- MPT +- Falcon +- TigerBot +- MOSS +- …… + + | ++ +- OpenAI +- Claude (即将推出) +- PaLM (即将推出) +- …… + + | ++ +- GLM +- …… + + | +