OpenCompass/opencompass/openicl/icl_evaluator
Yufeng Zhao 300adc31e8
[Feature] Add Korbench dataset (#1713)
* first version for korbench

* first stage for korbench

* korbench_1

* korbench_1

* korbench_1

* korbench_1

* korbench_1_revised

* korbench_combined_1

* korbench_combined_1

* kor_combined

* kor_combined

* update

---------

Co-authored-by: MaiziXiao <xxllcc1993@gmail.com>
2024-11-25 20:11:27 +08:00
..
hf_metrics [Feature] Use local accuracy from hf implements (#416) 2023-09-20 16:35:22 +08:00
__init__.py [Feature] Support inference ppl datasets (#1315) 2024-07-22 17:59:30 +08:00
icl_agent_evaluator.py Support GSM8k evaluation with tools by Lagent and LangChain (#277) 2023-09-22 15:28:22 +08:00
icl_aucroc_evaluator.py [Enhancement] Test linting in CI and fix existing linting errors (#69) 2023-07-17 15:59:10 +08:00
icl_base_evaluator.py [Feature] Support LiveCodeBench (#1617) 2024-10-21 20:50:39 +08:00
icl_bpc_evaluator.py [Feature] Adding support for LLM Compression Evaluation (#1108) 2024-04-30 10:51:01 +08:00
icl_circular_evaluator.py [Sync] update configs (#734) 2023-12-25 21:59:16 +08:00
icl_em_evaluator.py [Sync] update (#517) 2023-10-27 20:31:22 +08:00
icl_hf_evaluator.py [Sync] update evaluator (#1175) 2024-05-21 14:22:46 +08:00
icl_jieba_rouge_evaluator.py fix jieba rouge (#467) 2023-10-12 10:25:19 +08:00
icl_korbench_evaluator.py [Feature] Add Korbench dataset (#1713) 2024-11-25 20:11:27 +08:00
icl_misc_evaluator.py [Feature] Support inference ppl datasets (#1315) 2024-07-22 17:59:30 +08:00
icl_plugin_evaluator.py [Sync] Fix TEvalEvaluator (#929) 2024-02-28 16:05:30 +08:00
icl_toxic_evaluator.py [Fix] Fix CI (#70) 2023-07-17 19:10:59 +08:00
lm_evaluator.py [Add] Add CompassArenaSubjectiveBench (#1645) 2024-11-01 13:52:22 +08:00