hf_metrics | [Feature] Use local accuracy from hf implements (#416) | 2023-09-20 16:35:22 +08:00
__init__.py | [Fix] Fix math-verify evaluator (#1917) | 2025-03-11 17:35:04 +08:00
code_evaluator.py | [Feature] Add MultiPL-E & Code Evaluator (#1963) | 2025-03-21 20:09:25 +08:00
icl_agent_evaluator.py | Support GSM8k evaluation with tools by Lagent and LangChain (#277) | 2023-09-22 15:28:22 +08:00
icl_aucroc_evaluator.py | [Enhancement] Test linting in CI and fix existing linting errors (#69) | 2023-07-17 15:59:10 +08:00
icl_base_evaluator.py | [Fix] Fix compare error when k is list in base_evaluator (#2010) | 2025-04-10 19:47:21 +08:00
icl_bpc_evaluator.py | [Feature] Adding support for LLM Compression Evaluation (#1108) | 2024-04-30 10:51:01 +08:00
icl_circular_evaluator.py | [Sync] update configs (#734) | 2023-12-25 21:59:16 +08:00
icl_em_evaluator.py | [Sync] update (#517) | 2023-10-27 20:31:22 +08:00
icl_hf_evaluator.py | [Sync] update evaluator (#1175) | 2024-05-21 14:22:46 +08:00
icl_jieba_rouge_evaluator.py | fix jieba rouge (#467) | 2023-10-12 10:25:19 +08:00
icl_korbench_evaluator.py | [Feature] Add Korbench dataset (#1713) | 2024-11-25 20:11:27 +08:00
icl_misc_evaluator.py | [Feature] Support inference ppl datasets (#1315) | 2024-07-22 17:59:30 +08:00
icl_plugin_evaluator.py | [Fix] Fix compatible issue | 2025-03-03 15:35:57 +08:00
icl_toxic_evaluator.py | [Fix] Fix CI (#70) | 2023-07-17 19:10:59 +08:00
lm_evaluator.py | [Feature] Support subjective evaluation for reasoning model (#1868) | 2025-02-20 12:19:46 +08:00
math_evaluator.py | [Fix] OpenICL Math Evaluator Config (#2007) | 2025-04-08 14:38:35 +08:00