..
postprocessors
[Feature] Add custom model postprocess function ( #1519 )
2024-09-18 14:40:51 +08:00
__init__.py
[Update] Update Skywork/Qwen-QwQ ( #1728 )
2024-12-05 19:30:43 +08:00
abbr.py
[Feature] Add multi-model judge and fix some problems ( #1016 )
2024-04-02 11:52:06 +08:00
auxiliary.py
[Feat] support humaneval and mbpp pass@k ( #598 )
2023-11-16 21:22:06 +08:00
build.py
[Feature] Add huggingface apply_chat_template ( #1098 )
2024-05-14 14:50:16 +08:00
collect_env.py
[Feature] Update pip install ( #1324 )
2024-07-29 18:32:50 +08:00
datasets_info.py
[Update] Update o1 eval prompt ( #1806 )
2025-01-07 00:14:32 +08:00
datasets.py
[Update] Update o1 eval prompt ( #1806 )
2025-01-07 00:14:32 +08:00
dependency.py
[Feature]: Use multimodal ( #73 )
2023-08-03 11:07:50 +08:00
dict_postprocessors.py
[Feature] Add Judgerbench and reorg subeval ( #1593 )
2024-10-15 16:36:05 +08:00
file.py
fix output typing, change mutable list to immutable tuple ( #989 )
2024-04-26 23:07:34 +08:00
fileio.py
[Update] Update Skywork/Qwen-QwQ ( #1728 )
2024-12-05 19:30:43 +08:00
lark.py
[Feature] Several enhancements ( #142 )
2023-08-01 18:19:49 +08:00
logging.py
[Enhance] Supress warning raised by get_logger ( #353 )
2023-09-04 15:27:08 +08:00
menu.py
[Feat] Support local runner for windows ( #515 )
2023-10-27 17:16:22 +08:00
model_postprocessors.py
[Update] Update MATH dataset with model judge ( #1711 )
2024-11-25 15:14:55 +08:00
network.py
[Update] Update Skywork/Qwen-QwQ ( #1728 )
2024-12-05 19:30:43 +08:00
prompt.py
Support wildbench ( #1266 )
2024-06-24 13:16:27 +08:00
run.py
[Feature] Add OpenHuEval-HuLifeQA ( #4 )
2025-01-24 10:32:17 +08:00
text_postprocessors.py
add some features ( #32 )
2025-02-14 20:44:53 +08:00
types.py
[Sync] Initial support of subjective evaluation ( #421 )
2023-09-22 15:42:31 +08:00