Mirror of https://github.com/open-compass/opencompass.git, synced 2025-05-30 16:03:24 +08:00
Add NeedleInAHaystack Test * Apply pre-commit formatting * Update configs/eval_hf_internlm_chat_20b_cdme.py Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com> * add needle in haystack test * update needle in haystack test * update plot function in tools_needleinahaystack.py * optimizing needleinahaystack dataset generation strategy * modify minor formatting issues * add English version support * change NeedleInAHaystackDataset to dynamic loading * change NeedleInAHaystackDataset to dynamic loading * fix needleinahaystack test eval bug * fix needleinahaystack config bug * Added support for multi-needle testing in needle-in-a-haystack test * Optimize the code for plotting in the needle-in-a-haystack test. * Correct the typo in the dataset parameters. * update needleinahaystack test docs --------- Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com> | | |
---|---|---|
circular_eval.md | ||
code_eval_service.md | ||
code_eval.md | ||
contamination_eval.md | ||
custom_dataset.md | ||
evaluation_lightllm.md | ||
evaluation_turbomind.md | ||
longeval.md | ||
multimodal_eval.md | ||
needleinahaystack_eval.md | ||
new_dataset.md | ||
new_model.md | ||
prompt_attack.md | ||
subjective_evaluation.md |