Commit Graph

24 Commits

Author SHA1 Message Date
Fengzhe Zhou
32f40a8f83
[Sync] Sync with internal codes 2023.01.08 (#777) 2024-01-08 14:07:24 +00:00
bittersweet1999
7cd65d49d8
[Fix] Fix small bug in alignbench (#764)
* fix small bugs

* fix small bugs
2024-01-03 07:44:53 +00:00
Mo Li
33f8df1ca3
[Update] Change NeedleInAHaystackDataset to dynamic dataset loading (#754)
* Add NeedleInAHaystack Test

* Apply pre-commit formatting

* Update configs/eval_hf_internlm_chat_20b_cdme.py

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

* add needle in haystack test

* update needle in haystack test

* update plot function in tools_needleinahaystack.py

* optimizing needleinahaystack dataset generation strategy

* modify minor formatting issues

* add English version support

* change NeedleInAHaystackDataset to dynamic loading

* change NeedleInAHaystackDataset to dynamic loading

* fix needleinahaystack test eval bug

* fix needleinahaystack config bug

---------

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
2024-01-02 17:22:56 +08:00
Mo Li
17b8e929dd
[Feature] Update plot function in tools_needleinahaystack.py (#747)
* Add NeedleInAHaystack Test

* Apply pre-commit formatting

* Update configs/eval_hf_internlm_chat_20b_cdme.py

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

* add needle in haystack test

* update needle in haystack test

* update plot function in tools_needleinahaystack.py

* optimizing needleinahaystack dataset generation strategy

* modify minor formatting issues

---------

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
2023-12-29 18:51:09 +08:00
Songyang Zhang
ad96f2156f
Update merge script (#733) 2023-12-25 16:45:22 +08:00
Mo Li
0e24f4213e
[Feature] Add NeedleInAHaystack Test Support (#714)
* Add NeedleInAHaystack Test

* Apply pre-commit formatting

* Update configs/eval_hf_internlm_chat_20b_cdme.py

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

* add needle in haystack test

* update needle in haystack test

---------

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
2023-12-23 12:00:51 +08:00
bittersweet1999
fbb912ddf3
[Feature] Add abbr for judgemodel in subjective evaluation (#724)
* add_judgemodel_abbr

* add judgemodel abbr
2023-12-21 15:58:20 +08:00
Songyang Zhang
637628a70f
[Doc] Update Doc for Alignbench (#707)
* update alignmentbench

* update alignmentbench

* update doc

* update

* update
2023-12-15 15:07:25 +08:00
bittersweet1999
1fe152b3e8
[Feature] Support AlignmentBench infer and judge (#697)
* alignmentbench infer and judge

* alignmentbench

* alignmentbench done

* alignment all done

* alignment all done
2023-12-13 19:59:30 +08:00
Fengzhe Zhou
9083dea683
[Sync] some renaming (#641) 2023-11-27 16:06:49 +08:00
Qing
e2355a2ede
[Feature] Add multi model viz (#509)
* add viz_multi_model.py tool

* Modify the viz_multi_model.py script according to the review

* highlight multiple optimal scores

---------

Co-authored-by: wq.chu <wq.chu@tianrang-inc.com>
Co-authored-by: Leymore <zfz-960727@163.com>
2023-10-30 12:11:33 +08:00
Tong Gao
119bfd1569
[Refactor] Move fix_id_list to Retriever (#442)
* [Refactor] Move fix_id_list to Retriever

* update

* move to base

* fix
2023-10-07 12:53:41 +08:00
Leymore
b48d084020
[Fix] update bbh implement & fix bbh suffix (#371) 2023-09-08 15:14:30 +08:00
Hubert
ddb8197212
[Feat] support wizardcoder series (#344)
* [Feat] support wizardcoder series

* minor fix
2023-09-06 17:52:35 +08:00
Hubert
fd389e2d78
[Feat] support codellama and preds collection tools (#335) 2023-08-31 11:14:42 +08:00
cdpath
fa7978fe08
[Feature] Support sample count in prompt_viewer (#273)
* support sample count in prompt_viewer

* update

---------

Co-authored-by: Leymore <zfz-960727@163.com>
2023-08-29 20:51:10 +08:00
Leymore
c26ecdb1b0
[Feature] Add and apply update suffix tool (#280)
* add and apply update suffix tool

* add dataset suffix updater as precommit hook

* update workflow

* update scripts

* update ci

* update

* ci with py3.8

* run in serial

* update bbh

* use py 3.10

* update pre commit zh cn
2023-08-28 17:35:04 +08:00
Tong Gao
9058be07b8
[Feature] Simplify entry script (#204)
* [Feature] Simply entry script

* update
2023-08-25 17:36:30 +08:00
cdpath
6df124d40b
[Docs] update descriptions for tools (#270) 2023-08-25 16:00:26 +08:00
Haodong Duan
6ca2be6626
[Script] Add scripts to evaluate MMBench (#161)
* update

* update

* Update README.md

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>

* refine

* update default

* update CN README

---------

Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
2023-08-07 16:53:36 +08:00
Leymore
3fe5ee096c
[Feature] Add heuristic size partitioner (#63)
* [Feature] Add heuristic size partitioner

* update
2023-07-20 11:53:24 +08:00
Leymore
86d5ec3d0f
Update configs (#9)
* Update implements

* Update
2023-07-06 12:27:41 +08:00
Tong Gao
cd1bec5f2a
Enhance run.py (#7)
* Enhance run.py

* update
2023-07-06 11:58:37 +08:00
gaotongxiao
7d346000bb initial commit 2023-07-04 21:34:55 +08:00