Songyang Zhang
|
aa2b89b6f8
|
[Update] Add CascadeEvaluator with Data Replica (#2022)
* Update CascadeEvaluator
* Update CascadeEvaluator
* Update CascadeEvaluator
* Update Config
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
|
2025-05-20 16:46:55 +08:00 |
|
Myhs_phz
|
1585c0adbe
|
[Feature] Evaluation Results Persistence (#1894)
* feat results_station.py
* lint
* feat save_to_station
* feat result_station.py and lint
* feat
* fix
* fix and lint
* fix
* fix subjective processing
* fix
* fix
* style function name
* lint
|
2025-03-05 18:33:34 +08:00 |
|
bittersweet1999
|
a2e9bc0c41
|
[Fix] fix duplicate error in partitioner (#1552)
* fix pip version
* fix pip version
* fix duplicate error in partitioner
* fix duplicate error in partitioner
|
2024-09-23 19:45:21 +08:00 |
|
Songyang Zhang
|
46cc7894e1
|
[Feature] Support import configs/models/summarizers from whl (#1376)
* [Feature] Support import configs/models/summarizers from whl
* Update LCBench configs
* Update
* Update
* Update
* Update
* update
* Update
* Update
* Update
* Update
* Update
|
2024-08-01 00:42:48 +08:00 |
|
Fengzhe Zhou
|
c3c02c2960
|
update docs (#1318)
* update docs
* Efficient Evaluation -> Data Sharding
* update
* update
* Update faq.md
---------
Co-authored-by: bittersweet1999 <148421775+bittersweet1999@users.noreply.github.com>
|
2024-07-25 18:44:25 +08:00 |
|
bittersweet1999
|
68ca48496b
|
[Refactor] Reorganize subjective eval (#1284)
* fix pip version
* fix pip version
* reorganize subjective eval
* reorg sub
* reorg subeval
* reorg subeval
* update subjective doc
* reorg subeval
* reorg subeval
|
2024-07-05 22:11:37 +08:00 |
|
Fengzhe Zhou
|
a32f21a356
|
[Sync] Sync with internal codes 2024.06.28 (#1279)
|
2024-06-28 14:16:34 +08:00 |
|
Fengzhe Zhou
|
7505b3cadf
|
[Feature] Add huggingface apply_chat_template (#1098)
* add TheoremQA with 5-shot
* add huggingface_above_v4_33 classes
* use num_worker partitioner in cli
* update theoremqa
* update TheoremQA
* add TheoremQA
* rename theoremqa -> TheoremQA
* update TheoremQA output path
* rewrite many model configs
* update huggingface
* further update
* refine configs
* update configs
* update configs
* add configs/eval_llama3_instruct.py
* add summarizer multi faceted
* update bbh datasets
* update configs/models/hf_llama/lmdeploy_llama3_8b_instruct.py
* rename class
* update readme
* update hf above v4.33
|
2024-05-14 14:50:16 +08:00 |
|
Haodong Duan
|
3a232db471
|
[Deprecate] Remove multi-modal related stuff (#1072)
* Remove MultiModal
* update index.rst
* update README
* remove mmbench codes
* update news
---------
Co-authored-by: Leymore <zfz-960727@163.com>
|
2024-04-26 21:20:14 +08:00 |
|
bittersweet1999
|
2d4e559763
|
[Feature] Add multi-model judge and fix some problems (#1016)
* support multi-model judge and moe judge
* test_moe
* test_moe
* test
* add moe judge
* support multi-judge-model
|
2024-04-02 11:52:06 +08:00 |
|
bittersweet1999
|
c78a4df923
|
add support for set prediction path (#984)
|
2024-03-19 14:32:15 +08:00 |
|
Fengzhe Zhou
|
d34ba11106
|
[Sync] Merge branch 'dev' into zfz/update-keyset-demo (#876)
|
2024-02-05 23:29:10 +08:00 |
|
Fengzhe Zhou
|
32f40a8f83
|
[Sync] Sync with internal codes 2023.01.08 (#777)
|
2024-01-08 14:07:24 +00:00 |
|
bittersweet1999
|
e985100cd1
|
[Fix] Fix subjective alignbench (#730)
|
2023-12-23 20:06:53 +08:00 |
|
bittersweet1999
|
97c2068bd9
|
[Feature] Add JudgeLLMs (#710)
* add judgellms
* add judgellms
* add sub_size_partition
* add docs
* add ref
|
2023-12-19 18:40:25 +08:00 |
|
bittersweet1999
|
3e77175720
|
[Fix] Hotfix for Subjective Evaluation (#686)
|
2023-12-12 09:22:08 +08:00 |
|
bittersweet1999
|
465308e430
|
[Feature] Add Subjective Evaluation (#680)
* new version of subject
* fixed draw
* fixed draw
* fixed draw
* done
* done
* done
* done
* fixed lint
|
2023-12-11 22:22:11 +08:00 |
|
Hubert
|
e78857ac36
|
[Sync] minor test (#683)
|
2023-12-11 17:42:53 +08:00 |
|
Fengzhe Zhou
|
dbb20b8270
|
[Sync] update (#517)
|
2023-10-27 20:31:22 +08:00 |
|
Leymore
|
fbf5089c40
|
[Sync] update github token (#475)
|
2023-10-13 06:50:54 -05:00 |
|
Tong Gao
|
07574fddbb
|
[Fix] keep keys (#431)
|
2023-09-22 17:30:54 +08:00 |
|
Tong Gao
|
a1ea3c094a
|
[Sync] Initial support of subjective evaluation (#421)
Co-authored-by: Leymore <zfz-960727@163.com>
|
2023-09-22 15:42:31 +08:00 |
|
Tong Gao
|
5d75c1bbb9
|
[Enhancement] Increase default task size (#360)
|
2023-09-05 10:38:13 +08:00 |
|
Leymore
|
7ca6ba625e
|
[Feature] Add qwen & qwen-chat support (#286)
* add and apply update suffix tool
* add tool doc
* add qwen configs
* add cmmlu
* rename bbh
* update datasets
* delete
* update hf_qwen_7b.py
|
2023-08-31 11:29:05 +08:00 |
|
Yuan Liu
|
191a3f6f9d
|
[Feature]: Use multimodal (#73)
* [Feature]: Add minigpt-4
* [Feature]: Add mm local runner
* [Feature]: Add instructblip
* [Feature]: Delete redundant file
* [Feature]: Delete redundant file
* [Feature]: Add README to InstructBLIP
* [Feature]: Update MiniGPT-4
* [Fix]: Fix lint
* [Feature]add omnibenchmark readme (#49)
* add omnibenchmark readme
* fix
* Update OmniMMBench.md
* Update OmniMMBench.md
* Update OmniMMBench.md
* [Fix]: Refine name (#54)
* [Feature]: Unify out and err
* [Fix]: Fix lint
* [Feature]: Rename to mmbench and change weight path
* [Feature]: Delete Omni in instructblip
* [Feature]: Check the availability of lavis
* [Fix]: Fix lint
* [Feature]: Refactor MM
* [Refactor]: Refactor path
* [Feature]: Delete redundant files
* [Refactor]: Delete redundant files
---------
Co-authored-by: Wangbo Zhao(黑色枷锁) <56866854+wangbo-zhao@users.noreply.github.com>
|
2023-08-03 11:07:50 +08:00 |
|
Leymore
|
3fe5ee096c
|
[Feature] Add heuristic size partitioner (#63)
* [Feature] Add heuristic size partitioner
* update
|
2023-07-20 11:53:24 +08:00 |
|
Leymore
|
c94cc94348
|
Add release contribution
|
2023-07-05 03:15:31 +00:00 |
|
yingfhu
|
fb11108723
|
[Feat] support opencompass
|
2023-07-04 22:11:33 +08:00 |
|
gaotongxiao
|
7d346000bb
|
initial commit
|
2023-07-04 21:34:55 +08:00 |
|