Songyang Zhang
|
a4d5a6c81b
|
[Feature] Support LiveCodeBench (#1617)
* Update
* Update LCB
* Update
* Update
* Update
* Update
* Update
|
2024-10-21 20:50:39 +08:00 |
|
Songyang Zhang
|
e7681943f3
|
[Feature] Update the max_out_len for many models (#1559)
|
2024-09-24 21:52:28 +08:00 |
|
Songyang Zhang
|
6997990c93
|
[Feature] Update Models (#1518)
* Update Models
* Update
* Update humanevalx
* Update
* Update
|
2024-09-12 23:35:30 +08:00 |
|
Linchen Xiao
|
da74cbfa39
|
[Fix] Model configs update
|
2024-09-04 18:57:10 +08:00 |
|
Linchen Xiao
|
245664f4c0
|
[Feature] Fullbench v0.1 language update (#1463)
* update
* update
* update
* update
|
2024-08-28 14:01:05 +08:00 |
|
Linchen Xiao
|
8e55c9c6ee
|
[Update] Compassbench v1.3 (#1396)
* stash files
* compassbench subjective evaluation added
* evaluation update
* fix lint
* update docs
* Update lint
* changes saved
* changes saved
* CompassBench subjective summarizer added (#1349)
* subjective summarizer added
* fix lint
[Fix] Fix MathBench (#1351)
Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>
[Update] Update model support list (#1353)
* fix pip version
* fix pip version
* update model support
subjective summarizer updated
knowledge, math objective done (data need update)
remove secrets
objective changes saved
knowledge data added
* secrets removed
* changed added
* summarizer modified
* summarizer modified
* compassbench coding added
* fix lint
* objective summarizer updated
* compass_bench_v1.3 updated
* update files in config folder
* remove unused model
* lcbench modified
* removed model evaluation configs
* remove duplicated sdk implementation
---------
Co-authored-by: zhangsongyang <zhangsongyang@pjlab.org.cn>
|
2024-08-12 19:09:19 +08:00 |
|
Fengzhe Zhou
|
a32f21a356
|
[Sync] Sync with internal codes 2024.06.28 (#1279)
|
2024-06-28 14:16:34 +08:00 |
|
Fengzhe Zhou
|
2954913d9b
|
[Sync] bump version (#1204)
|
2024-05-28 23:09:59 +08:00 |
|
Fengzhe Zhou
|
62dbf04708
|
[Sync] update github workflow (#1156)
|
2024-05-14 22:42:23 +08:00 |
|
Fengzhe Zhou
|
aa2dd2b58c
|
[Format] Add config lints (#892)
|
2024-05-14 15:35:58 +08:00 |
|
Fengzhe Zhou
|
7505b3cadf
|
[Feature] Add huggingface apply_chat_template (#1098)
* add TheoremQA with 5-shot
* add huggingface_above_v4_33 classes
* use num_worker partitioner in cli
* update theoremqa
* update TheoremQA
* add TheoremQA
* rename theoremqa -> TheoremQA
* update TheoremQA output path
* rewrite many model configs
* update huggingface
* further update
* refine configs
* update configs
* update configs
* add configs/eval_llama3_instruct.py
* add summarizer multi faceted
* update bbh datasets
* update configs/models/hf_llama/lmdeploy_llama3_8b_instruct.py
* rename class
* update readme
* update hf above v4.33
|
2024-05-14 14:50:16 +08:00 |
|
liushz
|
17735f0c13
|
Fix Llama-3 meta template (#1079)
Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>
|
2024-04-24 16:46:25 +08:00 |
|
Fengzhe Zhou
|
a256753221
|
[Feature] Add LLaMA-3 Series Configs (#1065)
* add LLaMA-3 Series configs
* update readme
|
2024-04-22 14:39:31 +08:00 |
|
Fengzhe Zhou
|
8c85edd1cd
|
[Sync] deprecate old mbpps (#1064)
|
2024-04-19 20:49:46 +08:00 |
|
Fengzhe Zhou
|
32f40a8f83
|
[Sync] Sync with internal codes 2023.01.08 (#777)
|
2024-01-08 14:07:24 +00:00 |
|
Leymore
|
2c915218e8
|
[Feaure] Add new models: baichuan2, tigerbot, vicuna v1.5 (#373)
* add bag of new models: baichuan2, tigerbot, vicuna v1.5
* update
* re-organize models
* update readme
* update
|
2023-09-08 15:41:20 +08:00 |
|