Songyang Zhang
|
0d8df541bc
|
[Update] Update O1-style Benchmark and Prompts (#1742)
* Update JuderBench
* Support O1-style Prompts
* Update Code
* Update OpenAI
* Update BigCodeBench
* Update BigCodeBench
* Update BigCodeBench
* Update BigCodeBench
* Update BigCodeBench
* Update
* Update
* Update
* Update
|
2024-12-09 13:48:56 +08:00 |
|
Linchen Xiao
|
df57c08ccf
|
[Feature] Update Models, Summarizers (#1600)
|
2024-10-29 18:37:15 +08:00 |
|
Songyang Zhang
|
a4d5a6c81b
|
[Feature] Support LiveCodeBench (#1617)
* Update
* Update LCB
* Update
* Update
* Update
* Update
* Update
|
2024-10-21 20:50:39 +08:00 |
|
Songyang Zhang
|
e7681943f3
|
[Feature] Update the max_out_len for many models (#1559)
|
2024-09-24 21:52:28 +08:00 |
|
Songyang Zhang
|
6997990c93
|
[Feature] Update Models (#1518)
* Update Models
* Update
* Update humanevalx
* Update
* Update
|
2024-09-12 23:35:30 +08:00 |
|
Linchen Xiao
|
da74cbfa39
|
[Fix] Model configs update
|
2024-09-04 18:57:10 +08:00 |
|
Linchen Xiao
|
245664f4c0
|
[Feature] Fullbench v0.1 language update (#1463)
* update
* update
* update
* update
|
2024-08-28 14:01:05 +08:00 |
|
Linchen Xiao
|
8e55c9c6ee
|
[Update] Compassbench v1.3 (#1396)
* stash files
* compassbench subjective evaluation added
* evaluation update
* fix lint
* update docs
* Update lint
* changes saved
* changes saved
* CompassBench subjective summarizer added (#1349)
* subjective summarizer added
* fix lint
[Fix] Fix MathBench (#1351)
Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>
[Update] Update model support list (#1353)
* fix pip version
* fix pip version
* update model support
subjective summarizer updated
knowledge, math objective done (data need update)
remove secrets
objective changes saved
knowledge data added
* secrets removed
* changed added
* summarizer modified
* summarizer modified
* compassbench coding added
* fix lint
* objective summarizer updated
* compass_bench_v1.3 updated
* update files in config folder
* remove unused model
* lcbench modified
* removed model evaluation configs
* remove duplicated sdk implementation
---------
Co-authored-by: zhangsongyang <zhangsongyang@pjlab.org.cn>
|
2024-08-12 19:09:19 +08:00 |
|
Songyang Zhang
|
46cc7894e1
|
[Feature] Support import configs/models/summarizers from whl (#1376)
* [Feature] Support import configs/models/summarizers from whl
* Update LCBench configs
* Update
* Update
* Update
* Update
* update
* Update
* Update
* Update
* Update
* Update
|
2024-08-01 00:42:48 +08:00 |
|