Dongsheng Zhu
|
7a7a4517ab
|
[Update] History code bench pass@k update (#2102)
* bigcodebench
* humaneval
* humanevalx
* humanevalx
* livecodebench
* mbpp
* humaneval_plus
* fix bug
* template
* max_out fix
* template update
|
2025-05-19 17:03:33 +08:00 |
|
Dongsheng Zhu
|
fff2d51440
|
[Update] Code evaluation alignment (#1909)
* code alignment
* update oss md5
* bigcodebench update
* lint
* lint_
* lint yapf
|
2025-03-04 18:49:38 +08:00 |
|
Songyang Zhang
|
fb43dd1906
|
[Update] Update Skywork/Qwen-QwQ (#1728)
* Update JuderBench
* Support O1-style Prompts
* Update Code
* Update OpenAI
* Update BigCodeBench
* Update BigCodeBench
* Update BigCodeBench
* Update BigCodeBench
* Update BigCodeBench
* Update
|
2024-12-05 19:30:43 +08:00 |
|
Songyang Zhang
|
f97c4eae42
|
[Update] Update Fullbench (#1712)
* Update JuderBench
* Support O1-style Prompts
* Update Code
|
2024-11-26 14:26:55 +08:00 |
|
Songyang Zhang
|
a4d5a6c81b
|
[Feature] Support LiveCodeBench (#1617)
* Update
* Update LCB
* Update
* Update
* Update
* Update
* Update
|
2024-10-21 20:50:39 +08:00 |
|