Dongsheng Zhu
|
7a7a4517ab
|
[Update] History code bench pass@k update (#2102)
* bigcodebench
* humaneval
* humanevalx
* humanevalx
* livecodebench
* mbpp
* humaneval_plus
* fix bug
* template
* max_out fix
* template update
|
2025-05-19 17:03:33 +08:00 |
|
Dongsheng Zhu
|
fff2d51440
|
[Update] Code evaluation alignment (#1909)
* code alignment
* update oss md5
* bigcodebench update
* lint
* lint_
* lint yapf
|
2025-03-04 18:49:38 +08:00 |
|
Linchen Xiao
|
d7daee6e25
|
[Update] OpenAI model update, bigcodebench update (#1879)
* [Update] Openai model update, bigcodebench update
* update
|
2025-02-20 19:33:25 +08:00 |
|
Dongsheng Zhu
|
3fd8b4e0cd
|
[Update] Update BigCodeBench & LCBench load path (#1857)
* BigCodeBench update
* update LCBench
* update LCBench 2
* update code
|
2025-02-08 15:15:47 +08:00 |
|
Linchen Xiao
|
ebefffed61
|
[Update] Update OC academic 202412 (#1771)
* [Update] Update academic settings
* Update
* update
|
2024-12-19 18:07:34 +08:00 |
|
Songyang Zhang
|
fb43dd1906
|
[Update] Update Skywork/Qwen-QwQ (#1728)
* Update JuderBench
* Support O1-style Prompts
* Update Code
* Update OpenAI
* Update BigCodeBench
* Update BigCodeBench
* Update BigCodeBench
* Update BigCodeBench
* Update BigCodeBench
* Update
|
2024-12-05 19:30:43 +08:00 |
|