OpenCompass/opencompass/configs/datasets/bigcodebench
Dongsheng Zhu 7a7a4517ab
[Update] History code bench pass@k update (#2102)
* bigcodebench

* humaneval

* humanevalx

* livecodebench

* mbpp

* humaneval_plus

* fix bug

* template

* max_out fix

* template update
2025-05-19 17:03:33 +08:00
bigcodebench_full_complete_gen_faf748.py [Update] Code evaluation alignment (#1909) 2025-03-04 18:49:38 +08:00
bigcodebench_full_complete_gen.py [Update] Update Skywork/Qwen-QwQ (#1728) 2024-12-05 19:30:43 +08:00
bigcodebench_full_instruct_gen_8815eb.py [Update] Code evaluation alignment (#1909) 2025-03-04 18:49:38 +08:00
bigcodebench_full_instruct_gen.py [Update] Update Skywork/Qwen-QwQ (#1728) 2024-12-05 19:30:43 +08:00
bigcodebench_full_instruct_repeat_gen_c3d5ad.py [Update] History code bench pass@k update (#2102) 2025-05-19 17:03:33 +08:00
bigcodebench_gen.py [Feature] Add recommendation configs for datasets (#1937) 2025-03-25 14:54:13 +08:00
bigcodebench_hard_complete_gen_2888d3.py [Update] Add dataset configurations of no max_out_len (#1967) 2025-03-24 14:24:12 +08:00
bigcodebench_hard_complete_gen_faf748.py [Update] Code evaluation alignment (#1909) 2025-03-04 18:49:38 +08:00
bigcodebench_hard_complete_gen.py [Update] Update Skywork/Qwen-QwQ (#1728) 2024-12-05 19:30:43 +08:00
bigcodebench_hard_instruct_gen_8815eb.py [Update] Code evaluation alignment (#1909) 2025-03-04 18:49:38 +08:00
bigcodebench_hard_instruct_gen_c3d5ad.py [Feature] Add recommendation configs for datasets (#1937) 2025-03-25 14:54:13 +08:00
bigcodebench_hard_instruct_gen.py [Feature] Add recommendation configs for datasets (#1937) 2025-03-25 14:54:13 +08:00
bigcodebench_hard_instruct_repeat_gen_c3d5ad.py [Update] History code bench pass@k update (#2102) 2025-05-19 17:03:33 +08:00