Hoter Young
|
0971777348
|
[Feature] Support DeepSeep-R1-Distill-Qwen_32B (#30)
|
2025-02-13 21:42:16 +08:00 |
|
Hoter Young
|
f92a1e5050
|
[Feature] Support DeepSeep-R1 API from SenseTime (#29)
|
2025-02-13 20:50:57 +08:00 |
|
wujiang
|
60ab611ecd
|
set deepseek r1 batchsize = 1
|
2025-02-11 21:55:51 +08:00 |
|
wujiang
|
e261a76e07
|
set reasoning model max_out_len = 8192
|
2025-02-11 16:51:05 +08:00 |
|
wujiang
|
f55810ae48
|
[Update] OpenHuEval examples
|
2025-02-10 23:08:43 +08:00 |
|
wujiang
|
5741e38310
|
rename models
|
2025-02-10 17:24:24 +08:00 |
|
hoteryoung
|
c3b0803013
|
support deepseek-r1-distill-qwen-7b and -llama-8b
|
2025-02-10 17:24:24 +08:00 |
|
hoteryoung
|
f2c17190c9
|
enable tested reasoning model
|
2025-02-10 16:51:48 +08:00 |
|
wujiang
|
7586186897
|
add deepseek api models
|
2025-02-04 15:07:34 +08:00 |
|
Linchen Xiao
|
eadbdcb4cb
|
[Update] Update requirement and deepseek configurations (#1764)
|
2024-12-17 10:16:47 +08:00 |
|
Linchen Xiao
|
e2a290fd46
|
[Bump] Bump version to 0.3.7 (#1733)
|
2024-12-03 19:34:57 +08:00 |
|
Linchen Xiao
|
ef695e28e5
|
[Bug] Fix Korbench dataset module (#1717)
|
2024-11-26 17:13:28 +08:00 |
|
Linchen Xiao
|
695738a89b
|
[Update] Add lmdeploy DeepSeek configs (#1656)
* [Update] Add lmdeploy DeepSeek configs
* update max out length
|
2024-11-01 15:34:23 +08:00 |
|
Songyang Zhang
|
e7681943f3
|
[Feature] Update the max_out_len for many models (#1559)
|
2024-09-24 21:52:28 +08:00 |
|
Songyang Zhang
|
46cc7894e1
|
[Feature] Support import configs/models/summarizers from whl (#1376)
* [Feature] Support import configs/models/summarizers from whl
* Update LCBench configs
* Update
* Update
* Update
* Update
* update
* Update
* Update
* Update
* Update
* Update
|
2024-08-01 00:42:48 +08:00 |
|