Xiaoyu Zhang
|
82a533a690
|
add rwkv-5-3b model (#666)
* support rwkv5-3b learnboard
* update rwkv-5-3b config
* update config
* refine
* fix bug
* update config
* refine
* reduce batch size
* refine
* reduce batch size to avoid oom in special datasets
* Update huggingface.py
* Update huggingface.py
|
2023-12-12 18:15:19 +08:00 |
|