from opencompass.models import TurboMindModelwithChatTemplate
models = [
    dict(
        type=TurboMindModelwithChatTemplate,
        abbr='internlm2-chat-7b-lmdeploy',
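        # `path` below is the Hugging Face Hub model ID; a local checkpoint
        # directory should also work here.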
        path='internlm/internlm2-chat-7b',
        # Inference backend of LMDeploy. It can be either 'turbomind' or
        # 'pytorch'. If the model is not supported by 'turbomind', it will
        # fall back to 'pytorch'.
        backend='turbomind',
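        # (Illustrative assumption: setting backend='pytorch' here would
        # select the PyTorch engine explicitly instead of relying on the
        # automatic fallback described above.)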
        # For the detailed engine config and generation config, please refer to
        # https://github.com/InternLM/lmdeploy/blob/main/lmdeploy/messages.py
        engine_config=dict(tp=1),
        gen_config=dict(do_sample=False),
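        # A fuller sketch of the two configs above (parameter names taken from
        # TurbomindEngineConfig and GenerationConfig in messages.py; the
        # values are illustrative assumptions, not tuned recommendations):
        # engine_config=dict(tp=1, session_len=8192, max_batch_size=16),
        # gen_config=dict(do_sample=False, top_k=1, max_new_tokens=4096),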
        max_seq_len=8192,
        max_out_len=4096,
        # The max number of prompts that LMDeploy receives in the `generate`
        # function.
        batch_size=5000,
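        # Note: this is the OpenCompass-side batch size. LMDeploy schedules
        # concurrent requests internally (bounded by the engine's
        # max_batch_size), so a large value here is generally safe.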
        run_cfg=dict(num_gpus=1),
    )
]
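
# Hedged usage sketch: a model list like this is normally paired with a
# dataset list and launched through OpenCompass's run.py. The dataset module
# below is an illustrative assumption; check your local configs/ tree for the
# exact import path.
#
#   from mmengine.config import read_base
#
#   with read_base():
#       from opencompass.configs.datasets.gsm8k.gsm8k_gen import gsm8k_datasets
#
#   datasets = gsm8k_datasets
#
# Then run:  python run.py <this_config>.py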