OpenCompass/examples/eval_internlm_chat_turbomind.py

from mmengine.config import read_base

from opencompass.models.turbomind import TurboMindModel

with read_base():
    # choose a list of datasets
    from opencompass.configs.datasets.ceval.ceval_gen_5f30c7 import \
        ceval_datasets
    from opencompass.configs.datasets.crowspairs.crowspairs_gen_381af0 import \
        crowspairs_datasets
    from opencompass.configs.datasets.gsm8k.gsm8k_gen_1d7fe4 import \
        gsm8k_datasets
    from opencompass.configs.datasets.mmlu.mmlu_gen_a484b3 import mmlu_datasets
    from opencompass.configs.datasets.race.race_gen_69ee4f import race_datasets
    from opencompass.configs.datasets.SuperGLUE_WiC.SuperGLUE_WiC_gen_d06864 import \
        WiC_datasets
    from opencompass.configs.datasets.SuperGLUE_WSC.SuperGLUE_WSC_gen_7902a7 import \
        WSC_datasets
    from opencompass.configs.datasets.triviaqa.triviaqa_gen_2121ce import \
        triviaqa_datasets
    # and output the results in a choosen format
    from opencompass.configs.summarizers.medium import summarizer

datasets = sum((v for k, v in locals().items() if k.endswith('_datasets')), [])

internlm_meta_template = dict(round=[
    dict(role='HUMAN', begin='<|User|>:', end='\n'),
    dict(role='BOT', begin='<|Bot|>:', end='<eoa>\n', generate=True),
],
                              eos_token_id=103028)

internlm2_meta_template = dict(round=[
    dict(role='HUMAN', begin='<|im_start|>user\n', end='<|im_end|>\n'),
    dict(role='BOT',
         begin='<|im_start|>assistant\n',
         end='<|im_end|>\n',
         generate=True),
],
                               eos_token_id=92542)

# config for internlm-chat-7b
internlm_chat_7b = dict(
    type=TurboMindModel,
    abbr='internlm-chat-7b-turbomind',
    path='internlm/internlm-chat-7b',
    engine_config=dict(session_len=2048,
                       max_batch_size=32,
                       rope_scaling_factor=1.0),
    gen_config=dict(top_k=1, top_p=0.8, temperature=1.0, max_new_tokens=100),
    max_out_len=100,
    max_seq_len=2048,
    batch_size=32,
    concurrency=32,
    meta_template=internlm_meta_template,
    run_cfg=dict(num_gpus=1, num_procs=1),
    end_str='<eoa>',
)

# config for internlm-chat-7b
internlm2_chat_7b = dict(type=TurboMindModel,
                         abbr='internlm2-chat-7b-turbomind',
                         path='internlm/internlm2-chat-7b',
                         engine_config=dict(session_len=2048,
                                            max_batch_size=32,
                                            rope_scaling_factor=1.0),
                         gen_config=dict(top_k=1,
                                         top_p=0.8,
                                         temperature=1.0,
                                         max_new_tokens=100),
                         max_out_len=100,
                         max_seq_len=2048,
                         batch_size=32,
                         concurrency=32,
                         meta_template=internlm2_meta_template,
                         run_cfg=dict(num_gpus=1, num_procs=1),
                         end_str='<|im_end|>')

# config for internlm-chat-20b
internlm_chat_20b = dict(
    type=TurboMindModel,
    abbr='internlm-chat-20b-turbomind',
    path='internlm/internlm-chat-20b',
    engine_config=dict(session_len=2048,
                       max_batch_size=8,
                       rope_scaling_factor=1.0),
    gen_config=dict(top_k=1, top_p=0.8, temperature=1.0, max_new_tokens=100),
    max_out_len=100,
    max_seq_len=2048,
    batch_size=8,
    concurrency=8,
    meta_template=internlm_meta_template,
    run_cfg=dict(num_gpus=1, num_procs=1),
    end_str='<eoa>',
)

models = [internlm_chat_20b]
Integrate turbomind python api (#484) * integrate turbomind python api * update * update user guide * update * fix according to reviewer's comments * fix error * fix linting * update user guide * remove debug log --------- Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com> 2023-11-21 22:34:46 +08:00			`from mmengine.config import read_base`
[Refactor] Code refactoarization (#1831) * Update * fix lint * update * fix lint 2025-01-20 19:17:38 +08:00
Integrate turbomind python api (#484) * integrate turbomind python api * update * update user guide * update * fix according to reviewer's comments * fix error * fix linting * update user guide * remove debug log --------- Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com> 2023-11-21 22:34:46 +08:00			`from opencompass.models.turbomind import TurboMindModel`

			`with read_base():`
			`# choose a list of datasets`
[Refactor] Code refactoarization (#1831) * Update * fix lint * update * fix lint 2025-01-20 19:17:38 +08:00			`from opencompass.configs.datasets.ceval.ceval_gen_5f30c7 import \`
			`ceval_datasets`
			`from opencompass.configs.datasets.crowspairs.crowspairs_gen_381af0 import \`
			`crowspairs_datasets`
			`from opencompass.configs.datasets.gsm8k.gsm8k_gen_1d7fe4 import \`
			`gsm8k_datasets`
[Doc] Update Readme (#1439) * update * update * update * update * update * update * update * update * update * update * update * update 2024-08-22 14:48:45 +08:00			`from opencompass.configs.datasets.mmlu.mmlu_gen_a484b3 import mmlu_datasets`
			`from opencompass.configs.datasets.race.race_gen_69ee4f import race_datasets`
[Refactor] Code refactoarization (#1831) * Update * fix lint * update * fix lint 2025-01-20 19:17:38 +08:00			`from opencompass.configs.datasets.SuperGLUE_WiC.SuperGLUE_WiC_gen_d06864 import \`
			`WiC_datasets`
			`from opencompass.configs.datasets.SuperGLUE_WSC.SuperGLUE_WSC_gen_7902a7 import \`
			`WSC_datasets`
			`from opencompass.configs.datasets.triviaqa.triviaqa_gen_2121ce import \`
			`triviaqa_datasets`
Integrate turbomind python api (#484) * integrate turbomind python api * update * update user guide * update * fix according to reviewer's comments * fix error * fix linting * update user guide * remove debug log --------- Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com> 2023-11-21 22:34:46 +08:00			`# and output the results in a choosen format`
[Doc] Update Readme (#1439) * update * update * update * update * update * update * update * update * update * update * update * update 2024-08-22 14:48:45 +08:00			`from opencompass.configs.summarizers.medium import summarizer`
Integrate turbomind python api (#484) * integrate turbomind python api * update * update user guide * update * fix according to reviewer's comments * fix error * fix linting * update user guide * remove debug log --------- Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com> 2023-11-21 22:34:46 +08:00
			`datasets = sum((v for k, v in locals().items() if k.endswith('_datasets')), [])`

[Feature] Update configs for evaluating chat models like qwen, baichuan, llama2 using turbomind backend (#721) * add llama2 test * fix * test qwen chat-7b * test w4 * add baichuan2 * update * update * update configs and docs * update 2023-12-21 18:22:17 +08:00			`internlm_meta_template = dict(round=[`
			`dict(role='HUMAN', begin='<\|User\|>:', end='\n'),`
			`dict(role='BOT', begin='<\|Bot\|>:', end='<eoa>\n', generate=True),`
			`],`
			`eos_token_id=103028)`
Integrate turbomind python api (#484) * integrate turbomind python api * update * update user guide * update * fix according to reviewer's comments * fix error * fix linting * update user guide * remove debug log --------- Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com> 2023-11-21 22:34:46 +08:00
[Refactor] Code refactoarization (#1831) * Update * fix lint * update * fix lint 2025-01-20 19:17:38 +08:00			`internlm2_meta_template = dict(round=[`
			`dict(role='HUMAN', begin='<\|im_start\|>user\n', end='<\|im_end\|>\n'),`
			`dict(role='BOT',`
			`begin='<\|im_start\|>assistant\n',`
			`end='<\|im_end\|>\n',`
			`generate=True),`
			`],`
			`eos_token_id=92542)`
[Feature] Add end_str for turbomind (#859) * fix * update * fix internlm1 * fix docs * remove sys 2024-02-01 22:31:14 +08:00
Integrate turbomind python api (#484) * integrate turbomind python api * update * update user guide * update * fix according to reviewer's comments * fix error * fix linting * update user guide * remove debug log --------- Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com> 2023-11-21 22:34:46 +08:00			`# config for internlm-chat-7b`
[Feature] Update configs for evaluating chat models like qwen, baichuan, llama2 using turbomind backend (#721) * add llama2 test * fix * test qwen chat-7b * test w4 * add baichuan2 * update * update * update configs and docs * update 2023-12-21 18:22:17 +08:00			`internlm_chat_7b = dict(`
			`type=TurboMindModel,`
			`abbr='internlm-chat-7b-turbomind',`
[Feature] Update evaluate turbomind (#804) * update * fix * fix * fix 2024-01-17 11:09:50 +08:00			`path='internlm/internlm-chat-7b',`
[Fix] Fix turbomind and update docs (#808) * update * update docs * add engine_config and gen_config in eval_config * update * fix * fix * fix * fix docstr * fix url 2024-01-18 14:41:35 +08:00			`engine_config=dict(session_len=2048,`
			`max_batch_size=32,`
			`rope_scaling_factor=1.0),`
[Refactor] Code refactoarization (#1831) * Update * fix lint * update * fix lint 2025-01-20 19:17:38 +08:00			`gen_config=dict(top_k=1, top_p=0.8, temperature=1.0, max_new_tokens=100),`
[Feature] Update configs for evaluating chat models like qwen, baichuan, llama2 using turbomind backend (#721) * add llama2 test * fix * test qwen chat-7b * test w4 * add baichuan2 * update * update * update configs and docs * update 2023-12-21 18:22:17 +08:00			`max_out_len=100,`
			`max_seq_len=2048,`
			`batch_size=32,`
			`concurrency=32,`
			`meta_template=internlm_meta_template,`
			`run_cfg=dict(num_gpus=1, num_procs=1),`
[Feature] Add end_str for turbomind (#859) * fix * update * fix internlm1 * fix docs * remove sys 2024-02-01 22:31:14 +08:00			`end_str='<eoa>',`
			`)`

			`# config for internlm-chat-7b`
[Refactor] Code refactoarization (#1831) * Update * fix lint * update * fix lint 2025-01-20 19:17:38 +08:00			`internlm2_chat_7b = dict(type=TurboMindModel,`
			`abbr='internlm2-chat-7b-turbomind',`
			`path='internlm/internlm2-chat-7b',`
			`engine_config=dict(session_len=2048,`
			`max_batch_size=32,`
			`rope_scaling_factor=1.0),`
			`gen_config=dict(top_k=1,`
			`top_p=0.8,`
			`temperature=1.0,`
			`max_new_tokens=100),`
			`max_out_len=100,`
			`max_seq_len=2048,`
			`batch_size=32,`
			`concurrency=32,`
			`meta_template=internlm2_meta_template,`
			`run_cfg=dict(num_gpus=1, num_procs=1),`
			`end_str='<\|im_end\|>')`
Integrate turbomind python api (#484) * integrate turbomind python api * update * update user guide * update * fix according to reviewer's comments * fix error * fix linting * update user guide * remove debug log --------- Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com> 2023-11-21 22:34:46 +08:00
			`# config for internlm-chat-20b`
[Feature] Update configs for evaluating chat models like qwen, baichuan, llama2 using turbomind backend (#721) * add llama2 test * fix * test qwen chat-7b * test w4 * add baichuan2 * update * update * update configs and docs * update 2023-12-21 18:22:17 +08:00			`internlm_chat_20b = dict(`
			`type=TurboMindModel,`
			`abbr='internlm-chat-20b-turbomind',`
[Feature] Update evaluate turbomind (#804) * update * fix * fix * fix 2024-01-17 11:09:50 +08:00			`path='internlm/internlm-chat-20b',`
[Fix] Fix turbomind and update docs (#808) * update * update docs * add engine_config and gen_config in eval_config * update * fix * fix * fix * fix docstr * fix url 2024-01-18 14:41:35 +08:00			`engine_config=dict(session_len=2048,`
			`max_batch_size=8,`
			`rope_scaling_factor=1.0),`
[Refactor] Code refactoarization (#1831) * Update * fix lint * update * fix lint 2025-01-20 19:17:38 +08:00			`gen_config=dict(top_k=1, top_p=0.8, temperature=1.0, max_new_tokens=100),`
[Feature] Update configs for evaluating chat models like qwen, baichuan, llama2 using turbomind backend (#721) * add llama2 test * fix * test qwen chat-7b * test w4 * add baichuan2 * update * update * update configs and docs * update 2023-12-21 18:22:17 +08:00			`max_out_len=100,`
			`max_seq_len=2048,`
			`batch_size=8,`
			`concurrency=8,`
			`meta_template=internlm_meta_template,`
			`run_cfg=dict(num_gpus=1, num_procs=1),`
[Feature] Add end_str for turbomind (#859) * fix * update * fix internlm1 * fix docs * remove sys 2024-02-01 22:31:14 +08:00			`end_str='<eoa>',`
[Feature] Update configs for evaluating chat models like qwen, baichuan, llama2 using turbomind backend (#721) * add llama2 test * fix * test qwen chat-7b * test w4 * add baichuan2 * update * update * update configs and docs * update 2023-12-21 18:22:17 +08:00			`)`
Integrate turbomind python api (#484) * integrate turbomind python api * update * update user guide * update * fix according to reviewer's comments * fix error * fix linting * update user guide * remove debug log --------- Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com> 2023-11-21 22:34:46 +08:00
[Feature] Update configs for evaluating chat models like qwen, baichuan, llama2 using turbomind backend (#721) * add llama2 test * fix * test qwen chat-7b * test w4 * add baichuan2 * update * update * update configs and docs * update 2023-12-21 18:22:17 +08:00			`models = [internlm_chat_20b]`