Fengzhe Zhou
8c85edd1cd
[Sync] deprecate old mbpps ( #1064 )
2024-04-19 20:49:46 +08:00
Robin Chen
c172401323
[Fix] Fixed repeated loading of VLLM ( #1051 )
...
* [fix]Fixed the issue caused by the repeated loading of VLLM model during task segmentation.
* [fix] avoid TypeError: VLLM.__init__() got an unexpected keyword argument 'tokenizer_only'
* restore .pre-commit-config.yaml
* restore opencompass/tasks/openicl_infer.py
---------
Co-authored-by: IcyFeather <mengzhuo.happy@gmail.com>
Co-authored-by: Leymore <zfz-960727@163.com>
2024-04-17 20:36:08 +08:00
Songyang Zhang
629836146a
[Doc] Update README ( #1053 )
...
* [Update] Update readme
* [Update] Update readme
* [Update] Update readme
2024-04-16 19:54:12 +08:00
Fengzhe Zhou
881bdbf6bd
[Sync] Bump version to 0.2.4 ( #1052 )
...
(cherry picked from commit 16ac6306c72fa202173289b55eaefe85e0fcb73c)
Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>
2024-04-16 18:09:46 +08:00
Fengzhe Zhou
7a41951dda
[Fix] logger.error -> logger.debug in OpenAI wrapper ( #1050 )
...
* logger.error -> logger.info in OpenAI
* logger.info -> logger.debug in OpenAI
2024-04-15 21:08:13 +08:00
liuwei130
a00e57296f
[Feature] Add ChemBench ( #1032 )
...
* add ChemBench
* update results
* molbench -> ChemBench
---------
Co-authored-by: Leymore <zfz-960727@163.com>
2024-04-12 08:46:26 +08:00
Fengzhe Zhou
bd7c11bb89
[Fix] Update setup.py install_requires ( #1036 )
2024-04-11 11:11:34 +08:00
Fengzhe Zhou
b39f501563
[Sync] update taco ( #1030 )
2024-04-09 17:50:23 +08:00
Mo Li
16f29b25f1
[Fix] Simplify needlebench summarizer ( #1024 )
...
* Conflicts:
configs/summarizers/needlebench.py
* fix lint problems
2024-04-07 17:51:13 +08:00
Mo Li
f2af49337d
[Feature] Add ATC Choice Version ( #1019 )
...
* Squashed commit of the following:
commit c48ad194c3976dc63d1b60d8c8ab2d5ff9e1cbfe
Author: DseidLi <2568818204@qq.com>
Date: Tue Apr 2 16:57:43 2024 +0800
add atc_choice
commit 3ac6efea29619573e6fac8fa3cce464853dcead0
Merge: 2d4e559
8e3a9c3
Author: DseidLi <2568818204@qq.com>
Date: Tue Apr 2 16:41:38 2024 +0800
Merge branch 'atc_choice' into atc_add_choice
commit 8e3a9c396a3e5546d3faf584183f6fd60b974d5e
Merge: 150a036 0a6a03f
Author: DseidLi <2568818204@qq.com>
Date: Tue Mar 26 04:47:07 2024 +0800
Merge branch 'main' into atc_choice
Conflicts:
configs/summarizers/needlebench.py
opencompass/datasets/needlebench/multi.py
opencompass/datasets/needlebench/origin.py
opencompass/datasets/needlebench/parallel.py
commit 150a036d6d990f26a57c974d1af83d88c31a0f9d
Merge: 8d6ac9a 940dd18
Author: DseidLi <2568818204@qq.com>
Date: Wed Mar 20 03:49:08 2024 +0800
Merge branch 'needlebench_fix' into atc_choice
commit 8d6ac9a1a43b1c9d0f0ea27e7d58968a203ea898
Author: DseidLi <2568818204@qq.com>
Date: Wed Mar 20 03:41:49 2024 +0800
optimize needlebench code
commit 940dd18a4270f24bc69edd2a780182c68918e1a9
Author: DseidLi <2568818204@qq.com>
Date: Wed Mar 20 03:39:46 2024 +0800
fix vllm
commit d8be6877bc41051f3edcc0421c462c834c0f1c9a
Merge: ecad78a 2527fda
Author: DseidLi <2568818204@qq.com>
Date: Tue Mar 19 21:07:08 2024 +0800
Merge remote-tracking branch 'origin/add_1M_dataset' into atc_choice
commit 2527fda8a5
Author: DseidLi <2568818204@qq.com>
Date: Tue Mar 19 16:03:40 2024 +0800
add model configs
commit 75425acdf8
Author: DseidLi <2568818204@qq.com>
Date: Tue Mar 19 16:02:15 2024 +0800
add prompt postion args
commit 367ba1ba61
Author: DseidLi <2568818204@qq.com>
Date: Wed Feb 28 21:40:00 2024 +0800
add Needlebench-1000K configs
commit ecad78af14c4bb00fe325779114b384c57ab30bf
Author: DseidLi <2568818204@qq.com>
Date: Thu Mar 14 22:08:32 2024 +0800
fix atc
commit 08772c0787b18872abadc9ffec3223941a5ee0c2
Merge: 9f3f8cf caf1cf8
Author: DseidLi <2568818204@qq.com>
Date: Thu Mar 14 22:07:28 2024 +0800
Merge branch 'main' into atc_choice
Conflicts:
configs/datasets/needlebench/readme.md
configs/datasets/needlebench/readme_zh-CN.md
configs/summarizers/needlebench.py
opencompass/datasets/needlebench/atc.py
opencompass/summarizers/needlebench.py
commit 9f3f8cfb4452722734d334114ac1d14110e57406
Author: DseidLi <2568818204@qq.com>
Date: Thu Mar 14 21:35:53 2024 +0800
add atc-choice test
commit 52be7c1202376b4e09821188b826f1a805328129
Author: DseidLi <2568818204@qq.com>
Date: Wed Mar 6 02:54:15 2024 +0800
update needlebench randomseed and add vllm qwen14b
commit fc1effce596ae2e5ece4933e8cd34aef8e64a6f9
Merge: 4e747ed caf1cf8
Author: DseidLi <2568818204@qq.com>
Date: Wed Mar 6 02:51:14 2024 +0800
Merge branch 'main' into add_model_configs
commit 31834f9b23af3354ac3581ec86d693d0f05cdd1c
Merge: 7dabc82 120bf8b
Author: DseidLi <2568818204@qq.com>
Date: Sun Mar 3 23:29:42 2024 +0800
Merge branch 'main' of https://github.com/open-compass/opencompass into atc_choice
commit 4e747ed1988ddbcfcc7fff334601259ade72d363
Author: DseidLi <2568818204@qq.com>
Date: Sun Mar 3 22:15:25 2024 +0800
add internlm2-lmdeploy model and gemma configs
commit 7dabc828123d711c8cf834d6aab4137bb55e85ed
Author: DseidLi <2568818204@qq.com>
Date: Sat Mar 2 17:26:15 2024 +0800
add atc choice version -ZH
commit 996f8ae43d
Author: DseidLi <2568818204@qq.com>
Date: Wed Feb 28 16:58:56 2024 +0800
update readme for needlebench
commit f7266e873c
Author: DseidLi <2568818204@qq.com>
Date: Wed Feb 28 16:44:53 2024 +0800
move readme.md
commit 1c7375681d
Author: DseidLi <2568818204@qq.com>
Date: Wed Feb 28 16:38:31 2024 +0800
fix linting error
commit b6524f3ebf
Author: DseidLi <2568818204@qq.com>
Date: Wed Feb 28 16:33:51 2024 +0800
lint summarizer
commit c0d1190e39
Author: DseidLi <2568818204@qq.com>
Date: Wed Feb 28 16:29:03 2024 +0800
add needlebench intro, fix summarizer
commit 0965baf785
Author: DseidLi <2568818204@qq.com>
Date: Mon Feb 26 13:31:26 2024 +0800
fix bug in needlebench summarizer
commit 5d32b31eb8
Author: DseidLi <2568818204@qq.com>
Date: Sat Feb 24 03:19:08 2024 +0800
update act prompt
commit af82a7f085
Merge: 32bf9fe
53fe788
Author: DseidLi <2568818204@qq.com>
Date: Fri Feb 23 17:50:32 2024 +0800
Merge remote-tracking branch 'upstream/main' into needlebench
commit 32bf9fe802
Author: DseidLi <2568818204@qq.com>
Date: Fri Feb 23 17:31:32 2024 +0800
simplify needlebench 32k, 128k, 200k for eval
commit a7cb025e05
Author: DseidLi <2568818204@qq.com>
Date: Fri Feb 23 14:48:58 2024 +0800
add needlebench
* fix summarizer
* remove repeated code
* remove chinese comments
2024-04-07 15:46:20 +08:00
Mo Li
b50d163265
[Fix] Refactor Needlebench Configs for CLI Testing Support ( #1020 )
...
* add needlebench datasets suffix
* fix import
* update run.py args for summarizer key and dataset suffix
* update utils/run.py
2024-04-07 15:12:56 +08:00
bittersweet1999
2d4e559763
[Feature] Add multi-model judge and fix some problems ( #1016 )
...
* support multi-model judge and moe judge
* test_moe
* test_moe
* test
* add moe judge
* support multi-judge-model
2024-04-02 11:52:06 +08:00
Y0oMu
c220550fb9
updates docs ( #1015 )
...
Co-authored-by: youmuspc <yejiayi2004@outlook.com>
2024-04-02 10:30:04 +08:00
bittersweet1999
02e7eec911
[Feature] Support AlpacaEval_V2 ( #1006 )
...
* support alpacaeval_v2
* support alpacaeval
* update docs
* update docs
2024-03-28 16:49:04 +08:00
Mo Li
0a6a03fe1a
[Feature] update needlebench and configs ( #986 )
...
* add Needlebench-1000K configs
* add prompt postion args
* add model configs
* Update parallel.py
* fix lint
2024-03-25 18:05:01 +08:00
bittersweet1999
0665bb91a8
[Fix] Quick fix ( #995 )
2024-03-22 19:54:19 +08:00
Chaseldot
1d3198554b
[Fix] base.py change status into list ( #994 )
2024-03-22 17:06:34 +08:00
Ke Bao
e415ddf96a
[Fix] Fix turbomind_tis ( #992 )
2024-03-22 15:50:12 +08:00
bittersweet1999
054e9fa7e5
[Feature] add one script for subjective ( #993 )
...
* add one script for subjective
* add one script for subjective
* add one script for subjective
* add one script for subjective
---------
Co-authored-by: thebestannie <1290646445@qq.com>
2024-03-20 23:20:41 +08:00
Connor-Shen
0221d30877
[Fix] Update APPS/TACO ( #988 )
...
* [Feature] update apps/taco
* [Feature] update apps/taco
2024-03-19 20:21:39 +08:00
Connor-Shen
8a3c6e51ed
[Feature] Update APPS ( #985 )
...
* update post process
* update post process
2024-03-19 15:47:05 +08:00
Connor-Shen
d92595b671
[Feat] Support TACO ( #966 )
...
* [Feat] Support TACO
* update README
* update README
2024-03-19 15:39:16 +08:00
bittersweet1999
c78a4df923
add support for set prediction path ( #984 )
2024-03-19 14:32:15 +08:00
klein
4d2591acb2
modify the requirements/runtime.txt: numpy==1.23.4 --> numpy>=1.23.4 ( #983 )
...
Co-authored-by: zhangchuyu@pjlab.org.cn <zhangchuyu@pjlab.org.cn>
2024-03-18 20:25:55 +08:00
Jingming
89a8a8917b
[Feature] Add the implement of QuALITY datasets ( #976 )
...
#976
2024-03-15 21:22:38 +08:00
Jingming
c2d4717be2
[Fix] Fix a bug in internlm2 series configs ( #977 )
2024-03-15 15:21:35 +08:00
seanzhang-zhichen
7baa711fc7
[Fix] Fix doc problem ( #975 )
...
Co-authored-by: zhangzc <2608882093@qq.com>
2024-03-15 13:44:46 +08:00
Connor-Shen
3098d78845
[Bench] Support APPS ( #963 )
...
* [Feat] support apps
* [Feat] support apps
* [Feat] support apps
* update README
2024-03-13 16:09:23 +08:00
Fengzhe Zhou
2a741477fe
update links and checkers ( #890 )
2024-03-13 11:01:35 +08:00
Jingming
4c1533e59e
[Fix] fix the config's name of deepseek-coder ( #964 )
2024-03-12 19:36:52 +08:00
Fengzhe Zhou
ab6cdb2be8
[Sync] Bump version 0.2.3 ( #957 )
2024-03-12 11:51:56 +08:00
Fengzhe Zhou
64fde73b15
[Fix] Use logger.error on failure ( #960 )
2024-03-12 11:51:39 +08:00
Fengzhe Zhou
ed663ca17b
[Misc] Update owners ( #961 )
2024-03-12 11:51:25 +08:00
Songyang Zhang
47cb75a3f7
[Docs] Update README ( #956 )
...
* [Docs] Update README
* Update README.md
* [Docs] Update README
2024-03-12 11:40:34 +08:00
Fengzhe Zhou
bdd85358cc
[Sync] update 20240308 ( #953 )
2024-03-11 22:34:19 +08:00
bittersweet1999
848e7c8a76
[fix] add different temp for different question in mtbench ( #954 )
...
* add temp for mtbench
* add document for mtbench
* add document for mtbench
2024-03-11 17:24:39 +08:00
Songyang Zhang
7c1a819bb4
[Fix] Chinese version of ReadTheDoc ( #947 )
...
* [Fix] Chinese version of ReadTheDoc
* rename
---------
Co-authored-by: Leymore <zfz-960727@163.com>
2024-03-08 18:10:05 +08:00
Yang Yong
3829be87b1
Fix LightllmApi ppl test ( #951 )
2024-03-08 12:04:44 +08:00
Yang Yong
107e022cf4
Support prompt template for LightllmApi. Update LightllmApi token bucket. ( #945 )
2024-03-06 15:33:53 +08:00
RunningLeon
c54a5d3b0f
Support get_ppl for TurbomindModel ( #878 )
...
* update ppl for turbomindmodel
* update api_server
* rename config and set thread_safe for pytorch engine if possible
2024-03-06 11:44:19 +08:00
fanqiNO1
caf1cf8a17
[Docs] Update rank link ( #911 )
2024-03-05 20:33:44 +08:00
Xu Song
2e993989a6
[Fix] FinanceIQ_datasets import error ( #939 )
...
* [Fix] Fix KeyError: 'FinanceIQ_datasets'
* [Fix] Fix KeyError: 'FinanceIQ_datasets'
2024-03-05 20:32:24 +08:00
Jingming
66d3aa4c01
[Feature] Add configs of deepseek-coder ( #943 )
2024-03-05 11:38:28 +08:00
Jingming
d0550268f3
[Fix] fix a bug of humanevalplus config ( #944 )
2024-03-05 11:37:17 +08:00
Fengzhe Zhou
b03d5dc531
[Sync] Sync Internal ( #941 )
2024-03-04 14:42:36 +08:00
yuantao2108
bbec7d8733
[Feature] add lveval benchmark ( #914 )
...
* add lveval benchmark
* add LVEval readme file
* update LVEval readme file
* Update configs/eval_bluelm_32k_lveval.py
* Update configs/eval_llama2_7b_lveval.py
---------
Co-authored-by: yuantao <yuantao@infini-ai.com>
Co-authored-by: Mo Li <82895469+DseidLi@users.noreply.github.com>
2024-03-04 11:22:03 +08:00
Mo Li
8142f399a8
[Feature] Upgrade the needle-in-a-haystack experiment to Needlebench ( #913 )
...
* add needlebench
* simplify needlebench 32k, 128k, 200k for eval
* update act prompt
* fix bug in needlebench summarizer
* add needlebench intro, fix summarizer
* lint summarizer
* fix linting error
* move readme.md
* update readme for needlebench
* update docs of needlebench
* simplify needlebench summarizers
2024-03-04 11:10:52 +08:00
Mo Li
120bf8b399
add vllm model configs ( #938 )
2024-03-01 17:31:51 +08:00
Kdump
3e9844ed33
[Fix]Fixed the problem of never entering task.run() mode in local scheduling mode. ( #930 )
...
* Fixed the problem of never entering task.run() mode in local scheduling mode.
get_command_template方法中为命令行前缀添加了CUDA_VISIBLE_DEVICES=或set CUDA_VISIBLE_DEVICES=。导致task.run()分支失效。
---------
CUDA_VISIBLE_DEVICES= or set CUDA_VISIBLE_DEVICES= is added to the command line prefix in the get_command_template method. Causes the task.run() branch to fail.
* [Fix]Fixed the problem of never entering task.run() mode in local scheduling mode.
get_command_template方法中为命令行前缀添加了CUDA_VISIBLE_DEVICES=或set CUDA_VISIBLE_DEVICES=。导致task.run()分支失效。
---
CUDA_VISIBLE_DEVICES= or set CUDA_VISIBLE_DEVICES= is added to the command line prefix in the get_command_template method. Causes the task.run() branch to fail.
* [Fix]Fixed the problem of never entering task.run() mode in local scheduling mode.
get_command_template方法中为命令行前缀添加了CUDA_VISIBLE_DEVICES=或set CUDA_VISIBLE_DEVICES=。导致task.run()分支失效。
CUDA_VISIBLE_DEVICES= or set CUDA_VISIBLE_DEVICES= is added to the command line prefix in the get_command_template method. Causes the task.run() branch to fail.
2024-02-29 14:35:45 +08:00
Skyfall-xzz
4c45a71bbc
[Feature] Support OpenFinData ( #896 )
...
* [Feature] Support OpenFinData
* add README for OpenFinData
* update README
2024-02-29 12:55:07 +08:00