Songyang Zhang
3f36db3b06
[Feature] Support turbomind ( #166 )
...
* support turbomind
* update doc
* Update docs/en/advanced_guides/evaluation_turbomind.md
Co-authored-by: Tong Gao <gaotongxiao@gmail.com>
* Update docs/zh_cn/advanced_guides/evaluation_turbomind.md
Co-authored-by: Tong Gao <gaotongxiao@gmail.com>
* Update docs/zh_cn/advanced_guides/evaluation_turbomind.md
Co-authored-by: Tong Gao <gaotongxiao@gmail.com>
* Update docs/en/advanced_guides/evaluation_turbomind.md
Co-authored-by: Tong Gao <gaotongxiao@gmail.com>
* update
---------
Co-authored-by: Tong Gao <gaotongxiao@gmail.com>
2023-08-10 16:25:11 +08:00
Leymore
e7fc54baf1
[Feature] Add Xiezhi SQuAD2.0 ANLI ( #101 )
...
* add Xiezhi SQuAD2.0 ANLI; update WSC
* update
* update
* update doc string
2023-08-10 14:04:18 +08:00
Yuan Liu
a205629ff3
[Feature]: Refactor input and output ( #176 )
...
* [Feature]: Refactor input and output
* [Feature]: Update tasks
2023-08-10 14:01:28 +08:00
Leymore
876ade71a5
[Fix] Fix AGIEval multiple choice ( #137 )
...
* update agieval data
* rename variables
2023-08-10 11:38:24 +08:00
dependabot[bot]
0555d59a6a
Bump requests from 2.28.1 to 2.31.0 ( #178 )
...
Bumps [requests](https://github.com/psf/requests ) from 2.28.1 to 2.31.0.
- [Release notes](https://github.com/psf/requests/releases )
- [Changelog](https://github.com/psf/requests/blob/main/HISTORY.md )
- [Commits](https://github.com/psf/requests/compare/v2.28.1...v2.31.0 )
---
updated-dependencies:
- dependency-name: requests
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-08-09 19:41:09 +08:00
Tong Gao
e6194df29e
[Fix] Use a copy of the config object in Task ( #174 )
2023-08-09 15:24:49 +08:00
Haodong Duan
d5d4f47371
[API] Refine OpenAI ( #175 )
2023-08-09 12:38:57 +08:00
Zaida Zhou
af436f5951
[Feature] Calculate max_out_len without hard code for OpenAI model ( #158 )
...
* calulate max_out_len without hard code
* set default value
* update configs
* Update configs/eval_gpt3.5.py
Co-authored-by: Tong Gao <gaotongxiao@gmail.com>
---------
Co-authored-by: Tong Gao <gaotongxiao@gmail.com>
2023-08-08 15:16:56 +08:00
Yuan Liu
2f1949e7a1
[Feature]: Add mm suport for local ( #169 )
2023-08-08 14:21:58 +08:00
Songyang Zhang
5b80d83866
[Docs] update readme ( #165 )
2023-08-08 12:49:04 +08:00
Haodong Duan
6ca2be6626
[Script] Add scripts to evaluate MMBench ( #161 )
...
* update
* update
* Update README.md
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
* refine
* update default
* update CN README
---------
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
2023-08-07 16:53:36 +08:00
Tong Gao
1bab316624
update internal readme ( #162 )
2023-08-07 14:27:15 +08:00
Tong Gao
bbdedc6c95
[Enhancement] Optimize OpenAI models ( #128 )
...
* [Feature] Enhance OpenAI API, add example config for GPT evaluation
2023-08-03 14:55:16 +08:00
Haodong Duan
d17a5b94fa
[Refine] Refine PR #122 ( #123 )
...
* update
* update
2023-08-03 14:54:38 +08:00
Yuan Liu
191a3f6f9d
[Feature]: Use multimodal ( #73 )
...
* [Feature]: Add minigpt-4
* [Feature]: Add mm local runner
* [Feature]: Add instructblip
* [Feature]: Delete redundant file
* [Feature]: Delete redundant file
* [Feature]: Add README to InstructBLIP
* [Feature]: Update MiniGPT-4
* [Fix]: Fix lint
* [Feature]add omnibenchmark readme (#49 )
* add omnibenchmark readme
* fix
* Update OmniMMBench.md
* Update OmniMMBench.md
* Update OmniMMBench.md
* [Fix]: Refine name (#54 )
* [Feature]: Unify out and err
* [Fix]: Fix lint
* [Feature]: Rename to mmbench and change weight path
* [Feature]: Delete Omni in instructblip
* [Feature]: Check the avaliablity of lavis
* [Fix]: Fix lint
* [Feature]: Refactor MM
* [Refactor]: Refactor path
* [Feature]: Delete redundant files
* [Refactor]: Delete redundant files
---------
Co-authored-by: Wangbo Zhao(黑色枷锁) <56866854+wangbo-zhao@users.noreply.github.com>
2023-08-03 11:07:50 +08:00
Zaida Zhou
289e0567bd
Fix typo in readme ( #152 )
2023-08-02 19:01:39 +08:00
Leymore
bbe45c68a3
[Doc] update acknowledgements ( #147 )
2023-08-02 10:16:53 +08:00
Tong Gao
8b163bd8e9
[Feature] Several enhancements ( #142 )
2023-08-01 18:19:49 +08:00
Tong Gao
c00179d46b
[Feature] Evaluating acc based on minimum edit distance, update SIQA ( #130 )
...
* [Feature] Support evaluating acc based on minimum edit distance, update SIQA
* update
2023-08-01 14:24:27 +08:00
Ezra-Yu
e9b7b8ab02
[DOC] Add metric doc ( #118 )
...
* update
* update
* update metric docs
* update index.rst
* update metrics
2023-08-01 11:47:04 +08:00
Songyang Zhang
d860b61d04
[Enhancement] Update README.md ( #119 )
...
* Update README.md
* update README_zh-CN.md
* update get_started
---------
Co-authored-by: Leymore <zfz-960727@163.com>
2023-07-31 18:26:46 +08:00
Leymore
262ab794fb
[Docs] Update prompt docs ( #46 )
...
* [Docs] Update prompt docs
* update
* [Docs] Prompt docs (#112 )
* update docs
* update
* update
* Update en prompt template
* Update en prompt doc
* fix
* fix
---------
Co-authored-by: Tong Gao <gaotongxiao@gmail.com>
2023-07-29 00:46:13 +08:00
Anakin Skywalker
e04f88424d
edit doc ( #125 )
2023-07-28 17:33:51 +08:00
Leymore
d862f570aa
[Feature] Add SC ( #126 )
...
* add self-consistency
* add CoT method Self-Consistency
* fix typo error and update openicl_eval
* add tydiQA-GoldP task
* fix sc
* rename gsm8k_sc
* fix sc
* add self-consistency doc
* refine sc
---------
Authored-by: liushz <qq1791167085@163.com>
2023-07-28 17:29:37 +08:00
Haodong Duan
538b439302
[Fix] Fix seed in HFEvaluator ( #122 )
2023-07-28 11:29:01 +08:00
Haodong Duan
46c9645753
[Feature] Allow explicitly setting the temperature for API model ( #121 )
...
* allow explicitly setting the temperature
* update
2023-07-28 11:28:15 +08:00
Tong Gao
80ce18f860
[Docs] Update issue templates for proper guidance to discussions ( #116 )
2023-07-27 19:38:41 +08:00
gowithme
57fcfc975a
[Feature] Support intern lanuage model ( #51 )
...
* support internLM
* support internLM
* simplify intern model files
* update storage_manager
* support internLM
* Modify the file organization structure
* support internLM
* support internLM
* support internLM
* support internLM
* change some details
2023-07-27 18:49:36 +08:00
vansin
8a4d0867ab
Doc: add twitter link ( #111 )
2023-07-27 17:19:35 +08:00
Ezra-Yu
d1ec6047af
[Doc] Update Readme and Fix failed links ( #108 )
...
* update reame and fix failed links
* update
* update review
2023-07-27 17:15:25 +08:00
Hubert
aa13067735
[Feat] add auto assignee bot ( #105 )
...
* [Feat] add auto assignee bot
* minor fix
2023-07-26 16:43:31 +08:00
Hubert
b7184e9db5
[Refactor] Update crows-pairs evaluation ( #98 )
...
* [Refactor] Update crows-pairs evaluation
* [Refactor] Update crows-pairs evaluation
* minor
2023-07-26 11:21:32 +08:00
Haodong Duan
4b0aa80466
[Fix] MMBench Doc Fix ( #96 )
...
* update
* update
* fix lint
2023-07-25 10:43:22 +08:00
Tong Gao
3715be6595
[Fix] Fix llama configs ( #72 )
...
Co-authored-by: Leymore <zfz-960727@163.com>
2023-07-25 10:21:31 +08:00
Haonan Li
e9cdb24ddd
[Feature] Add CMMLU dataset ( #91 )
...
* add CMMLU
* debug cmmlu
* add slurm args `qos`
* fix format: space before comment
* remove unused variable
* change the location of `answer is`
---------
Co-authored-by: 李浩楠 <lihaonan@lihaonandeMacBook-Air.local>
Co-authored-by: 李浩楠 <haonan.li>
Co-authored-by: Leymore <zfz-960727@163.com>
2023-07-25 10:14:27 +08:00
Haodong Duan
6e885d668b
force utf-8 encoding for all non-dataset fileios ( #97 )
2023-07-25 10:06:01 +08:00
Leymore
3fe5ee096c
[Feature] Add heuristic size partitioner ( #63 )
...
* [Feature] Add heuristic size partitioner
* update
2023-07-20 11:53:24 +08:00
Leymore
eea8b04417
[Feature] Add llama-2 models ( #81 )
...
* add llama-2 models
* update docs
---------
Co-authored-by: gaotongxiao <gaotongxiao@gmail.com>
2023-07-19 19:51:29 +08:00
Hubert
f83e125e5a
[Feat] Support CValues Responsibility dataset ( #78 )
...
* [Feat] support CValues
* minor fix
2023-07-18 18:45:15 +08:00
LZH
26e2f171f4
[Feature] Support load PEFT adapter for HuggingFace model ( #74 )
...
* support peft for HuggingFace model
* add docstring
2023-07-18 16:21:43 +08:00
liushz
f36c0496f3
[Feature] Add tydiqa-goldp ( #75 )
...
Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>
2023-07-18 14:54:35 +08:00
Hubert
29598e3619
[Feat] add falcon-40b ( #76 )
...
* [Feat] add falcon-40b
* minor fix
2023-07-18 14:40:16 +08:00
Tong Gao
311bf0daa7
[Fix] Fix CI ( #70 )
...
* [Fix] Fix CI
* [Fix] Fix CI
* [Fix] Fix CI
* update
2023-07-17 19:10:59 +08:00
Tong Gao
29006e39c0
[Fix] Fix circular import of PromptTemplate ( #71 )
2023-07-17 19:09:38 +08:00
Tong Gao
1e44541730
[Enhancement] Test linting in CI and fix existing linting errors ( #69 )
...
* [Enhancement] Test linting in CI
* fix linting
2023-07-17 15:59:10 +08:00
Leymore
9a16448905
[Fix] eval_llama_7b ( #68 )
2023-07-17 15:28:21 +08:00
Leymore
edb23d15d1
[Feature] Add baichuan13b model configs ( #60 )
...
* [Feature] Add baichuan13b
* update num_gpus
2023-07-17 14:38:12 +08:00
Leymore
1326aff77e
[Feature] Add logger info and remove dataset bugs ( #61 )
...
* Add logger info and remove dataset bugs
* fix typo
2023-07-17 14:26:30 +08:00
Tong Gao
77a1cc4486
[Docs] Update evaluation doc ( #39 )
2023-07-17 14:12:19 +08:00
Leymore
e19a0c1cf8
[Feature] add --dry-run option ( #59 )
2023-07-17 10:41:38 +08:00