Ezra-Yu
|
17ccaa5980
|
[Feat] Add codegeex2 and Humanevalx (#210)
* add codegeex2
* add humanevalx dataset
* add evaluator
* update evaluator
* update configs
* update clean code
* update configs
* fix lint
* remove sleep
* fix lint
* update docs
* fix lint
|
2023-08-17 11:03:16 +08:00 |
|
Hubert
|
0fe2366a72
|
[Feat] support adv_glue dataset for adversarial robustness (#205)
* [Feat] support adv_glue dataset for adversarial robustness
* reorg files
* minor fix
* minor fix
|
2023-08-16 18:42:06 +08:00 |
|
Hubert
|
7c393192af
|
[Fix] fix bug for postprocessor (#195)
* [Fix] fix bug for postprocessor
* minor fix
|
2023-08-11 18:41:12 +08:00 |
|
Tong Gao
|
bf79ff1c6d
|
[Feature] Add LEval datasets
Co-authored-by: kennymckormick <dhd@pku.edu.cn>
|
2023-08-11 17:38:31 +08:00 |
|
Hubert
|
8d9cee060f
|
[Feat] update postprocessor to get first option more accurately (#193)
* [Feat] update postprocessor to get first option
* minor fix
* minor fix
|
2023-08-11 17:33:00 +08:00 |
|
Leymore
|
14332e08fd
|
[Feature] add llama-oriented dataset configs (#82)
* add llama-oriented dataset configs
* update
* revert cvalues & update llama_example
|
2023-08-11 12:48:05 +08:00 |
|
Hubert
|
5a9539f375
|
[Feat] add safety to collections (#185)
* [Feat] add safety to collections
* minor fix
|
2023-08-11 11:19:26 +08:00 |
|
Tong Gao
|
2931f3dcb8
|
[Enhancement] Add humaneval postprocessor for GPT models & eval config for GPT4, enhance the original humaneval postprocessor (#129)
* [Enhancement] Enhance humaneval postprocessor
* add human-eval testcase
* update
* update
---------
Co-authored-by: Leymore <zfz-960727@163.com>
|
2023-08-10 16:31:12 +08:00 |
|
Leymore
|
e7fc54baf1
|
[Feature] Add Xiezhi SQuAD2.0 ANLI (#101)
* add Xiezhi SQuAD2.0 ANLI; update WSC
* update
* update
* update doc string
|
2023-08-10 14:04:18 +08:00 |
|
Leymore
|
876ade71a5
|
[Fix] Fix AGIEval multiple choice (#137)
* update agieval data
* rename variables
|
2023-08-10 11:38:24 +08:00 |
|
Tong Gao
|
c00179d46b
|
[Feature] Evaluating acc based on minimum edit distance, update SIQA (#130)
* [Feature] Support evaluating acc based on minimum edit distance, update SIQA
* update
|
2023-08-01 14:24:27 +08:00 |
|
Leymore
|
d862f570aa
|
[Feature] Add SC (#126)
* add self-consistency
* add CoT method Self-Consistency
* fix typo error and update openicl_eval
* add tydiQA-GoldP task
* fix sc
* rename gsm8k_sc
* fix sc
* add self-consistency doc
* refine sc
---------
Authored-by: liushz <qq1791167085@163.com>
|
2023-07-28 17:29:37 +08:00 |
|
Hubert
|
b7184e9db5
|
[Refactor] Update crows-pairs evaluation (#98)
* [Refactor] Update crows-pairs evaluation
* [Refactor] Update crows-pairs evaluation
* minor
|
2023-07-26 11:21:32 +08:00 |
|
Haonan Li
|
e9cdb24ddd
|
[Feature] Add CMMLU dataset (#91)
* add CMMLU
* debug cmmlu
* add slurm args `qos`
* fix format: space before comment
* remove unused variable
* change the location of `answer is`
---------
Co-authored-by: 李浩楠 <lihaonan@lihaonandeMacBook-Air.local>
Co-authored-by: 李浩楠 <haonan.li>
Co-authored-by: Leymore <zfz-960727@163.com>
|
2023-07-25 10:14:27 +08:00 |
|
Hubert
|
f83e125e5a
|
[Feat] Support CValues Responsibility dataset (#78)
* [Feat] support CValues
* minor fix
|
2023-07-18 18:45:15 +08:00 |
|
liushz
|
f36c0496f3
|
[Feature] Add tydiqa-goldp (#75)
Co-authored-by: liuhongwei <liuhongwei@pjlab.org.cn>
|
2023-07-18 14:54:35 +08:00 |
|
Leymore
|
1326aff77e
|
[Feature] Add logger info and remove dataset bugs (#61)
* Add logger info and remove dataset bugs
* fix typo
|
2023-07-17 14:26:30 +08:00 |
|
Leymore
|
86d5ec3d0f
|
Update configs (#9)
* Update implements
* Update
|
2023-07-06 12:27:41 +08:00 |
|
Tong Gao
|
16e759b996
|
Align prompt files with their hash (#1)
* fix bbh
* fix bbh
* rename
|
2023-07-05 18:28:58 +08:00 |
|
mzr1996
|
04dd01a235
|
Update configs and code
|
2023-07-05 11:45:08 +08:00 |
|
Leymore
|
c94cc94348
|
Add release contribution
|
2023-07-05 03:15:31 +00:00 |
|
tonysy
|
e6b5bdcb87
|
OpenCompass Public MR
|
2023-07-05 03:15:21 +00:00 |
|
Ezra-Yu
|
cbe9fe2cdb
|
Add Release Contraibution
|
2023-07-05 02:22:40 +00:00 |
|
cky
|
36f111100f
|
update datasets
|
2023-07-05 01:45:26 +00:00 |
|
mzr1996
|
3cfe73de3f
|
Support a batch of datasets.
|
2023-07-05 01:30:27 +00:00 |
|
yingfhu
|
fb11108723
|
[Feat] support opencompass
|
2023-07-04 22:11:33 +08:00 |
|
gaotongxiao
|
7d346000bb
|
initial commit
|
2023-07-04 21:34:55 +08:00 |
|