bittersweet1999
|
2d4e559763
|
[Feature] Add multi-model judge and fix some problems (#1016)
* support multi-model judge and moe judge
* test_moe
* test_moe
* test
* add moe judge
* support multi-judge-model
|
2024-04-02 11:52:06 +08:00 |
|
bittersweet1999
|
02e7eec911
|
[Feature] Support AlpacaEval_V2 (#1006)
* support alpacaeval_v2
* support alpacaeval
* update docs
* update docs
|
2024-03-28 16:49:04 +08:00 |
|
bittersweet1999
|
c78a4df923
|
add support for set prediction path (#984)
|
2024-03-19 14:32:15 +08:00 |
|
bittersweet1999
|
848e7c8a76
|
[fix] add different temp for different question in mtbench (#954)
* add temp for mtbench
* add document for mtbench
* add document for mtbench
|
2024-03-11 17:24:39 +08:00 |
|
bittersweet1999
|
001e77fea2
|
[Feature] add support for gemini (#931)
* add gemini
* add gemini
* add gemini
|
2024-02-28 19:38:34 +08:00 |
|
bittersweet1999
|
7806cd0f64
|
[Feature] support alpacaeval (#809)
* support alpacaeval_v1
* Update opencompass/summarizers/subjective/__init__.py
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
* Update opencompass/summarizers/subjective/alpacaeval_v1.py
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
* fix conflict
* support alpacaeval v2
* support alpacav2
---------
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
|
2024-02-04 14:18:36 +08:00 |
|
bittersweet1999
|
5c6dc908cd
|
fix compass arena (#854)
|
2024-01-30 16:34:38 +08:00 |
|
bittersweet1999
|
77be07dbb5
|
[Fix] fix corev2 (#838)
* fix corev2
* fix corev2
|
2024-01-24 18:15:29 +08:00 |
|
bittersweet1999
|
2ee8e8a1a1
|
[Feature] add mtbench (#829)
* add mtbench
* add mtbench
* Update configs/datasets/subjective/multiround/mtbench_judgeby_gpt4.py
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
* Update configs/datasets/subjective/multiround/mtbench_judgeby_gpt4.py
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
* Update opencompass/datasets/subjective/__init__.py
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
* Update opencompass/datasets/subjective/mtbench.py
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
* fix mtbench
---------
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
|
2024-01-24 12:11:47 +08:00 |
|
bittersweet1999
|
2d4da8dd02
|
[Feature] Add CompassArena (#828)
* add compass arena
* add compass_arena
* add compass arena
* Update opencompass/summarizers/subjective/compass_arena.py
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
* Update opencompass/summarizers/subjective/__init__.py
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
* Update opencompass/datasets/subjective/compass_arena.py
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
* Update opencompass/datasets/subjective/__init__.py
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
* Update configs/eval_subjective_compassarena.py
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
* Update configs/datasets/subjective/compassarena/compassarena_compare.py
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
* Update configs/eval_subjective_compassarena.py
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
* Update configs/datasets/subjective/compassarena/compassarena_compare.py
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
* fix check position bias
---------
Co-authored-by: Songyang Zhang <tonysy@users.noreply.github.com>
|
2024-01-23 15:12:46 +08:00 |
|
bittersweet1999
|
814b3f73bd
|
reorganize subject files (#801)
|
2024-01-16 18:03:11 +08:00 |
|