Linchen Xiao
a05f9da134
[Feature] Make dump-eval-details default behavior ( #1999 )
...
* Update
* update
* update
2025-04-08 14:42:26 +08:00
Shudong Liu
277d7946f5
[Fix] Fix typo in deepseed_r1.md ( #1916 )
2025-03-05 19:37:22 +08:00
Songyang Zhang
c84bc18ac1
[Update] Support OlympiadBench-Math/OmniMath/LiveMathBench-Hard ( #1899 )
...
* [Update] Support OlympiadBench-Math/OmniMath/LiveMathBench-Hard with LLM Verify
* Update
* Update
* Update DeepSeek-R1 example
* Update DeepSeek-R1 example
* Update DeepSeek-R1 example
2025-03-03 18:56:11 +08:00
Junnan Liu
73c80953c6
[Feature] Support Dataset Repeat and G-Pass Compute for Each Evaluator ( #1886 )
...
* support dataset repeat and g-pass compute for each evaluator
* fix pre-commit errors
* delete print
* delete gpassk_evaluator and fix potential errors
* change `repeat` to `n`
* fix `repeat` to `n` in openicl_eval
* update doc for multi-run and g-pass
* update latex equation in doc
* update eng doc for multi-run and g-pass
* update datasets.md
* update datasets.md
* fix multi-line equation
* fix multi-line equation
* fix multi-line equation
* fix multi-line equation
* fix multi-line equation
* fix multi-line equation
* fix multi-line equation in zh_cn user_guides
* mmodify pre-commit-zh-cn
* recover pre-commit and edit math expr in doc
* del [TIP]
* del cite tag in doc
* del extract_model param in livemathbench config
2025-02-26 19:43:12 +08:00
Xingjun.Wang
edab1c07ba
[Feature] Support ModelScope datasets ( #1289 )
...
* add ceval, gsm8k modelscope surpport
* update race, mmlu, arc, cmmlu, commonsenseqa, humaneval and unittest
* update bbh, flores, obqa, siqa, storycloze, summedits, winogrande, xsum datasets
* format file
* format file
* update dataset format
* support ms_dataset
* udpate dataset for modelscope support
* merge myl_dev and update test_ms_dataset
* udpate dataset for modelscope support
* update readme
* update eval_api_zhipu_v2
* remove unused code
* add get_data_path function
* update readme
* remove tydiqa japanese subset
* add ceval, gsm8k modelscope surpport
* update race, mmlu, arc, cmmlu, commonsenseqa, humaneval and unittest
* update bbh, flores, obqa, siqa, storycloze, summedits, winogrande, xsum datasets
* format file
* format file
* update dataset format
* support ms_dataset
* udpate dataset for modelscope support
* merge myl_dev and update test_ms_dataset
* update readme
* udpate dataset for modelscope support
* update eval_api_zhipu_v2
* remove unused code
* add get_data_path function
* remove tydiqa japanese subset
* update util
* remove .DS_Store
* fix md format
* move util into package
* update docs/get_started.md
* restore eval_api_zhipu_v2.py, add environment setting
* Update dataset
* Update
* Update
* Update
* Update
---------
Co-authored-by: Yun lin <yunlin@U-Q9X2K4QV-1904.local>
Co-authored-by: Yunnglin <mao.looper@qq.com>
Co-authored-by: Yun lin <yunlin@laptop.local>
Co-authored-by: Yunnglin <maoyl@smail.nju.edu.cn>
Co-authored-by: zhangsongyang <zhangsongyang@pjlab.org.cn>
2024-07-29 13:48:32 +08:00
Fengzhe Zhou
a32f21a356
[Sync] Sync with internal codes 2024.06.28 ( #1279 )
2024-06-28 14:16:34 +08:00
Fengzhe Zhou
d656e818f8
[Docs] Remove --no-batch-padding and Use --hf-num-gpus ( #1205 )
...
* [Docs] Remove --no-batch-padding and Use -hf-num-gpus
* update
2024-05-29 16:30:10 +08:00
Songyang Zhang
063f5f5f49
[Update] Update performance of common benchmarks ( #1109 )
...
* [Update] Update performance of common benchmarks
* [Update] Update performance of common benchmarks
* [Update] Update performance of common benchmarks
2024-04-30 00:09:08 +08:00
Haodong Duan
3a232db471
[Deperecate] Remove multi-modal related stuff ( #1072 )
...
* Remove MultiModal
* update index.rst
* update README
* remove mmbench codes
* update news
---------
Co-authored-by: Leymore <zfz-960727@163.com>
2024-04-26 21:20:14 +08:00
Hubert
36360bdfc3
[Fix] fix filename typo ( #549 )
2023-11-07 14:00:26 +08:00
Songyang Zhang
239c2a346e
[Feature] Add support for MiniMax API ( #548 )
...
* update requirement
* update requirement
* update with minimax
* update api model
* Update readme
* fix error
---------
Co-authored-by: zhangsongyang <zhangsongyang@pjlab.org.cn>
2023-11-06 21:57:32 +08:00
Fengzhe Zhou
dbb20b8270
[Sync] update ( #517 )
2023-10-27 20:31:22 +08:00
Hubert
44c8d6cc60
[Docs] update invalid link in docs ( #499 )
2023-10-25 13:15:42 +08:00
Leymore
d7ff933a73
[Fix] Use jieba rouge in lcsts ( #459 )
...
* use jieba rouge in lcsts
* use rouge_chinese
2023-10-09 10:10:33 +08:00
Tong Gao
767c12a660
[Docs] update get_started ( #435 )
...
* [Docs] update get_started
* [Docs] Refactor get_started
* update
* add zh FAQ
* add cn doc
* update
* fix dead links
---------
Co-authored-by: Leymore <zfz-960727@163.com>
2023-10-07 11:49:40 +08:00
Songyang Zhang
3871188c89
[Feat] Update URL ( #368 )
2023-09-07 17:29:50 +08:00
Songyang Zhang
a05daab911
[Doc] Update Overview ( #242 )
...
* Update news
* update overview
* add framework
* update index
* update
---------
Co-authored-by: Leymore <zfz-960727@163.com>
2023-09-07 14:21:39 +08:00
Tong Gao
166022f568
[Docs] Update docs for new entry script ( #246 )
...
* update docs
* update docs
* update
* update en docs
* update
* update
---------
Co-authored-by: Leymore <zfz-960727@163.com>
2023-08-31 16:43:55 +08:00
Leymore
c0e58632ca
[Doc] Add summarizer doc ( #231 )
...
* add summarizer doc
* update
* update doc
* Apply suggestions from code review
---------
Co-authored-by: Tong Gao <gaotongxiao@gmail.com>
2023-08-23 11:18:01 +08:00
Ezra-Yu
e9b7b8ab02
[DOC] Add metric doc ( #118 )
...
* update
* update
* update metric docs
* update index.rst
* update metrics
2023-08-01 11:47:04 +08:00
Anakin Skywalker
e04f88424d
edit doc ( #125 )
2023-07-28 17:33:51 +08:00
Tong Gao
77a1cc4486
[Docs] Update evaluation doc ( #39 )
2023-07-17 14:12:19 +08:00
Leymore
e19a0c1cf8
[Feature] add --dry-run option ( #59 )
2023-07-17 10:41:38 +08:00
Tong Gao
fd57786954
[Docs] Polish docs ( #43 )
...
* [Docs] Polish docs
* apply suggestions
* apply suggestions
2023-07-13 09:07:53 +08:00
Ezra-Yu
0c6fb6cf67
[Doc] Update logo icon ( #32 )
...
* update logo_icon and fix type in docs
* rebase:
* update get_started
* update .gitignore
* remove extra lines
* remove extra 'S'
* update
* update
* update docs
* update docs
* update docs
---------
Co-authored-by: gaotongxiao <gaotongxiao@gmail.com>
2023-07-08 16:40:24 +08:00
Tong Gao
30a988a620
[Docs] Update dataset docs ( #19 )
...
* [Docs] Update dataset docs
* [Docs] Update dataset docs
2023-07-06 15:47:09 +08:00
mzr1996
d1025c3223
[Docs] Update config tutorials.
2023-07-06 15:07:21 +08:00
gaotongxiao
cbaa1bc8f9
Docs
2023-07-06 13:51:55 +08:00
Tong Gao
18ace3d549
Add docs ( #8 )
...
* Add docs
* update
* update
2023-07-06 12:58:58 +08:00
Hubert
7f8eee4725
[Docs] add en docs ( #15 )
...
* add en docs
* update
---------
Co-authored-by: gaotongxiao <gaotongxiao@gmail.com>
2023-07-06 12:58:44 +08:00
Ma Zerun
e035265352
[Docs] Add model docs. ( #11 )
...
* [Docs] Add model docs.
* Imporve according to comments
2023-07-06 12:44:08 +08:00
Ma Zerun
ed76b5d066
[Docs] Add config docs. ( #3 )
...
* [Docs] Add config docs.
* Update according to comments
2023-07-05 18:28:43 +08:00
Leymore
c94cc94348
Add release contribution
2023-07-05 03:15:31 +00:00
tonysy
e6b5bdcb87
OpenCompass Public MR
2023-07-05 03:15:21 +00:00
Ezra-Yu
cbe9fe2cdb
Add Release Contraibution
2023-07-05 02:22:40 +00:00
cky
36f111100f
update datasets
2023-07-05 01:45:26 +00:00
gaotongxiao
7d346000bb
initial commit
2023-07-04 21:34:55 +08:00