Songyang Zhang
|
aa2b89b6f8
|
[Update] Add CascadeEvaluator with Data Replica (#2022)
* Update CascadeEvaluator
* Update CascadeEvaluator
* Update CascadeEvaluator
* Update Config
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
* Update
|
2025-05-20 16:46:55 +08:00 |
|
Linchen Xiao
|
508e2b0cb2
|
[Update] Set load_from_cache_file to False (#2085)
|
2025-05-09 15:21:47 +08:00 |
|
Junnan Liu
|
97010dc4ce
|
[Update] Update dataset repeat concatenation (#2039)
|
2025-04-23 16:16:28 +08:00 |
|
Junnan Liu
|
73c80953c6
|
[Feature] Support Dataset Repeat and G-Pass Compute for Each Evaluator (#1886)
* support dataset repeat and g-pass compute for each evaluator
* fix pre-commit errors
* delete print
* delete gpassk_evaluator and fix potential errors
* change `repeat` to `n`
* fix `repeat` to `n` in openicl_eval
* update doc for multi-run and g-pass
* update latex equation in doc
* update eng doc for multi-run and g-pass
* update datasets.md
* update datasets.md
* fix multi-line equation
* fix multi-line equation
* fix multi-line equation
* fix multi-line equation
* fix multi-line equation
* fix multi-line equation
* fix multi-line equation in zh_cn user_guides
* mmodify pre-commit-zh-cn
* recover pre-commit and edit math expr in doc
* del [TIP]
* del cite tag in doc
* del extract_model param in livemathbench config
|
2025-02-26 19:43:12 +08:00 |
|
Songyang Zhang
|
fd6fbf01a2
|
[Update] Support AIME-24 Evaluation for DeepSeek-R1 series (#1888)
* Update
* Update
* Update
* Update
|
2025-02-25 20:34:41 +08:00 |
|
Leymore
|
c94cc94348
|
Add release contribution
|
2023-07-05 03:15:31 +00:00 |
|