Skywork
/

Skywork-OR1-Math-7B

Model card Files Files and versions

chrisliu298 commited on Apr 13, 2025

Commit

88d0b7c

·

verified ·

1 Parent(s): 0a3069c

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -58,7 +58,7 @@ The **`Skywork-OR1`** (Open Reasoner 1) model series consists of powerful math a
 We evaluate our models on AIME24, AIME25, and LiveCodeBench. Instead of using Pass@1, which is common in prior work, we introduce Avg@K as the primary metric. This metric robustly measures a model's average performance across K independent attempts, reducing the impact of randomness and enhancing the reliability of the results. We believe that Avg@K provides a better reflection of a model's stability and reasoning consistency.
-We inlcude the detailed results in the following table.
 | Model | AIME24 (Avg@32) | AIME25 (Avg@32) | LiveCodeBench (8/1/24-2/1/25) (Avg@4) |
 |-------|---------|---------|--------------|

 We evaluate our models on AIME24, AIME25, and LiveCodeBench. Instead of using Pass@1, which is common in prior work, we introduce Avg@K as the primary metric. This metric robustly measures a model's average performance across K independent attempts, reducing the impact of randomness and enhancing the reliability of the results. We believe that Avg@K provides a better reflection of a model's stability and reasoning consistency.
+We include the detailed results in the following table.
 | Model | AIME24 (Avg@32) | AIME25 (Avg@32) | LiveCodeBench (8/1/24-2/1/25) (Avg@4) |
 |-------|---------|---------|--------------|