## Introduction
E1-Math-1.5B is a language model fine-tuned from DeepSeek-R1-Distill-Qwen-1.5B. It is trained for Elastic Reasoning with a budget-constrained rollout strategy integrated into GRPO, which teaches the model to reason adaptively when its thinking process is cut short and to generalize effectively to unseen budget constraints without additional training.
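The budget-constrained rollout described above can be sketched as follows. This is a minimal illustration, not the repo's actual implementation: `generate_tokens`, the default budget values, and the `</think>` marker handling are assumptions made for the sketch.

```python
def budget_constrained_rollout(generate_tokens, prompt,
                               thinking_budget=1024, solution_budget=512):
    """Sketch of a separate-budget rollout: the thinking phase is hard-capped
    at `thinking_budget` tokens, and the solution phase always gets its own
    `solution_budget`, regardless of where thinking was cut off.
    `generate_tokens` is a hypothetical stand-in for a model's
    token-by-token decoder (prompt in, token stream out)."""
    thinking = []
    for tok in generate_tokens(prompt):
        if tok == "</think>":
            break  # model finished thinking on its own
        thinking.append(tok)
        if len(thinking) >= thinking_budget:
            break  # thinking cut short; the model must still answer
    # Force the end-of-thinking marker so the solution phase always starts,
    # even when the thinking trace was truncated mid-stream.
    prefix = prompt + "".join(thinking) + "</think>"
    solution = []
    for tok in generate_tokens(prefix):
        solution.append(tok)
        if len(solution) >= solution_budget:
            break
    return "".join(thinking), "".join(solution)
```

Training under rollouts like this is what lets the model produce a usable answer even when the thinking budget at inference time differs from anything seen during training.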
## Performance
| Model | Tokens | Acc (%) | Tokens | Acc (%) | Tokens | Acc (%) | Tokens | Acc (%) | Tokens | Acc (%) |
|---------------|--------------|---------------|--------------|---------------|--------------|---------------|--------------|---------------|--------------|---------------|
| DeepScaleR-1.5B | 10050 | 41.0 | 1488 | 5.2 | 1904 | 9.6 | 2809 | 15.8 | 3700 | 22.7 |
| E1-Math-1.5B | 6825 | 35.0 | 1340 | 13.5 | 1799 | 17.5 | 2650 | 24.8 | 3377 | 27.9 |
## Usage
For detailed usage, please refer to the [Elastic-Reasoning repository](https://github.com/SalesforceAIResearch/Elastic-Reasoning).