Format Distributional Reasoning
Collection
FDR
•
3 items
•
Updated
•
1
This model is a fine-tuned version of Qwen/Qwen2.5-7B on the aqua_rat_multiple_choice and the aqua_rat_open_form datasets. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.4525 | 0.1642 | 100 | 0.4681 |
| 0.4298 | 0.3284 | 200 | 0.4217 |
| 0.4128 | 0.4926 | 300 | 0.4075 |
| 0.4247 | 0.6568 | 400 | 0.3982 |
| 0.3922 | 0.8210 | 500 | 0.3916 |
| 0.3634 | 0.9852 | 600 | 0.3876 |
| 0.351 | 1.1494 | 700 | 0.3857 |
| 0.3613 | 1.3136 | 800 | 0.3825 |
| 0.3655 | 1.4778 | 900 | 0.3792 |
| 0.3849 | 1.6420 | 1000 | 0.3762 |
| 0.3373 | 1.8062 | 1100 | 0.3736 |
| 0.358 | 1.9704 | 1200 | 0.3711 |
| 0.3476 | 2.1346 | 1300 | 0.3727 |
| 0.3318 | 2.2989 | 1400 | 0.3717 |
| 0.3309 | 2.4631 | 1500 | 0.3709 |
| 0.3141 | 2.6273 | 1600 | 0.3703 |
| 0.3252 | 2.7915 | 1700 | 0.3698 |
| 0.3446 | 2.9557 | 1800 | 0.3699 |
Base model
Qwen/Qwen2.5-7B