Format Distributional Reasoning
Collection
FDR • 3 items • Updated
• 1
This model is a fine-tuned version of Qwen/Qwen2.5-3B on the aqua_rat_multiple_choice and the aqua_rat_open_form datasets. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.5984 | 0.0821 | 100 | 0.6201 |
| 0.5397 | 0.1642 | 200 | 0.5328 |
| 0.4751 | 0.2463 | 300 | 0.5067 |
| 0.4854 | 0.3284 | 400 | 0.4936 |
| 0.5098 | 0.4105 | 500 | 0.4845 |
| 0.473 | 0.4926 | 600 | 0.4806 |
| 0.4599 | 0.5747 | 700 | 0.4754 |
| 0.4888 | 0.6568 | 800 | 0.4699 |
| 0.4579 | 0.7389 | 900 | 0.4668 |
| 0.4381 | 0.8210 | 1000 | 0.4628 |
| 0.4479 | 0.9031 | 1100 | 0.4603 |
| 0.4579 | 0.9852 | 1200 | 0.4579 |
| 0.4237 | 1.0673 | 1300 | 0.4566 |
| 0.4168 | 1.1494 | 1400 | 0.4564 |
| 0.4394 | 1.2315 | 1500 | 0.4530 |
| 0.4438 | 1.3136 | 1600 | 0.4513 |
| 0.4352 | 1.3957 | 1700 | 0.4481 |
| 0.4355 | 1.4778 | 1800 | 0.4467 |
| 0.4057 | 1.5599 | 1900 | 0.4443 |
| 0.4743 | 1.6420 | 2000 | 0.4429 |
| 0.4264 | 1.7241 | 2100 | 0.4413 |
| 0.4071 | 1.8062 | 2200 | 0.4400 |
| 0.4156 | 1.8883 | 2300 | 0.4385 |
| 0.3961 | 1.9704 | 2400 | 0.4375 |
| 0.3887 | 2.0525 | 2500 | 0.4391 |
| 0.4492 | 2.1346 | 2600 | 0.4383 |
| 0.4032 | 2.2167 | 2700 | 0.4381 |
| 0.381 | 2.2989 | 2800 | 0.4372 |
| 0.4104 | 2.3810 | 2900 | 0.4366 |
| 0.3933 | 2.4631 | 3000 | 0.4360 |
| 0.4282 | 2.5452 | 3100 | 0.4359 |
| 0.3708 | 2.6273 | 3200 | 0.4356 |
| 0.3629 | 2.7094 | 3300 | 0.4349 |
| 0.4044 | 2.7915 | 3400 | 0.4349 |
| 0.3941 | 2.8736 | 3500 | 0.4348 |
| 0.4078 | 2.9557 | 3600 | 0.4348 |
Base model
Qwen/Qwen2.5-3B