crumb/Clean-Instruct-440k
Viewer • Updated • 444k • 137 • 8
This model is a fine-tuned version of bigcode/starcoderbase on the crumb/Clean-Instruct-440k dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 1.4135 | 0.99 | 55 | 1.3103 |
| 1.3429 | 2.0 | 111 | 1.2430 |
| 1.2432 | 2.98 | 166 | 1.2283 |
| 1.2808 | 3.96 | 220 | 1.2266 |