| library_name: transformers | |
| tags: [] | |
| # experiment_2_8b-fp16 | |
| Another experimental train w/ unsloth. This time, roughly 0.6 epochs of the cleaned c2-logs. My metaparams are probably bad, since the loss-value was super weird at the end. Also uploaded another version in the `checkpoint-3500`-branch that may mitigate some of that. |