This is the version of my model that I did Continued Pre Training on to try and infuse more flavor into it before SFT. It took an obnoxiously long time and I'm not sure how much it helped, if I'm being honest. But here's to learning!
- Downloads last month
- 4