Smol-reason

Sweaterdog 's Collections

updated Mar 2

My first ever usage of GRPO fine tuning techniques, information learned from this model will be used on future Andy models.