Trained with Unsloth
---
license: bsd
tags:
- unsloth
- trl
- grpo
---