Fine tuning experiment details at https://github.com/Yeok-c/grpo-gsm8k-demo
- Downloads last month
- -
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Fine tuning experiment details at https://github.com/Yeok-c/grpo-gsm8k-demo