Source code available at https://github.com/phhusson/llm-rl/blob/main/grpo-tldr.py
- Downloads last month
- 128
Hardware compatibility
Log In to add your hardware
16-bit
Source code available at https://github.com/phhusson/llm-rl/blob/main/grpo-tldr.py
16-bit