optigami/trainer/__init__.py
sissississi
Add GRPO trainer scaffold with mock environment
aa44758
0 Bytes