Add SFT warm start before GRPO and DB connectivity init check c2dc160 unverified Claude commited on 2 days ago
Add Supabase upload for training results (Storage + DB) 28bcb40 unverified Claude commited on 2 days ago
Improve reward function to break refuse-everything local minimum and scale training bd8220a unverified Claude commited on 2 days ago
Clean up dead code, unused imports, and move hardcoded values to config.yaml 3dc48b7 unverified Claude commited on 2 days ago
Centralize all training params in config.yaml (single source of truth) 4e2b74e unverified Claude commited on 2 days ago