refactor: Restructure project with unified CLI and fix RL training gaps 6d32faf RoyAalekh committed on Nov 27, 2025