feat: commit training evidence, update README with real scores, add demo scripts 8204dc0 adityss commited on 24 days ago
feat: add baseline evaluation tools and demo scripts for RL performance comparison c395f6a adityss commited on 24 days ago