securereview / training_space

Commit History

Add SFT→GRPO hybrid pipeline, 60+ scenarios, semantic graders, full results
d2a68fa

sam25kat Claude Opus 4.7 (1M context) committed on

Add HF training Space: Gradio UI + GRPO train script
443f900

sam25kat Claude Sonnet 4.6 committed on