Demo / training

Commit History

Add self-contained GRPO training notebook for HF Jupyter
1080341

Ajayyy00 Claude Sonnet 4.6 commited on

Add root health check route to fix HF 404
14a2669

Ajayyy00 commited on

Add alternating self-play training scaffolding.
292f6a5

Ajayyy00 commited on

Add GRPO training pipeline + remove shield emoji
4ed0f64

Ajayyy00 Claude Sonnet 4.6 commited on

Initial commit of CyberSOC upgraded RLVR environment
57e71f8

Ajayyy00 commited on