assets: headline chart, architecture overview, training pipeline (for README) 7001779 verified SaiManish123 commited on 16 days ago
readme: project description, results, training logs, links 7a3ba78 verified SaiManish123 commited on 16 days ago
Replace SFT reward curve with baseline-anchored learning curve (tool-aware baseline → checkpoint-40 … final) 10fa2de verified SaiManish123 commited on 16 days ago
Add HF Job stdout log for grpo_polymorphic_zero_day_1_5b 51ebc24 verified SaiManish123 commited on 16 days ago
Upload sft_worldsplit_1_5b/adaptshield_sft_worldsplit.summary.json with huggingface_hub 2cddefe verified SaiManish123 commited on 17 days ago