README/BLOG: feature canonical 110-step GRPO numbers as the headline ec275fe Viani commited on 15 days ago
Hackathon cleanup: drop assets/ and scripts/ (notebook embeds curves + eval); patch README/BLOG to remove dead refs 504c667 verified Viani commited on 15 days ago
HF Space: 4-dept SimMart env + 1.5B SFT+GRPO training (hackathon submission) 5c35138 Viani commited on 15 days ago