training: precision rewrite to prevent SFT collapse + GRPO variance starvation 42ab8f0 Madhav189 Claude Opus 4.7 (1M context) committed 30 days ago
hackathon sprint: grader collapse + coliseum rename + training pipeline c9baa73 Madhav189 Claude Opus 4.7 (1M context) committed 30 days ago
Replace estimated baseline numbers with eval_sweep-verified ones c8cdf7c dakshdoesdev Claude Opus 4.7 (1M context) committed about 1 month ago
Expand seed_combined.jsonl to 200 samples across 3 teachers 7ee873a dakshdoesdev Claude Opus 4.7 (1M context) committed about 1 month ago
Add Llama-3.3-70B teacher episodes + merge into seed_combined.jsonl 9c00699 dakshdoesdev Claude Opus 4.7 (1M context) committed about 1 month ago
Document the 4-step empty-prompt filter in seed README 8dfbdb7 dakshdoesdev Claude Opus 4.7 (1M context) committed about 1 month ago
Add vibe-coded SaaS scenarios + Claude-teacher seed dataset f749d7b dakshdoesdev Claude Opus 4.7 (1M context) committed on Apr 24