ui: tier cards — black background, new tier names (TRIAGE/STRATEGY/OPERATIONS) 2c6a812 Madhav189 committed on 28 days ago
SystemTruth rebrand: bigger UI, new diagrams, theme-cohesive HF Space 6583a07 Madhav189 committed on 28 days ago
lfs: track docs/blog/*.png through git-lfs for HF Space compat e8774c9 Madhav189 committed on 28 days ago
finalization: blog + README + execution rewrite, drop 3B + openclaw shim 0058c94 Madhav189 committed on 28 days ago
notebook: broaden Cell 10 SFT-only load except clause c0ea16e Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
notebook: fix GRPO prompts — apply chat template before passing to TRL 215c8ad Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
notebook: install openenv-core (and other repo deps) in Cell 0 091927e Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
notebook: switch Cell 0 to Unsloth's official uv pattern + harden Cells 10-12 b2632ec Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
training: precision rewrite to prevent SFT collapse + GRPO variance starvation 42ab8f0 Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
notebook: use Qwen2.5-3B-Instruct (has chat template) not the base model 2214e76 Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
notebook: align with Unsloth-recommended TRL 0.22.2 65d2643 Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
notebook: show pip install progress (drop -q flag) 93e560c Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
notebook: rewrite for robustness and resumability 17dba36 Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
notebook: fix TRL >=1.0 API breakage 32e423b Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
hackathon sprint: grader collapse + coliseum rename + training pipeline c9baa73 Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
honesty pass: README + execution.md + manifests now match the codebase 8e274ca Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
UI rewrite: visual-spec terminal + held-out eval streamer 48a148f Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
UI Build Addendum: mount Gradio at /, preserve full FastAPI surface 517c2d9 Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
24-hour sprint: all 3 tiers runnable + MCP dual-route + terminal-style Gradio UI ad2cbc4 Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
docs: expand README + execution.md into comprehensive references 66a9e6b Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
Tier-escalating sre-gym v3.0 — compute → horizon → realism 2733f3f Madhav189 Claude Opus 4.7 (1M context) committed on 29 days ago
GRPO notebook: trajectory snapshots, Drive checkpointing, comparison table 7b4e2df dakshdoesdev Claude Opus 4.7 (1M context) committed on 29 days ago
Replace estimated baseline numbers with eval_sweep-verified ones c8cdf7c dakshdoesdev Claude Opus 4.7 (1M context) committed on 29 days ago
Position vs OpenEnv-hackathon competition + add baseline grid 47be627 dakshdoesdev Claude Opus 4.7 (1M context) committed on 29 days ago
Auto-fetch seed dataset + Open-in-Colab badges 0ef5181 dakshdoesdev Claude Opus 4.7 (1M context) committed on 30 days ago
Wire Colab Secrets bridge into both training notebooks bcfbf5f dakshdoesdev Claude Opus 4.7 (1M context) committed on 30 days ago
Expand seed_combined.jsonl to 200 samples across 3 teachers 7ee873a dakshdoesdev Claude Opus 4.7 (1M context) committed on 30 days ago
Document expanded training pipeline in README f7c833c dakshdoesdev Claude Opus 4.7 (1M context) committed on 30 days ago
Add Groq driver, GRPO notebook, and eval-sweep harness 0d3e723 dakshdoesdev Claude Opus 4.7 (1M context) committed on 30 days ago
Add Llama-3.3-70B teacher episodes + merge into seed_combined.jsonl 9c00699 dakshdoesdev Claude Opus 4.7 (1M context) committed on 30 days ago
Add Fireworks driver for teacher-data collection 4d6c819 dakshdoesdev Claude Opus 4.7 (1M context) committed on 30 days ago
Document the 4-step empty-prompt filter in seed README 8dfbdb7 dakshdoesdev Claude Opus 4.7 (1M context) committed on 30 days ago
Rewire sanity_run.ipynb to SFT on the 39-sample Claude seed 0e63c79 dakshdoesdev Claude Opus 4.7 (1M context) committed on 30 days ago
Rewrite README as comprehensive hackathon landing page 209017c dakshdoesdev Claude Opus 4.7 (1M context) committed on about 1 month ago
Add vibe-coded SaaS scenarios + Claude-teacher seed dataset f749d7b dakshdoesdev Claude Opus 4.7 (1M context) committed on about 1 month ago
Harden env + ship Claude skill, OpenClaw-RL shim, training pipeline 0bf41ea dakshdoesdev Claude Opus 4.7 (1M context) committed on about 1 month ago
Initial commit of Unified Incident Env v2: Honest SRE Simulator f12569b dakshdoesdev committed on about 1 month ago