Commit History

ui: tier cards — black background, new tier names (TRIAGE/STRATEGY/OPERATIONS)
2c6a812
Running

Madhav189 commited on

SystemTruth rebrand: bigger UI, new diagrams, theme-cohesive HF Space
6583a07

Madhav189 commited on

lfs: track docs/blog/*.png through git-lfs for HF Space compat
e8774c9

Madhav189 commited on

finalization: blog + README + execution rewrite, drop 3B + openclaw shim
0058c94

Madhav189 commited on

notebook: broaden Cell 10 SFT-only load except clause
c0ea16e

Madhav189 Claude Opus 4.7 (1M context) commited on

notebook: fix GRPO prompts — apply chat template before passing to TRL
215c8ad

Madhav189 Claude Opus 4.7 (1M context) commited on

notebook: install openenv-core (and other repo deps) in Cell 0
091927e

Madhav189 Claude Opus 4.7 (1M context) commited on

notebook: switch Cell 0 to Unsloth's official uv pattern + harden Cells 10-12
b2632ec

Madhav189 Claude Opus 4.7 (1M context) commited on

training: precision rewrite to prevent SFT collapse + GRPO variance starvation
42ab8f0

Madhav189 Claude Opus 4.7 (1M context) commited on

notebook: use Qwen2.5-3B-Instruct (has chat template) not the base model
2214e76

Madhav189 Claude Opus 4.7 (1M context) commited on

notebook: align with Unsloth-recommended TRL 0.22.2
65d2643

Madhav189 Claude Opus 4.7 (1M context) commited on

notebook: show pip install progress (drop -q flag)
93e560c

Madhav189 Claude Opus 4.7 (1M context) commited on

notebook: rewrite for robustness and resumability
17dba36

Madhav189 Claude Opus 4.7 (1M context) commited on

notebook: fix TRL >=1.0 API breakage
32e423b

Madhav189 Claude Opus 4.7 (1M context) commited on

hackathon sprint: grader collapse + coliseum rename + training pipeline
c9baa73

Madhav189 Claude Opus 4.7 (1M context) commited on

data: 30 expert episodes + training notebook
f337985

Madhav189 commited on

honesty pass: README + execution.md + manifests now match the codebase
8e274ca

Madhav189 Claude Opus 4.7 (1M context) commited on

UI rewrite: visual-spec terminal + held-out eval streamer
48a148f

Madhav189 Claude Opus 4.7 (1M context) commited on

UI Build Addendum: mount Gradio at /, preserve full FastAPI surface
517c2d9

Madhav189 Claude Opus 4.7 (1M context) commited on

24-hour sprint: all 3 tiers runnable + MCP dual-route + terminal-style Gradio UI
ad2cbc4

Madhav189 Claude Opus 4.7 (1M context) commited on

docs: expand README + execution.md into comprehensive references
66a9e6b

Madhav189 Claude Opus 4.7 (1M context) commited on

Tier-escalating sre-gym v3.0 — compute → horizon → realism
2733f3f

Madhav189 Claude Opus 4.7 (1M context) commited on

GRPO notebook: trajectory snapshots, Drive checkpointing, comparison table
7b4e2df

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Replace estimated baseline numbers with eval_sweep-verified ones
c8cdf7c

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Position vs OpenEnv-hackathon competition + add baseline grid
47be627

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Auto-fetch seed dataset + Open-in-Colab badges
0ef5181

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Wire Colab Secrets bridge into both training notebooks
bcfbf5f

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Expand seed_combined.jsonl to 200 samples across 3 teachers
7ee873a

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Document expanded training pipeline in README
f7c833c

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Add Groq driver, GRPO notebook, and eval-sweep harness
0d3e723

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Add Llama-3.3-70B teacher episodes + merge into seed_combined.jsonl
9c00699

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Add Fireworks driver for teacher-data collection
4d6c819

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Document the 4-step empty-prompt filter in seed README
8dfbdb7

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Rewire sanity_run.ipynb to SFT on the 39-sample Claude seed
0e63c79

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Rewrite README as comprehensive hackathon landing page
209017c

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Add vibe-coded SaaS scenarios + Claude-teacher seed dataset
f749d7b

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Harden env + ship Claude skill, OpenClaw-RL shim, training pipeline
0bf41ea

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Rename project to sre-engineer-llm
c8bef53

dakshdoesdev commited on

Initial commit of Unified Incident Env v2: Honest SRE Simulator
f12569b

dakshdoesdev commited on

Require token/model in UI start and add run summary charts
0d44d51

Madhav189 commited on

Update README with current links, UI, and latest model scores
cad6640

Madhav189 commited on

Make simple UI run full scenario suite like terminal flow
3055e7c

Madhav189 commited on

Improve simple console auto-run and remove token persistence
5a51ee6

Madhav189 commited on

Fix simple console reset done/reward logging
7c6c085

Madhav189 commited on

Add simple terminal-style UI as default app entry
7eca7b4

Madhav189 commited on

Fix /step action validation and query_logs shorthand
b887bf1

Madhav189 commited on

Autofill required web-step fields from valid example
cd2430f

Madhav189 commited on

Fix web step payload compatibility for Space UI
ba3c655

Madhav189 commited on

Prepare competition-ready submission
0126492

Madhav189 commited on

Keep public task scores inside strict validator bounds
dbea29d

dakshdoesdev commited on