Spaces:

Madhav189
/

SystemTruth

Running

App Files Files Community

Commit History

ui: tier cards — black background, new tier names (TRIAGE/STRATEGY/OPERATIONS)

2c6a812

Running

Madhav189 commited on Apr 26

SystemTruth rebrand: bigger UI, new diagrams, theme-cohesive HF Space

6583a07

Madhav189 commited on Apr 26

lfs: track docs/blog/*.png through git-lfs for HF Space compat

e8774c9

Madhav189 commited on Apr 26

finalization: blog + README + execution rewrite, drop 3B + openclaw shim

0058c94

Madhav189 commited on Apr 26

notebook: broaden Cell 10 SFT-only load except clause

c0ea16e

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 26

notebook: fix GRPO prompts — apply chat template before passing to TRL

215c8ad

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 26

notebook: install openenv-core (and other repo deps) in Cell 0

091927e

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 26

notebook: switch Cell 0 to Unsloth's official uv pattern + harden Cells 10-12

b2632ec

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 26

training: precision rewrite to prevent SFT collapse + GRPO variance starvation

42ab8f0

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 26

notebook: use Qwen2.5-3B-Instruct (has chat template) not the base model

2214e76

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 25

notebook: align with Unsloth-recommended TRL 0.22.2

65d2643

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 25

notebook: show pip install progress (drop -q flag)

93e560c

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 25

notebook: rewrite for robustness and resumability

17dba36

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 25

notebook: fix TRL >=1.0 API breakage

32e423b

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 25

hackathon sprint: grader collapse + coliseum rename + training pipeline

c9baa73

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 25

data: 30 expert episodes + training notebook

f337985

Madhav189 commited on Apr 25

honesty pass: README + execution.md + manifests now match the codebase

8e274ca

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 25

UI rewrite: visual-spec terminal + held-out eval streamer

48a148f

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 25

UI Build Addendum: mount Gradio at /, preserve full FastAPI surface

517c2d9

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 25

24-hour sprint: all 3 tiers runnable + MCP dual-route + terminal-style Gradio UI

ad2cbc4

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 25

docs: expand README + execution.md into comprehensive references

66a9e6b

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 25

Tier-escalating sre-gym v3.0 — compute → horizon → realism

2733f3f

Madhav189 Claude Opus 4.7 (1M context) commited on Apr 25

GRPO notebook: trajectory snapshots, Drive checkpointing, comparison table

7b4e2df

dakshdoesdev Claude Opus 4.7 (1M context) commited on Apr 25

Replace estimated baseline numbers with eval_sweep-verified ones

c8cdf7c

dakshdoesdev Claude Opus 4.7 (1M context) commited on Apr 25

Position vs OpenEnv-hackathon competition + add baseline grid

47be627

dakshdoesdev Claude Opus 4.7 (1M context) commited on Apr 25

Auto-fetch seed dataset + Open-in-Colab badges

0ef5181

dakshdoesdev Claude Opus 4.7 (1M context) commited on Apr 25

Wire Colab Secrets bridge into both training notebooks

bcfbf5f

dakshdoesdev Claude Opus 4.7 (1M context) commited on Apr 25

Expand seed_combined.jsonl to 200 samples across 3 teachers

7ee873a

dakshdoesdev Claude Opus 4.7 (1M context) commited on Apr 24

Document expanded training pipeline in README

f7c833c

dakshdoesdev Claude Opus 4.7 (1M context) commited on Apr 24

Add Groq driver, GRPO notebook, and eval-sweep harness

0d3e723

dakshdoesdev Claude Opus 4.7 (1M context) commited on Apr 24

Add Llama-3.3-70B teacher episodes + merge into seed_combined.jsonl

9c00699

dakshdoesdev Claude Opus 4.7 (1M context) commited on Apr 24

Add Fireworks driver for teacher-data collection

4d6c819

dakshdoesdev Claude Opus 4.7 (1M context) commited on Apr 24

Document the 4-step empty-prompt filter in seed README

8dfbdb7

dakshdoesdev Claude Opus 4.7 (1M context) commited on Apr 24

Rewire sanity_run.ipynb to SFT on the 39-sample Claude seed

0e63c79

dakshdoesdev Claude Opus 4.7 (1M context) commited on Apr 24

Rewrite README as comprehensive hackathon landing page

209017c

dakshdoesdev Claude Opus 4.7 (1M context) commited on Apr 24

Add vibe-coded SaaS scenarios + Claude-teacher seed dataset

f749d7b

dakshdoesdev Claude Opus 4.7 (1M context) commited on Apr 24

Harden env + ship Claude skill, OpenClaw-RL shim, training pipeline

0bf41ea

dakshdoesdev Claude Opus 4.7 (1M context) commited on Apr 23

Rename project to sre-engineer-llm

c8bef53

dakshdoesdev commited on Apr 23

Initial commit of Unified Incident Env v2: Honest SRE Simulator

f12569b

dakshdoesdev commited on Apr 23

Require token/model in UI start and add run summary charts

0d44d51

Madhav189 commited on Apr 8

Update README with current links, UI, and latest model scores

cad6640

Madhav189 commited on Apr 8

Make simple UI run full scenario suite like terminal flow

3055e7c

Madhav189 commited on Apr 8

Improve simple console auto-run and remove token persistence

5a51ee6

Madhav189 commited on Apr 8

Fix simple console reset done/reward logging

7c6c085

Madhav189 commited on Apr 8

Add simple terminal-style UI as default app entry

7eca7b4

Madhav189 commited on Apr 8

Fix /step action validation and query_logs shorthand

b887bf1

Madhav189 commited on Apr 8

Autofill required web-step fields from valid example

cd2430f

Madhav189 commited on Apr 8

Fix web step payload compatibility for Space UI

ba3c655

Madhav189 commited on Apr 8

Prepare competition-ready submission

0126492

Madhav189 commited on Apr 8

Keep public task scores inside strict validator bounds

dbea29d

dakshdoesdev commited on Apr 8