SystemTruth / train

Commit History

finalization: blog + README + execution rewrite, drop 3B + openclaw shim
0058c94

Madhav189 commited on

training: precision rewrite to prevent SFT collapse + GRPO variance starvation
42ab8f0

Madhav189 Claude Opus 4.7 (1M context) commited on

hackathon sprint: grader collapse + coliseum rename + training pipeline
c9baa73

Madhav189 Claude Opus 4.7 (1M context) commited on

data: 30 expert episodes + training notebook
f337985

Madhav189 commited on

GRPO notebook: trajectory snapshots, Drive checkpointing, comparison table
7b4e2df

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Replace estimated baseline numbers with eval_sweep-verified ones
c8cdf7c

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Auto-fetch seed dataset + Open-in-Colab badges
0ef5181

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Wire Colab Secrets bridge into both training notebooks
bcfbf5f

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Expand seed_combined.jsonl to 200 samples across 3 teachers
7ee873a

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Add Groq driver, GRPO notebook, and eval-sweep harness
0d3e723

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Add Llama-3.3-70B teacher episodes + merge into seed_combined.jsonl
9c00699

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Add Fireworks driver for teacher-data collection
4d6c819

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Document the 4-step empty-prompt filter in seed README
8dfbdb7

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Rewire sanity_run.ipynb to SFT on the 39-sample Claude seed
0e63c79

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Add vibe-coded SaaS scenarios + Claude-teacher seed dataset
f749d7b

dakshdoesdev Claude Opus 4.7 (1M context) commited on

Harden env + ship Claude skill, OpenClaw-RL shim, training pipeline
0bf41ea

dakshdoesdev Claude Opus 4.7 (1M context) commited on