cannon-and-wall / training

Commit History

fix: Gradio mount, verifier stage3, poc_history clear, real notebook
5844c1d

CystronCode commited on

rewrite: training notebook - drop Unsloth, use transformers+peft+bitsandbytes
8cba85d

CystronCode commited on

feat: GRPO training, fixed curriculum stages 2+3, AST verifier, leaderboard UI, openenv schema
da648a3

CystronCode commited on

fix: remove 3 debug cells from notebook
592bdde

CystronCode commited on

feat: add real training notebook from Colab run
fd58a6d

CystronCode commited on

initial deploy — jairaj files, teammate files pending
9556146

CystronCode commited on