Commit History

fix: reduce grpo training runtime
e058108
verified

Siddh12334 committed on

fix: parse chat completions in rewards
25befe1
verified

Siddh12334 committed on

fix: emit prompt column for grpo trainer
f4ae239
verified

Siddh12334 committed on

fix: add grpo warnings compatibility attr
26b04fa
verified

Siddh12334 committed on

fix: select writable cache root at startup
bb169f7
verified

Siddh12334 committed on

fix: write grpo checkpoints to tmp in space
7dd1ae3
verified

Siddh12334 committed on

fix: use writable runtime dirs before training imports
c5f23d9
verified

Siddh12334 committed on

fix: comprehensive torchao dtype + register_constant shims
b696295
verified

Siddh12334 committed on

fix: shim register_constant before importing unsloth (no-op safe for non-torchao training)
471dca3
verified

Siddh12334 committed on

fix: verbose step-by-step logs, catch BaseException with full traceback
9587508
verified

Siddh12334 committed on

fix: capture transformers/TRL logging into UI via logging.Handler
a3f7eb5
verified

Siddh12334 committed on

fix: remove runtime unsloth install, verify import at startup
a4ad061
verified

Siddh12334 committed on

fix: read pip output in chunks splitting on \r and \n
fbb358f
verified

Siddh12334 committed on

feat: stream pip output live to UI logs during unsloth install
ac34a90
verified

Siddh12334 committed on

fix: install unsloth to /app/pkgs (writable), detect torch version for git extra
9f4f5dc
verified

Siddh12334 committed on

fix: capture unsloth install errors, try PyPI then git fallback
5bd6953
verified

Siddh12334 committed on

fix: replace demo.load(every=) with gr.Timer for Gradio 4.x compat
48cfe76
verified

Siddh12334 committed on

fix: space_runner installs unsloth at click-time on live GPU
9bd2d99
verified

Siddh12334 committed on

feat: training space with manual start UI
204fa23
verified

Siddh12334 committed on
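
Illustration for a3f7eb5 ("capture transformers/TRL logging into UI via logging.Handler"): the sketch below shows one way a logging.Handler subclass could buffer recent log lines in memory so a UI can poll and display them. It is not taken from the repository. The class name UILogHandler, the attach helper, the buffer size, and the choice of the "transformers" and "trl" logger names are all assumptions made for illustration.

```python
import logging
from collections import deque


class UILogHandler(logging.Handler):
    """Hypothetical handler that keeps only the most recent log lines."""

    def __init__(self, max_lines=500):
        super().__init__()
        # A bounded deque silently drops the oldest line once it is full.
        self.lines = deque(maxlen=max_lines)

    def emit(self, record):
        # format() applies this handler's formatter (or the default one).
        self.lines.append(self.format(record))

    def text(self):
        # Single string that a UI text box could display.
        return "\n".join(self.lines)


def attach(handler, logger_names=("transformers", "trl")):
    """Attach the handler to each named library logger (names are assumed)."""
    for name in logger_names:
        logger = logging.getLogger(name)
        logger.setLevel(logging.INFO)
        logger.addHandler(handler)


ui_handler = UILogHandler(max_lines=3)
ui_handler.setFormatter(logging.Formatter("%(name)s | %(levelname)s | %(message)s"))
attach(ui_handler)

# Records from child loggers such as "trl.trainer" propagate up to "trl".
logging.getLogger("trl.trainer").info("step 1 done")
```

A UI callback could then return ui_handler.text() on each refresh. The bounded buffer keeps memory use flat during long runs, at the cost of discarding older lines.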