Commit History

fix: reduce grpo training runtime
e058108
verified

Siddh12334 committed on

fix: parse chat completions in rewards
25befe1
verified

Siddh12334 committed on

fix: emit prompt column for grpo trainer
f4ae239
verified

Siddh12334 committed on

fix: add grpo warnings compatibility attr
26b04fa
verified

Siddh12334 committed on

fix: avoid build-time root-owned cache env dirs
679b8f2
verified

Siddh12334 committed on

fix: select writable cache root at startup
bb169f7
verified

Siddh12334 committed on

fix: set writable cache env vars for training space
1f75c91
verified

Siddh12334 committed on

fix: write grpo checkpoints to tmp in space
7dd1ae3
verified

Siddh12334 committed on

fix: use writable runtime dirs before training imports
c5f23d9
verified

Siddh12334 committed on

chore: update comment on torchao shim strategy
d1de63d
verified

Siddh12334 committed on

fix: comprehensive torchao dtype + register_constant shims
b696295
verified

Siddh12334 committed on

revert: back to stable cu121 torch - patch applied at runtime
54ad149
verified

Siddh12334 committed on

fix: shim register_constant before importing unsloth (no-op safe for non-torchao training)
471dca3
verified

Siddh12334 committed on

fix: use PyTorch nightly - torchao needs register_constant not in stable 2.6
753f361
verified

Siddh12334 committed on

fix: switch to cu124 wheels for torch==2.6.0 (cu121 tops out at 2.5.1)
2ec4be7
verified

Siddh12334 committed on

fix: upgrade torch to 2.6.0 - torchao requires torch.int1 added in 2.6
8d3ffa5
verified

Siddh12334 committed on

fix: verbose step-by-step logs, catch BaseException with full traceback
9587508
verified

Siddh12334 committed on

fix: capture transformers/TRL logging into UI via logging.Handler
a3f7eb5
verified

Siddh12334 committed on

fix: install unsloth_zoo before unsloth
48413d4
verified

Siddh12334 committed on

fix: remove runtime unsloth install, verify import at startup
a4ad061
verified

Siddh12334 committed on

fix: install unsloth in Dockerfile at build time (not at click-time)
1a15203
verified

Siddh12334 committed on

fix: read pip output in chunks splitting on \r and \n
fbb358f
verified

Siddh12334 committed on

feat: stream pip output live to UI logs during unsloth install
ac34a90
verified

Siddh12334 committed on

fix: install unsloth to /app/pkgs (writable), detect torch version for git extra
9f4f5dc
verified

Siddh12334 committed on

fix: capture unsloth install errors, try PyPI then git fallback
5bd6953
verified

Siddh12334 committed on

fix: replace demo.load(every=) with gr.Timer for Gradio 4.x compat
48cfe76
verified

Siddh12334 committed on

chore: upload pre-generated facts.json to avoid network dependency at build
04052e6
verified

Siddh12334 committed on

fix: space_runner installs unsloth at click-time on live GPU
9bd2d99
verified

Siddh12334 committed on

fix: use python:3.11-slim base, install unsloth at runtime
30bfe4c
verified

Siddh12334 committed on

feat: training space with manual start UI
204fa23
verified

Siddh12334 committed on

initial commit
5d65fb7
verified

Siddh12334 committed on