Commit History

Add interactive frontend UI
b0c701c

aagparekh committed on

Update runtime requirements
5c277bb

aagparekh committed on

Merge origin/main into main
cbbb3c4

aagparekh committed on

fix: add root status endpoint
808bf4d

Siddh12334 committed on

docs: trim readme to submission essentials
44b2880

Siddh12334 committed on

docs: restore hugging face space metadata
228f4a9

Siddh12334 committed on

Update README.md by removing unnecessary metadata
63b4bb4
unverified

Siddh committed on

Remove author submission line from BLOG.md
7c4f887
unverified

Siddh committed on

docs: replace private wandb link with exported logs
eaf43b6

Siddh12334 committed on

docs: remove internal checklist from readme
3590d38

Siddh12334 committed on

docs: add separate mini blog writeup
86ab1d8

Siddh12334 committed on

docs: rewrite readme as mini blog
1388ca9

Siddh12334 committed on

docs: add final submission readme and training logs
6a72222

Siddh12334 committed on

fix: load model snapshots from writable cache
b3382c5

Siddh12334 committed on

fix: load peft base model with explicit cache
40bdaf7

Siddh12334 committed on

fix: pass explicit cache dir to model loader
3ca7ae0

Siddh12334 committed on

fix: set model cache dirs before torch imports
264201d

Siddh12334 committed on

feat: add optional trained model inference to env space
46c2f98

Siddh12334 committed on

fix: reduce grpo training runtime
842caac

Siddh12334 committed on

fix: parse chat completions in rewards
6f3d9d6

Siddh12334 committed on

fix: emit prompt column for grpo trainer
6ceec85

Siddh12334 committed on

fix: add grpo warnings compatibility attr
633d604

Siddh12334 committed on

fix: choose writable cache root at space startup
3125dc1

Siddh12334 committed on

fix: use writable runtime dirs in training space
eafb471

Siddh12334 committed on

fix: patch torchao dtype imports for unsloth
a262689

Siddh12334 committed on

fix: shim torch._pytree.register_constant for torchao compat
98317c2

Siddh12334 Claude Sonnet 4.6 committed on

fix: use PyTorch nightly for torchao compat
a73c9e7

Siddh12334 Claude Sonnet 4.6 committed on

fix: switch to cu124 wheel index for torch 2.6.0
d98c86c

Siddh12334 Claude Sonnet 4.6 committed on

fix: pin torch==2.6.0 to satisfy torchao torch.int1 requirement
b42845a

Siddh12334 Claude Sonnet 4.6 committed on

fix: verbose per-step logs and BaseException catch in _run_training
6fc6438

Siddh12334 Claude Sonnet 4.6 committed on

fix: capture transformers/TRL/unsloth logs in Gradio UI
0b6be50

Siddh12334 Claude Sonnet 4.6 committed on

fix: add unsloth_zoo as explicit dependency before unsloth
6f1b59d

Siddh12334 Claude Sonnet 4.6 committed on

fix: install unsloth in Docker image at build time
dee46e3

Siddh12334 Claude Sonnet 4.6 committed on

fix: chunk-read pip output splitting on \r and \n
7ac7eb8

Siddh12334 Claude Sonnet 4.6 committed on

feat: stream pip install output live to UI logs
d79940c

Siddh12334 Claude Sonnet 4.6 committed on

fix: install unsloth to /app/pkgs to bypass HF Space permission issue
722cd66

Siddh12334 Claude Sonnet 4.6 committed on

fix: robust unsloth install - PyPI first, git fallback, log errors
fa5785a

Siddh12334 Claude Sonnet 4.6 committed on

feat: add A100 training Space with manual-trigger Gradio UI
77d8bcf

Siddh12334 Claude Sonnet 4.6 committed on

feat: optimize Space - dockerignore, dotenv secrets, multi-worker uvicorn
0eb0abc

Siddh12334 committed on

fix: use lean server requirements in Dockerfile (exclude unsloth/torch)
8c7e984

Siddh12334 committed on

chore: add HF Space frontmatter and README draft
40b1abb

Siddh12334 committed on

feat: bulletproof GRPO training script + Colab notebook
67601e4

Siddh12334 committed on

feat: rewrite env to be fully openenv-core compliant
7a8a0f0

Siddh12334 committed on

chore: update baseline results with real facts (450 QA pairs)
4e71c52

Siddh12334 committed on

feat: implement data pipeline
d661639

aagparekh committed on

feat: baseline eval and GRPO training script
5f54992

Siddh12334 committed on

Updated .gitignore
ef79a5f

Siddh12334 committed on

chore: add CLAUDE.md, README, update gitignore
c12488d

Siddh12334 committed on

feat: implement actions, reward, env, server
18fd5bc

Siddh12334 committed on

chore: pin dependencies
d09377a

Siddh12334 committed on