Commit History

env-var step overrides for ad-hoc test runs
9189c00
verified

Laksh718 committed on

fix: lora_dropout=0.05 + HF gradient checkpoint to bypass unsloth GRPO dtype bug; chdir to /tmp before merged push
4e5191a
verified

Laksh718 committed on

resilient training: SFT push insurance, dtype workaround, try/except GRPO, SKIP_GRPO env
07c17a2
verified

Laksh718 committed on

fix: writable OUT_DIR (/tmp), chat-template text column, push_history_to_hub
31f6ba9
verified

Laksh718 committed on

fix: render chat template into text column for new unsloth/TRL SFTTrainer
8e3d067
verified

Laksh718 committed on

fix: add unsloth_zoo to bootstrap step
ad656e7
verified

Laksh718 committed on

deploy v5 (long mode, a100x4)
1ea1445
verified

Laksh718 committed on

deploy v5 (long mode, a100x4)
e8809ac
verified

Laksh718 committed on

deploy v5 (long mode, a100x4)
1363eb9
verified

Laksh718 committed on

deploy v5 (long mode, a100x4)
a74d8c3
verified

Laksh718 committed on

deploy v5 (long mode, a100x4)
c4f0099
verified

Laksh718 committed on

deploy v5 (long mode, a100-x4)
9920e5d
verified

Laksh718 committed on

deploy v5 (long mode, a100-large)
64c9eee
verified

Laksh718 committed on

initial commit
27556e5
verified

Laksh718 committed on