opensoc-env / train

Commit History

training: push adapters to HF Hub after SFT + each GRPO stage
dc2d89f

shivam2k3 commited on

grpo: skip-SFT continuation script + completion-shape fix
19270a9

shivam2k3 commited on

train_grpo: import unsloth at module top before trl
0a5fc17

shivam2k3 commited on

sft_warmstart: import unsloth first; batched formatting_func
99d0d29

shivam2k3 commited on

sft_warmstart: hardcode Qwen2.5 eos_token <|im_end|>
b42f9bf

shivam2k3 commited on

train_grpo: same eos placeholder fix for GRPO
d2bb443

shivam2k3 commited on

sft_warmstart: replace unsloth's <EOS_TOKEN> placeholder with <|im_end|>
67d026b

shivam2k3 commited on

sft_warmstart: pass real eos_token (trl 0.24 sentinel bug)
c3488dd

shivam2k3 commited on

sft_warmstart: trl 0.24 API (SFTConfig + processing_class)
a5f5c45

shivam2k3 commited on

OpenSOC v1
bb6a031

shivam2k3 commited on