Commit History

Update blog with storytelling introduction and remove slides.pdf
8bfa130

shivam2k3 committed on

Update README with trained model links, polish blog, add model card
5cbde7b

shivam2k3 committed on

Add finish_from_stage4.sh: eval, bake, Hub upload without re-training
c4c71a8

shivam2k3 committed on

add scripts/run_resume_stage3.sh
76ba348

shivam2k3 committed on

resume script: fix dataset module names + opt-out of hf_transfer
1f4e468

shivam2k3 committed on

add scripts/run_resume_stage2.sh
3165871

shivam2k3 committed on

run_full_pipeline.sh: install hf_transfer + unsloth_zoo extras
9e53c0c

shivam2k3 committed on

run_full_pipeline.sh: pin trl/datasets/tyro to unsloth_zoo's window
60c2cf2

shivam2k3 committed on

training: push adapters to HF Hub after SFT + each GRPO stage
dc2d89f

shivam2k3 committed on

grpo: skip-SFT continuation script + completion-shape fix
19270a9

shivam2k3 committed on

train_grpo: import unsloth at module top before trl
0a5fc17

shivam2k3 committed on

sft_warmstart: import unsloth first; batched formatting_func
99d0d29

shivam2k3 committed on

sft_warmstart: hardcode Qwen2.5 eos_token <|im_end|>
b42f9bf

shivam2k3 committed on

train_grpo: same eos placeholder fix for GRPO
d2bb443

shivam2k3 committed on

sft_warmstart: replace unsloth's <EOS_TOKEN> placeholder with <|im_end|>
67d026b

shivam2k3 committed on

sft_warmstart: pass real eos_token (trl 0.24 sentinel bug)
c3488dd

shivam2k3 committed on

sft_warmstart: trl 0.24 API (SFTConfig + processing_class)
a5f5c45

shivam2k3 committed on

run_full_pipeline.sh: pin transformers<5; drop xformers; +hf_hub
fa8525d

shivam2k3 committed on

run_full_pipeline.sh: lock torch+torchvision; --no-deps for unsloth
80c89ce

shivam2k3 committed on

run_full_pipeline.sh: install torchvision (unsloth import dep)
1c7e9d1

shivam2k3 committed on

run_full_pipeline.sh: explicit unsloth_zoo install
5b394da

shivam2k3 committed on

run_full_pipeline.sh: fix legacy 'future' import error on Py3.10
2cc7bf5

shivam2k3 committed on

run_full_pipeline.sh: add HF artifact upload step
30b1468

shivam2k3 committed on

train_grpo.ipynb: HF-Jupyter friendly clone + push cells
ddafb99

shivam2k3 committed on

Space metadata header for HF Spaces deploy
6ba5cca

shivam2k3 committed on

Add GET / -> /demo/ redirect for Space iframe
bf4094f

shivam2k3 committed on

README: mark Space + /demo as live, add Space row
38b3641

shivam2k3 committed on

Bump gradio to 5.x to satisfy fastapi 0.115 pin
ecfe060

shivam2k3 committed on

Bump httpx to >=0.28.1 to satisfy openenv-core 0.2.2+
cea0ed8

shivam2k3 committed on

Fix Space metadata for HF: emoji glyph + drop *.pdf LFS tracking
18f5303

shivam2k3 committed on

Fix README links and unignore placeholder eval plots
64649c4

shivam2k3 committed on

Remove binary result images
da392f9

shivam2k3 committed on

Merge branch 'main' of https://huggingface.co/shivam2k3/opensoc-env
474fe70

shivam2k3 committed on

initial commit
808f190

shivam2k3 committed on

OpenSOC v1
bb6a031

shivam2k3 committed on