sql-debug-env / launch_job.py

Commit History

Avoid whoami rate-limit during job submission
a1e637f

md896 commited on

Harden HF job token wiring and persist full training outputs
9552aaf

md896 commited on

Fix GRPO batch/generation mismatch: auto-adjust num_generations; set launcher default to 2.
af54ccd

md896 commited on

Fix HF Jobs bootstrap (pin transformers/trl, drop torchao stack); add reward and trainer JSONL logging; stabilize launch_job.
ceee0e3

md896 commited on

Fix: Mock vllm and llm_blender to stabilize GRPOTrainer in HF Jobs environment
bc20ef9

md896 commited on