Spaces:
Sleeping
Sleeping
HF Jobs bootstrap diagnostics
bootstrap_probe.sh is a cheap sanity check for the HF Jobs runner before
launching expensive Cortex GRPO jobs. It does not install training dependencies
or load models. It only verifies that the container shell starts, logs stream,
basic environment variables are present, the filesystem exists, optional GPU
tools are visible, Python starts, and the deployed env health endpoint is
reachable.
Run this before spending on a full GPU launch:
hf jobs run --flavor cpu-upgrade --secrets HF_TOKEN --timeout 5m \
-e ENV_URL=https://angshuman28-crisisworldcortex.hf.space \
ghcr.io/astral-sh/uv:python3.12-bookworm \
bash -c "git clone --depth=1 https://huggingface.co/spaces/Angshuman28/CrisisWorldCortex /w && bash /w/training/scripts/bootstrap_probe.sh"
Expected logs should include every marker from BOOTSTRAP STAGE 1 through
BOOTSTRAP STAGE 7. If logs are empty or stop before stage 1, the failure is
in HF Jobs scheduling/container startup, not in the Cortex Python trainer.