Large Baseline Extension Launchers
This folder contains launch scripts for the non-location baseline multi-output runs.
launch_nonlocation_pipeline.shlaunch_nonlocation_sft.shlaunch_nonlocation_grpo.sh
The main entry point for a full staged resume run is launch_nonlocation_pipeline.sh.
Useful environment variables:
MIN_STAGEMAX_STAGENUM_PROCESSESGPU_IDSBOOTSTRAP_ADAPTER_DIROUTPUT_ROOTRUN_TAGLIMIT_TRAIN_ROWSWANDB_MODEWANDB_ENTITY
Example:
MIN_STAGE=3 \
MAX_STAGE=5 \
NUM_PROCESSES=8 \
GPU_IDS=0,1,2,3,4,5,6,7 \
BOOTSTRAP_ADAPTER_DIR=/path/to/stage02_grpo \
WANDB_MODE=online \
WANDB_ENTITY=training-dynamics \
bash launch_nonlocation_pipeline.sh