Large Latent Extension Launchers
This folder contains the launch scripts for the non-location latent CoT runs.
launch_nonlocation_sft.sh
launch_nonlocation_grpo.sh
These are the scripts used for the distributed multi-GPU non-location curriculum.
Useful environment variables:
NUM_COT_TOKENS
STAGE_I
NUM_PROCESSES
GPU_IDS
INIT_ADAPTER_DIR
OUTPUT_DIR
LIMIT_TRAIN_ROWS
WANDB_MODE
WANDB_ENTITY
Example:
NUM_COT_TOKENS=3 \
STAGE_I=3 \
NUM_PROCESSES=8 \
GPU_IDS=0,1,2,3,4,5,6,7 \
WANDB_MODE=online \
WANDB_ENTITY=training-dynamics \
bash launch_nonlocation_sft.sh
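In the example above, NUM_PROCESSES matches the number of entries in GPU_IDS. A minimal pre-flight sketch of that check is shown below, assuming the launchers expect one process per listed GPU (the scripts themselves may or may not enforce this). The helper names count_gpu_ids and check_launch_env are hypothetical and are not part of the launch scripts.

```shell
# Hypothetical helpers, not part of the launch scripts.

# Print the number of non-empty comma-separated entries in a GPU_IDS-style string.
count_gpu_ids() {
  printf '%s\n' "$1" | tr ',' '\n' | grep -c .
}

# Succeed (exit status 0) only when the process count equals the GPU ID count.
# Usage: check_launch_env NUM_PROCESSES GPU_IDS
check_launch_env() {
  [ "$(count_gpu_ids "$2")" -eq "$1" ]
}
```

One way to use it would be to source these functions and run check_launch_env "$NUM_PROCESSES" "$GPU_IDS" before calling a launcher, aborting if it returns a non-zero status.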