Avra98's picture
Initial code dump (rebuttal-ready snapshot)
76de008 verified

Large Baseline Extension Launchers

This folder contains launch scripts for the non-location baseline multi-output runs.

  • launch_nonlocation_pipeline.sh
  • launch_nonlocation_sft.sh
  • launch_nonlocation_grpo.sh

The main entry point for a full staged resume run is launch_nonlocation_pipeline.sh.

Useful environment variables:

  • MIN_STAGE
  • MAX_STAGE
  • NUM_PROCESSES
  • GPU_IDS
  • BOOTSTRAP_ADAPTER_DIR
  • OUTPUT_ROOT
  • RUN_TAG
  • LIMIT_TRAIN_ROWS
  • WANDB_MODE
  • WANDB_ENTITY

Example:

MIN_STAGE=3 \
MAX_STAGE=5 \
NUM_PROCESSES=8 \
GPU_IDS=0,1,2,3,4,5,6,7 \
BOOTSTRAP_ADAPTER_DIR=/path/to/stage02_grpo \
WANDB_MODE=online \
WANDB_ENTITY=training-dynamics \
bash launch_nonlocation_pipeline.sh