File size: 735 Bytes
76de008
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
# Large Baseline Extension Launchers

This folder contains launch scripts for the non-location baseline multi-output runs.

- `launch_nonlocation_pipeline.sh`
- `launch_nonlocation_sft.sh`
- `launch_nonlocation_grpo.sh`

The main entry point for a full staged resume run is `launch_nonlocation_pipeline.sh`.

Useful environment variables:

- `MIN_STAGE`
- `MAX_STAGE`
- `NUM_PROCESSES`
- `GPU_IDS`
- `BOOTSTRAP_ADAPTER_DIR`
- `OUTPUT_ROOT`
- `RUN_TAG`
- `LIMIT_TRAIN_ROWS`
- `WANDB_MODE`
- `WANDB_ENTITY`

Example:

```bash
MIN_STAGE=3 \
MAX_STAGE=5 \
NUM_PROCESSES=8 \
GPU_IDS=0,1,2,3,4,5,6,7 \
BOOTSTRAP_ADAPTER_DIR=/path/to/stage02_grpo \
WANDB_MODE=online \
WANDB_ENTITY=training-dynamics \
bash launch_nonlocation_pipeline.sh
```