ashishbaberwal's picture
New Final
1939cbc
# PPO Logs
This folder stores training artifacts produced by `train.py`.
Files:
- `train_metrics.csv`: per-episode reward, task_score, steps, and running baseline.
- `summary.txt`: compact training summary for README/judge evidence.
Example run:
```bash
source .venv/bin/activate
python train.py --episodes 120 --max-steps 5
```