PPO Logs

This folder stores training artifacts produced by train.py.

Files:

  • train_metrics.csv: per-episode reward, task_score, steps, and running baseline.
  • summary.txt: compact training summary, intended as evidence for the README and for judges.
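
To inspect a finished run, the metrics file can be read with the standard library alone. The following is a minimal sketch, not part of train.py: the helper name summarize_metrics is hypothetical, and the column headers reward and task_score are assumed from the field list above, so adjust them if the actual CSV header differs.

```python
import csv
from statistics import mean


def summarize_metrics(path, reward_col="reward", score_col="task_score"):
    """Return the episode count and mean reward/task score from a metrics CSV.

    The column names are assumptions based on the field list in this README;
    pass different names if the real header differs.
    """
    with open(path, newline="") as f:
        rows = list(csv.DictReader(f))

    # An empty file (header only, or nothing at all) has no episodes to average.
    if not rows:
        return {"episodes": 0, "mean_reward": None, "mean_task_score": None}

    return {
        "episodes": len(rows),
        "mean_reward": mean(float(r[reward_col]) for r in rows),
        "mean_task_score": mean(float(r[score_col]) for r in rows),
    }
```

Calling summarize_metrics("train_metrics.csv") from this folder would then return a small dictionary that can be compared against the values in summary.txt.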

Example run:

source .venv/bin/activate
python train.py --episodes 120 --max-steps 5