| [0;32m[INFO][0m Launching instance for GPT-2 Large training... |
| [0;32m[INFO][0m Instance type: g5.2xlarge (48GB VRAM) |
| [0;32m[INFO][0m Finding Deep Learning AMI... |
| [0;32m[INFO][0m Using AMI: ami-0e4ef96c62e7cc2fe |
| [0;32m[INFO][0m Using key pair: chave-gpu-nova |
| [0;32m[INFO][0m Using security group: sg-0deaa73e23482e3f6 |
| [0;32m[INFO][0m Launching instance... |
| [0;32m[INFO][0m Instance launched: i-04dc6f51534d8185d |
| [0;32m[INFO][0m Waiting for instance to start... |
|
|
| ========================================== |
| [0;32mGPT-2 Large Training Instance Ready![0m |
| ========================================== |
| Instance ID: i-04dc6f51534d8185d |
| Instance Type: g5.2xlarge (48GB VRAM) |
| Public IP: 52.55.119.255 |
|
|
| [0;34mMonitor training:[0m |
| ssh -i ~/.ssh/chave-gpu-nova.pem ubuntu@52.55.119.255 |
| tail -f /home/ubuntu/training_large.log |
|
|
| [0;34mCheck when complete:[0m |
| ssh ubuntu@52.55.119.255 'while [ ! -f ~/.training_complete ]; do sleep 60; echo "Training in progress..."; done; cat ~/training_results.txt' |
|
|
| [1;33mEstimated time:[0m ~4-5 hours for 3 epochs |
| [1;33mCost:[0m ~-10 USD (/hour for g5.2xlarge) |
|
|
| [0;32m[INFO][0m Instance info saved to: /c/Users/madeinweb/.seriguela/large_instance_info.txt |
|
|