2025-11-29 22:20:42 - train - INFO - Logging to: logs/codet5/train_20251129_222042.log
2025-11-29 22:20:42 - train - INFO - Monitor progress: tail -f logs/codet5/train_20251129_222042.log
2025-11-29 22:20:42 - train - INFO - ============================================================
2025-11-29 22:20:42 - train - INFO - CodeT5+ Training
2025-11-29 22:20:42 - train - INFO - ============================================================
2025-11-29 22:20:42 - train - INFO - Using CUDA device: 0
2025-11-29 22:20:42 - train - INFO - GPU: NVIDIA GeForce RTX 4090
2025-11-29 22:20:42 - train - INFO - Configuration:
2025-11-29 22:20:42 - train - INFO - model: Salesforce/codet5p-220m
2025-11-29 22:20:42 - train - INFO - data: datasets/python
2025-11-29 22:20:42 - train - INFO - output: model/checkpoints/run1-python-codet5
2025-11-29 22:20:42 - train - INFO - batch_size: 10
2025-11-29 22:20:42 - train - INFO - gradient_accumulation_steps: 4
2025-11-29 22:20:42 - train - INFO - effective_batch_size: 40
2025-11-29 22:20:42 - train - INFO - learning_rate: 5e-05
2025-11-29 22:20:42 - train - INFO - epochs: 5
2025-11-29 22:20:42 - train - INFO - max_source_len: 1024
2025-11-29 22:20:42 - train - INFO - max_target_len: 32
2025-11-29 22:20:42 - train - INFO - fp16: True
2025-11-29 22:20:42 - train - INFO - seed: 42
2025-11-29 22:20:42 - train - INFO - Loading tokenizer and model...
2025-11-29 22:20:51 - train - INFO - Model loaded: Salesforce/codet5p-220m
2025-11-29 22:20:51 - train - INFO - Loading and preprocessing dataset...
2025-11-29 22:20:54 - train - INFO - Train examples: 155411
2025-11-29 22:20:54 - train - INFO - Validation examples: 19426
2025-11-29 22:30:30 - train - INFO - Dataset preprocessing completed
2025-11-29 22:30:30 - train - INFO - Starting training...
2025-11-29 22:30:30 - train - INFO - Total training steps: 19425
2025-11-29 22:30:30 - train - INFO - No checkpoint found for auto-resume, starting from scratch
2025-11-30 06:15:27 - train - INFO - Training completed in 28485.35 seconds (7.91 hours)
2025-11-30 06:15:27 - train - INFO - Saving model to model/checkpoints/run1-python-codet5
2025-11-30 06:15:28 - train - INFO - Model and tokenizer saved successfully
2025-11-30 06:15:28 - train - INFO - ============================================================
2025-11-30 06:15:28 - train - INFO - Training Summary
2025-11-30 06:15:28 - train - INFO - ============================================================
2025-11-30 06:15:28 - train - INFO - Total time: 7.91 hours
2025-11-30 06:15:28 - train - INFO - Output directory: model/checkpoints/run1-python-codet5
2025-11-30 06:15:28 - train - INFO - Training log: model/checkpoints/run1-python-codet5/training_log.csv