2025-11-29 22:20:42 - train - INFO - Logging to: logs/codet5/train_20251129_222042.log
2025-11-29 22:20:42 - train - INFO - Monitor progress: tail -f logs/codet5/train_20251129_222042.log
2025-11-29 22:20:42 - train - INFO - ============================================================
2025-11-29 22:20:42 - train - INFO - CodeT5+ Training
2025-11-29 22:20:42 - train - INFO - ============================================================
2025-11-29 22:20:42 - train - INFO - Using CUDA device: 0
2025-11-29 22:20:42 - train - INFO - GPU: NVIDIA GeForce RTX 4090
2025-11-29 22:20:42 - train - INFO - Configuration:
2025-11-29 22:20:42 - train - INFO -   model: Salesforce/codet5p-220m
2025-11-29 22:20:42 - train - INFO -   data: datasets/python
2025-11-29 22:20:42 - train - INFO -   output: model/checkpoints/run1-python-codet5
2025-11-29 22:20:42 - train - INFO -   batch_size: 10
2025-11-29 22:20:42 - train - INFO -   gradient_accumulation_steps: 4
2025-11-29 22:20:42 - train - INFO -   effective_batch_size: 40
2025-11-29 22:20:42 - train - INFO -   learning_rate: 5e-05
2025-11-29 22:20:42 - train - INFO -   epochs: 5
2025-11-29 22:20:42 - train - INFO -   max_source_len: 1024
2025-11-29 22:20:42 - train - INFO -   max_target_len: 32
2025-11-29 22:20:42 - train - INFO -   fp16: True
2025-11-29 22:20:42 - train - INFO -   seed: 42
2025-11-29 22:20:42 - train - INFO - Loading tokenizer and model...
2025-11-29 22:20:51 - train - INFO - Model loaded: Salesforce/codet5p-220m
2025-11-29 22:20:51 - train - INFO - Loading and preprocessing dataset...
2025-11-29 22:20:54 - train - INFO - Train examples: 155411
2025-11-29 22:20:54 - train - INFO - Validation examples: 19426
2025-11-29 22:30:30 - train - INFO - Dataset preprocessing completed
2025-11-29 22:30:30 - train - INFO - Starting training...
2025-11-29 22:30:30 - train - INFO - Total training steps: 19425
2025-11-29 22:30:30 - train - INFO - No checkpoint found for auto-resume, starting from scratch
2025-11-30 06:15:27 - train - INFO - Training completed in 28485.35 seconds (7.91 hours)
2025-11-30 06:15:27 - train - INFO - Saving model to model/checkpoints/run1-python-codet5
2025-11-30 06:15:28 - train - INFO - Model and tokenizer saved successfully
2025-11-30 06:15:28 - train - INFO - ============================================================
2025-11-30 06:15:28 - train - INFO - Training Summary
2025-11-30 06:15:28 - train - INFO - ============================================================
2025-11-30 06:15:28 - train - INFO - Total time: 7.91 hours
2025-11-30 06:15:28 - train - INFO - Output directory: model/checkpoints/run1-python-codet5
2025-11-30 06:15:28 - train - INFO - Training log: model/checkpoints/run1-python-codet5/training_log.csv
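
Note on the logged step count: the figure "Total training steps: 19425" can be reproduced from the other logged values (155411 train examples, batch_size 10, gradient_accumulation_steps 4, epochs 5). The sketch below is an assumption about how the count was derived, not the training script's actual code: it keeps the final partial batch and floors the number of optimizer updates per epoch. The function name total_optimizer_steps is hypothetical.

```python
import math


def total_optimizer_steps(num_examples, batch_size, grad_accum_steps, epochs):
    """Estimate optimizer updates for a run (hypothetical helper, not from the script)."""
    # Micro-batches per epoch, assuming the last partial batch is kept.
    batches_per_epoch = math.ceil(num_examples / batch_size)
    # Optimizer updates per epoch, assuming leftover micro-batches are floored away.
    updates_per_epoch = max(batches_per_epoch // grad_accum_steps, 1)
    return updates_per_epoch * epochs


# Values copied from the log above.
steps = total_optimizer_steps(155411, 10, 4, 5)
print(steps)  # 19425, matching "Total training steps"

# Wall-clock figures derived from the logged 28485.35 seconds.
print(round(28485.35 / 3600, 2))   # 7.91 hours, matching the summary
print(round(28485.35 / steps, 3))  # average seconds per optimizer step
```

Under these assumptions each epoch has 15542 micro-batches and 3885 optimizer updates. Dividing the example count by the effective batch size of 40 and rounding up would instead give 3886 updates per epoch (19430 total), so the logged value is consistent with the flooring convention rather than with rounding up.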