error_diagnostic_model_mPyT5_label / FineTune.sh.o40489474
ace14459tv
エラー診断ヒデル(γƒ©γƒ™γƒ«δΊˆζΈ¬)mPyT5ベース
30934a6
GPU available: True (cuda), used: True
TPU available: False, using: 0 TPU cores
IPU available: False, using: 0 IPUs
HPU available: False, using: 0 HPUs
Downloading and preparing dataset json/default to /home/ace14459tv/t5maru/cache/json/default-4c66c2b66c128f21/0.0.0/e347ab1c932092252e717ff3f949105a4dd28b27e842dd53157d2f72e276c2e4...
Downloading data files: 0%| | 0/1 [00:00<?, ?it/s] Downloading data files: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1/1 [00:00<00:00, 6523.02it/s]
Extracting data files: 0%| | 0/1 [00:00<?, ?it/s] Extracting data files: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1/1 [00:00<00:00, 421.79it/s]
Generating train split: 0 examples [00:00, ? examples/s] Dataset json downloaded and prepared to /home/ace14459tv/t5maru/cache/json/default-4c66c2b66c128f21/0.0.0/e347ab1c932092252e717ff3f949105a4dd28b27e842dd53157d2f72e276c2e4. Subsequent calls will reuse this data.
Map (num_proc=4): 0%| | 0/2880 [00:00<?, ? examples/s] Map (num_proc=4): 3%|β–Ž | 82/2880 [00:00<00:17, 157.85 examples/s] Map (num_proc=4): 15%|β–ˆβ–Œ | 442/2880 [00:00<00:03, 734.39 examples/s] Map (num_proc=4): 45%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 1306/2880 [00:00<00:00, 2320.99 examples/s] Map (num_proc=4): 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 2235/2880 [00:00<00:00, 3871.65 examples/s] Downloading and preparing dataset json/default to /home/ace14459tv/t5maru/cache/json/default-f20400f24e226082/0.0.0/e347ab1c932092252e717ff3f949105a4dd28b27e842dd53157d2f72e276c2e4...
Downloading data files: 0%| | 0/1 [00:00<?, ?it/s] Downloading data files: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1/1 [00:00<00:00, 8943.08it/s]
Extracting data files: 0%| | 0/1 [00:00<?, ?it/s] Extracting data files: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1/1 [00:00<00:00, 442.30it/s]
Generating train split: 0 examples [00:00, ? examples/s] Dataset json downloaded and prepared to /home/ace14459tv/t5maru/cache/json/default-f20400f24e226082/0.0.0/e347ab1c932092252e717ff3f949105a4dd28b27e842dd53157d2f72e276c2e4. Subsequent calls will reuse this data.
Map (num_proc=4): 0%| | 0/618 [00:00<?, ? examples/s] Map (num_proc=4): 10%|β–‰ | 61/618 [00:00<00:06, 92.78 examples/s] LOCAL_RANK: 0 - CUDA_VISIBLE_DEVICES: [0]
| Name | Type | Params
------------------------------------------
0 | model | OptimizedModule | 300 M
------------------------------------------
300 M Trainable params
0 Non-trainable params
300 M Total params
1,200.707 Total estimated model params size (MB)
[2023-08-25 17:16:53,951] torch._inductor.utils: [WARNING] using triton random, expect difference from eager
Metric val_loss improved. New best score: 0.683
Metric val_loss improved by 0.124 >= min_delta = 0.0. New best score: 0.559
Metric val_loss improved by 0.106 >= min_delta = 0.0. New best score: 0.454
Metric val_loss improved by 0.032 >= min_delta = 0.0. New best score: 0.422
Metric val_loss improved by 0.040 >= min_delta = 0.0. New best score: 0.382
Metric val_loss improved by 0.020 >= min_delta = 0.0. New best score: 0.363
Monitored metric val_loss did not improve in the last 3 records. Best score: 0.363. Signaling Trainer to stop.
{"log": "trained", "date": "2023-08-25T17:15:59", "elapsed": "00:04:12", "model": "Roy029/mpyt5_e20", "max_length": 128, "target_max_length": 128, "batch_size": 32, "gradient_accumulation_steps": 1, "train_steps": 2700, "accelerator": "gpu", "devices": "auto", "precision": 32, "strategy": "auto", "gradient_clip_val": 1.0, "compile": true, "solver": "adamw", "lr": 0.0003, "warmup_steps": 1, "training_steps": 100000, "adam_epsilon": 1e-08, "weight_decay": 0.0, "epoch": 11, "step": 990, "saved": "error_label_mPyT5"}
😊 testing /home/ace14459tv/t5maru/error_label_test.jsonl on cuda
Downloading and preparing dataset generator/default to /home/ace14459tv/t5maru/cache/generator/default-86c49a273729eafb/0.0.0...
Generating train split: 0 examples [00:00, ? examples/s] Dataset generator downloaded and prepared to /home/ace14459tv/t5maru/cache/generator/default-86c49a273729eafb/0.0.0. Subsequent calls will reuse this data.
Map (num_proc=4): 0%| | 0/617 [00:00<?, ? examples/s] Map (num_proc=4): 25%|β–ˆβ–ˆβ– | 154/617 [00:00<00:01, 307.18 examples/s] 😊 Tested 617 items. See error_label_mPyT5/error_label_tested.jsonl