File size: 11,647 Bytes
f479aa1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
2025-08-10 22:18:56,288 INFO instruction: === Avvio Instruction SFT ===
2025-08-10 22:18:56,288 INFO instruction: Repo di destinazione: raniero/test_instr
2025-08-10 22:18:56,288 INFO instruction: Output dir: /app/instruction_output
2025-08-10 22:18:56,288 INFO instruction: Modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:18:56,288 INFO instruction: Dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:18:56,289 INFO instruction: Carico tokenizer e modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:18:59,932 INFO instruction: Carico dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:19:19,445 INFO instruction: === Avvio Instruction SFT ===
2025-08-10 22:19:19,445 INFO instruction: Repo di destinazione: raniero/test_instr
2025-08-10 22:19:19,445 INFO instruction: Output dir: /app/instruction_output
2025-08-10 22:19:19,445 INFO instruction: Modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:19:19,445 INFO instruction: Dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:19:19,445 INFO instruction: Carico tokenizer e modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:19:21,896 INFO instruction: Carico dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:19:40,761 INFO instruction: === Avvio Instruction SFT ===
2025-08-10 22:19:40,761 INFO instruction: Repo di destinazione: raniero/test_instr
2025-08-10 22:19:40,761 INFO instruction: Output dir: /app/instruction_output
2025-08-10 22:19:40,761 INFO instruction: Modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:19:40,761 INFO instruction: Dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:19:40,762 INFO instruction: Carico tokenizer e modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:26:01,552 INFO instruction: === Avvio Instruction SFT ===
2025-08-10 22:26:01,552 INFO instruction: Repo di destinazione: raniero/test_instr
2025-08-10 22:26:01,552 INFO instruction: Output dir: /app/instruction_output
2025-08-10 22:26:01,552 INFO instruction: Modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:26:01,552 INFO instruction: Dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:26:01,553 INFO instruction: Carico tokenizer e modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:26:04,373 INFO instruction: Carico dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:26:04,373 INFO instruction: Rilevato file locale .jsonl: uso datasets.load_dataset('json', data_files=...)
2025-08-10 22:26:23,681 INFO instruction: === Avvio Instruction SFT ===
2025-08-10 22:26:23,681 INFO instruction: Repo di destinazione: raniero/test_instr
2025-08-10 22:26:23,681 INFO instruction: Output dir: /app/instruction_output
2025-08-10 22:26:23,681 INFO instruction: Modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:26:23,681 INFO instruction: Dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:26:23,682 INFO instruction: Carico tokenizer e modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:26:26,122 INFO instruction: Carico dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:26:26,122 INFO instruction: Rilevato file locale .jsonl: uso datasets.load_dataset('json', data_files=...)
2025-08-10 22:26:45,289 INFO instruction: === Avvio Instruction SFT ===
2025-08-10 22:26:45,289 INFO instruction: Repo di destinazione: raniero/test_instr
2025-08-10 22:26:45,289 INFO instruction: Output dir: /app/instruction_output
2025-08-10 22:26:45,289 INFO instruction: Modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:26:45,289 INFO instruction: Dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:26:45,290 INFO instruction: Carico tokenizer e modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:26:47,722 INFO instruction: Carico dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:26:47,722 INFO instruction: Rilevato file locale .jsonl: uso datasets.load_dataset('json', data_files=...)
2025-08-10 22:27:07,798 INFO instruction: === Avvio Instruction SFT ===
2025-08-10 22:27:07,798 INFO instruction: Repo di destinazione: raniero/test_instr
2025-08-10 22:27:07,798 INFO instruction: Output dir: /app/instruction_output
2025-08-10 22:27:07,798 INFO instruction: Modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:27:07,798 INFO instruction: Dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:27:07,799 INFO instruction: Carico tokenizer e modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:29:09,494 INFO instruction: === Avvio Instruction SFT ===
2025-08-10 22:29:09,494 INFO instruction: Repo di destinazione: raniero/test_instr
2025-08-10 22:29:09,494 INFO instruction: Output dir: /app/instruction_output
2025-08-10 22:29:09,494 INFO instruction: Modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:29:09,494 INFO instruction: Dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:29:09,494 INFO instruction: Carico tokenizer e modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:29:12,006 INFO instruction: Carico dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:29:12,006 INFO instruction: Rilevato file locale .jsonl: uso datasets.load_dataset('json', data_files=...)
2025-08-10 22:29:31,739 INFO instruction: === Avvio Instruction SFT ===
2025-08-10 22:29:31,739 INFO instruction: Repo di destinazione: raniero/test_instr
2025-08-10 22:29:31,739 INFO instruction: Output dir: /app/instruction_output
2025-08-10 22:29:31,739 INFO instruction: Modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:29:31,739 INFO instruction: Dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:29:31,739 INFO instruction: Carico tokenizer e modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:41:07,807 INFO instruction: === Avvio Instruction SFT ===
2025-08-10 22:41:07,807 INFO instruction: Repo di destinazione: raniero/test_instr
2025-08-10 22:41:07,807 INFO instruction: Output dir: /app/instruction_output
2025-08-10 22:41:07,807 INFO instruction: Modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:41:07,807 INFO instruction: Dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:41:07,808 INFO instruction: Carico tokenizer e modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:41:10,222 INFO instruction: Carico dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:41:10,223 INFO instruction: Rilevato file locale .jsonl: uso datasets.load_dataset('json', data_files=..., keep_in_memory=True)
2025-08-10 22:41:10,642 WARNING instruction: Loader HF JSON fallito: Loading a dataset cached in a LocalFileSystem is not supported.
2025-08-10 22:41:10,642 WARNING instruction: Fallback: carico .jsonl manualmente in memoria (senza cache HF).
2025-08-10 22:41:10,644 INFO instruction: Costruisco prompt…
2025-08-10 22:41:10,646 INFO instruction: Tokenizzo…
2025-08-10 22:44:02,101 INFO instruction: === Avvio Instruction SFT ===
2025-08-10 22:44:02,101 INFO instruction: Repo di destinazione: raniero/test_instr
2025-08-10 22:44:02,101 INFO instruction: Output dir: /app/instruction_output
2025-08-10 22:44:02,101 INFO instruction: Modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:44:02,101 INFO instruction: Dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:44:02,102 INFO instruction: Carico tokenizer e modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:44:04,628 INFO instruction: Carico dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:44:04,628 INFO instruction: Rilevato file locale .jsonl: uso datasets.load_dataset('json', data_files=..., keep_in_memory=True)
2025-08-10 22:44:05,051 WARNING instruction: Loader HF JSON fallito: Loading a dataset cached in a LocalFileSystem is not supported.
2025-08-10 22:44:05,051 WARNING instruction: Fallback: carico .jsonl manualmente in memoria (senza cache HF).
2025-08-10 22:44:05,056 INFO instruction: Costruisco prompt…
2025-08-10 22:44:05,060 INFO instruction: Tokenizzo…
2025-08-10 22:44:05,075 INFO instruction: Inizio training SFT (Instruction)…
2025-08-10 22:45:01,392 INFO instruction: Metriche: {'train_runtime': 56.218, 'train_samples_per_second': 0.018, 'train_steps_per_second': 0.018, 'total_flos': 298647724032.0, 'train_loss': 4.8641180992126465, 'epoch': 1.0}
2025-08-10 22:45:01,399 INFO instruction: Salvo modello e tokenizer…
2025-08-10 22:45:38,256 INFO instruction: Preparazione upload su Hugging Face…
2025-08-10 22:45:38,259 INFO instruction: HF token salvato nel keyring locale.
2025-08-10 22:46:09,780 INFO instruction: === Avvio Instruction SFT ===
2025-08-10 22:46:09,780 INFO instruction: Repo di destinazione: raniero/test_instr
2025-08-10 22:46:09,780 INFO instruction: Output dir: /app/instruction_output
2025-08-10 22:46:09,780 INFO instruction: Modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:46:09,780 INFO instruction: Dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:46:09,782 INFO instruction: Carico tokenizer e modello base: mistralai/Mistral-7B-Instruct-v0.2
2025-08-10 22:46:13,496 INFO instruction: Carico dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:46:13,496 INFO instruction: Rilevato file locale .jsonl: uso datasets.load_dataset('json', data_files=..., keep_in_memory=True)
2025-08-10 22:46:13,941 WARNING instruction: Loader HF JSON fallito: Loading a dataset cached in a LocalFileSystem is not supported.
2025-08-10 22:46:13,941 WARNING instruction: Fallback: carico .jsonl manualmente in memoria (senza cache HF).
2025-08-10 22:46:13,944 INFO instruction: Costruisco prompt…
2025-08-10 22:46:13,948 INFO instruction: Tokenizzo…
2025-08-10 22:46:13,970 INFO instruction: Inizio training SFT (Instruction)…
2025-08-10 22:47:10,463 INFO instruction: Metriche: {'train_runtime': 56.4102, 'train_samples_per_second': 0.053, 'train_steps_per_second': 0.035, 'total_flos': 5332995072000.0, 'train_loss': 3.2410759925842285, 'epoch': 1.0}
2025-08-10 22:47:10,468 INFO instruction: Salvo modello e tokenizer…
2025-08-10 22:47:46,027 INFO instruction: Preparazione upload su Hugging Face…
2025-08-10 22:47:46,030 INFO instruction: HF token salvato nel keyring locale.
2025-08-10 22:49:34,017 INFO instruction: === Avvio Instruction SFT ===
2025-08-10 22:49:34,017 INFO instruction: Repo di destinazione: raniero/test_instr
2025-08-10 22:49:34,017 INFO instruction: Output dir: /app/instruction_output
2025-08-10 22:49:34,017 INFO instruction: Modello base: distilgpt2
2025-08-10 22:49:34,017 INFO instruction: Dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:49:34,018 INFO instruction: Carico tokenizer e modello base: distilgpt2
2025-08-10 22:49:56,098 INFO instruction: Carico dataset: /home/raniero/test_instr.jsonl
2025-08-10 22:49:56,098 INFO instruction: Rilevato file locale .jsonl: uso datasets.load_dataset('json', data_files=..., keep_in_memory=True)
2025-08-10 22:49:56,526 WARNING instruction: Loader HF JSON fallito: Loading a dataset cached in a LocalFileSystem is not supported.
2025-08-10 22:49:56,526 WARNING instruction: Fallback: carico .jsonl manualmente in memoria (senza cache HF).
2025-08-10 22:49:56,593 INFO instruction: Costruisco prompt…
2025-08-10 22:49:56,598 INFO instruction: Tokenizzo…
2025-08-10 22:49:56,616 INFO instruction: Inizio training SFT (Instruction)…
2025-08-10 22:49:57,207 INFO instruction: Metriche: {'train_runtime': 0.531, 'train_samples_per_second': 5.649, 'train_steps_per_second': 3.766, 'total_flos': 32151748608.0, 'train_loss': 6.437064170837402, 'epoch': 1.0}
2025-08-10 22:49:57,207 INFO instruction: Salvo modello e tokenizer…
2025-08-10 22:49:59,041 INFO instruction: Preparazione upload su Hugging Face…
2025-08-10 22:49:59,041 INFO instruction: HF token salvato nel keyring locale.