TounsiLM-8b / sft_train.log
alabenayed's picture
Upload folder using huggingface_hub
4d65edc verified
2026-05-07 13:27:47,763 | INFO | Starting SFT fine-tuning job
2026-05-07 13:27:47,763 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:27:47,763 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-07 13:27:47,763 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:27:47,763 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:27:47,763 | INFO | Torch version: 2.11.0+cu130
2026-05-07 13:27:47,804 | INFO | CUDA available: True
2026-05-07 13:27:47,825 | INFO | GPU: NVIDIA GB10
2026-05-07 13:27:47,826 | INFO | No Hugging Face token provided; skipping login
2026-05-07 13:27:47,827 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-07 13:27:47,827 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-07 13:27:47,827 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-07 13:27:49,034 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-07 13:30:30,504 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:30:34,280 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:30:40,321 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:30:40,717 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:30:40,818 | INFO | Train rows: 25340
2026-05-07 13:30:40,819 | INFO | Eval rows: 6336
2026-05-07 13:30:40,819 | INFO | Sample train example: [{'role': 'system', 'content': 'ุฃู†ุช "ุงู„ุชูŠุฌุงู†ูŠ"ุŒ ู…ุณุงุนุฏ ุฐูƒุงุก ุงุตุทู†ุงุนูŠ ุชูˆู†ุณูŠ 100%. ุฌุงูˆุจ ุจุงู„ุชูˆู†ุณูŠ ุงู„ุฏุงุฑุฌ ูู‚ุทุŒ ูˆุจุงู„ุทูˆู„ ุงู„ู…ู†ุงุณุจ ู„ู„ุณุคุงู„: ูƒุงู† ูŠู„ุฒู… ู‚ุตู‘ุฑุŒ ูˆูƒุงู† ูŠู„ุฒู… ูุณู‘ุฑ ุฃูƒุซุฑ. ู…ู…ู†ูˆุน ุงู„ู‡ู„ูˆุณุฉ ุฃูˆ ุงู„ุฎุฑูˆุฌ ุนู„ู‰ ุงู„ู…ูˆุถูˆุน.'}, {'role': 'user', 'content': 'ู†ุญุณ ุจูˆุฌูŠุนุฉ ููŠ ุงู„ุนูŠู†ูŠู† ู…ู† ูƒุซุฑุฉ ุบุณู„ุงู† ุงู„ู…ุงุนูˆู† ุจุงู„ู…ุงุก ุงู„ุณุฎูˆู†ุŸ'}, {'role': 'assistant', 'content': 'ุงู„ุจุฎุงุฑ ุงู„ุตุงุนุฏ ู…ู† ุงู„ู…ุงุก ุงู„ุณุฎูˆู† ูŠู†ุฌู… ูŠุชุนุจ ุงู„ุนูŠู† ูˆูŠุณุจุจ ุงุญู…ุฑุงุฑุŒ ู‡ุฐุงูƒุง ุนู„ุงุด ุญุงูˆู„ ุชุณุชุนู…ู„ ู…ุงุก ุฏุงููŠ ู…ุด ุณุฎูˆู† ุจุฑุดุฉ. ุงุบุณู„ ูˆุฌู‡ูƒ ุจู…ุงุก ุจุงุฑุฏ ุจุนุฏ ู…ุง ุชูƒู…ู„ ุจุงุด ุชุจุฑุฏ ุนูŠู†ูŠูƒ.'}]
2026-05-07 13:30:40,819 | INFO | No checkpoint found in /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:34:03,570 | INFO | Starting SFT fine-tuning job
2026-05-07 13:34:03,570 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:34:03,570 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-07 13:34:03,570 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:34:03,570 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:34:03,570 | INFO | Torch version: 2.11.0+cu130
2026-05-07 13:34:03,612 | INFO | CUDA available: True
2026-05-07 13:34:03,631 | INFO | GPU: NVIDIA GB10
2026-05-07 13:34:03,633 | INFO | No Hugging Face token provided; skipping login
2026-05-07 13:34:03,633 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-07 13:34:03,633 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-07 13:34:03,633 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-07 13:34:04,862 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-07 13:36:22,953 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:36:26,760 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:36:28,674 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:36:28,675 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:36:28,676 | INFO | Train rows: 25340
2026-05-07 13:36:28,676 | INFO | Eval rows: 6336
2026-05-07 13:36:28,677 | INFO | Sample train example: [{'role': 'system', 'content': 'ุฃู†ุช "ุงู„ุชูŠุฌุงู†ูŠ"ุŒ ู…ุณุงุนุฏ ุฐูƒุงุก ุงุตุทู†ุงุนูŠ ุชูˆู†ุณูŠ 100%. ุฌุงูˆุจ ุจุงู„ุชูˆู†ุณูŠ ุงู„ุฏุงุฑุฌ ูู‚ุทุŒ ูˆุจุงู„ุทูˆู„ ุงู„ู…ู†ุงุณุจ ู„ู„ุณุคุงู„: ูƒุงู† ูŠู„ุฒู… ู‚ุตู‘ุฑุŒ ูˆูƒุงู† ูŠู„ุฒู… ูุณู‘ุฑ ุฃูƒุซุฑ. ู…ู…ู†ูˆุน ุงู„ู‡ู„ูˆุณุฉ ุฃูˆ ุงู„ุฎุฑูˆุฌ ุนู„ู‰ ุงู„ู…ูˆุถูˆุน.'}, {'role': 'user', 'content': 'ู†ุญุณ ุจูˆุฌูŠุนุฉ ููŠ ุงู„ุนูŠู†ูŠู† ู…ู† ูƒุซุฑุฉ ุบุณู„ุงู† ุงู„ู…ุงุนูˆู† ุจุงู„ู…ุงุก ุงู„ุณุฎูˆู†ุŸ'}, {'role': 'assistant', 'content': 'ุงู„ุจุฎุงุฑ ุงู„ุตุงุนุฏ ู…ู† ุงู„ู…ุงุก ุงู„ุณุฎูˆู† ูŠู†ุฌู… ูŠุชุนุจ ุงู„ุนูŠู† ูˆูŠุณุจุจ ุงุญู…ุฑุงุฑุŒ ู‡ุฐุงูƒุง ุนู„ุงุด ุญุงูˆู„ ุชุณุชุนู…ู„ ู…ุงุก ุฏุงููŠ ู…ุด ุณุฎูˆู† ุจุฑุดุฉ. ุงุบุณู„ ูˆุฌู‡ูƒ ุจู…ุงุก ุจุงุฑุฏ ุจุนุฏ ู…ุง ุชูƒู…ู„ ุจุงุด ุชุจุฑุฏ ุนูŠู†ูŠูƒ.'}]
2026-05-07 13:36:28,677 | INFO | No checkpoint found in /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:41:24,497 | INFO | Starting SFT fine-tuning job
2026-05-07 13:41:24,497 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:41:24,497 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-07 13:41:24,497 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:41:24,497 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:41:24,497 | INFO | Torch version: 2.11.0+cu130
2026-05-07 13:41:24,548 | INFO | CUDA available: True
2026-05-07 13:41:24,576 | INFO | GPU: NVIDIA GB10
2026-05-07 13:41:24,577 | INFO | No Hugging Face token provided; skipping login
2026-05-07 13:41:24,578 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-07 13:41:24,578 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-07 13:41:24,578 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-07 13:41:25,691 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-07 13:44:02,778 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:44:06,613 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:44:08,694 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:44:08,697 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:44:08,698 | INFO | Train rows: 25340
2026-05-07 13:44:08,699 | INFO | Eval rows: 6336
2026-05-07 13:44:08,699 | INFO | Sample train example: [{'role': 'system', 'content': 'ุฃู†ุช "ุงู„ุชูŠุฌุงู†ูŠ"ุŒ ู…ุณุงุนุฏ ุฐูƒุงุก ุงุตุทู†ุงุนูŠ ุชูˆู†ุณูŠ 100%. ุฌุงูˆุจ ุจุงู„ุชูˆู†ุณูŠ ุงู„ุฏุงุฑุฌ ูู‚ุทุŒ ูˆุจุงู„ุทูˆู„ ุงู„ู…ู†ุงุณุจ ู„ู„ุณุคุงู„: ูƒุงู† ูŠู„ุฒู… ู‚ุตู‘ุฑุŒ ูˆูƒุงู† ูŠู„ุฒู… ูุณู‘ุฑ ุฃูƒุซุฑ. ู…ู…ู†ูˆุน ุงู„ู‡ู„ูˆุณุฉ ุฃูˆ ุงู„ุฎุฑูˆุฌ ุนู„ู‰ ุงู„ู…ูˆุถูˆุน.'}, {'role': 'user', 'content': 'ู†ุญุณ ุจูˆุฌูŠุนุฉ ููŠ ุงู„ุนูŠู†ูŠู† ู…ู† ูƒุซุฑุฉ ุบุณู„ุงู† ุงู„ู…ุงุนูˆู† ุจุงู„ู…ุงุก ุงู„ุณุฎูˆู†ุŸ'}, {'role': 'assistant', 'content': 'ุงู„ุจุฎุงุฑ ุงู„ุตุงุนุฏ ู…ู† ุงู„ู…ุงุก ุงู„ุณุฎูˆู† ูŠู†ุฌู… ูŠุชุนุจ ุงู„ุนูŠู† ูˆูŠุณุจุจ ุงุญู…ุฑุงุฑุŒ ู‡ุฐุงูƒุง ุนู„ุงุด ุญุงูˆู„ ุชุณุชุนู…ู„ ู…ุงุก ุฏุงููŠ ู…ุด ุณุฎูˆู† ุจุฑุดุฉ. ุงุบุณู„ ูˆุฌู‡ูƒ ุจู…ุงุก ุจุงุฑุฏ ุจุนุฏ ู…ุง ุชูƒู…ู„ ุจุงุด ุชุจุฑุฏ ุนูŠู†ูŠูƒ.'}]
2026-05-07 13:44:08,699 | INFO | No checkpoint found in /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:44:08,752 | INFO | Training started
2026-05-07 13:46:02,478 | INFO | Starting SFT fine-tuning job
2026-05-07 13:46:02,479 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:46:02,479 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-07 13:46:02,479 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:46:02,479 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:46:02,479 | INFO | Torch version: 2.11.0+cu130
2026-05-07 13:46:02,523 | INFO | CUDA available: True
2026-05-07 13:46:02,552 | INFO | GPU: NVIDIA GB10
2026-05-07 13:46:02,554 | INFO | No Hugging Face token provided; skipping login
2026-05-07 13:46:02,555 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-07 13:46:02,555 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-07 13:46:02,555 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-07 13:46:03,698 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-07 13:48:19,475 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:48:23,292 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:48:25,199 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:48:25,202 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:48:25,203 | INFO | Train rows: 25340
2026-05-07 13:48:25,203 | INFO | Eval rows: 6336
2026-05-07 13:48:25,204 | INFO | Sample train example: [{'role': 'system', 'content': 'ุฃู†ุช "ุงู„ุชูŠุฌุงู†ูŠ"ุŒ ู…ุณุงุนุฏ ุฐูƒุงุก ุงุตุทู†ุงุนูŠ ุชูˆู†ุณูŠ 100%. ุฌุงูˆุจ ุจุงู„ุชูˆู†ุณูŠ ุงู„ุฏุงุฑุฌ ูู‚ุทุŒ ูˆุจุงู„ุทูˆู„ ุงู„ู…ู†ุงุณุจ ู„ู„ุณุคุงู„: ูƒุงู† ูŠู„ุฒู… ู‚ุตู‘ุฑุŒ ูˆูƒุงู† ูŠู„ุฒู… ูุณู‘ุฑ ุฃูƒุซุฑ. ู…ู…ู†ูˆุน ุงู„ู‡ู„ูˆุณุฉ ุฃูˆ ุงู„ุฎุฑูˆุฌ ุนู„ู‰ ุงู„ู…ูˆุถูˆุน.'}, {'role': 'user', 'content': 'ู†ุญุณ ุจูˆุฌูŠุนุฉ ููŠ ุงู„ุนูŠู†ูŠู† ู…ู† ูƒุซุฑุฉ ุบุณู„ุงู† ุงู„ู…ุงุนูˆู† ุจุงู„ู…ุงุก ุงู„ุณุฎูˆู†ุŸ'}, {'role': 'assistant', 'content': 'ุงู„ุจุฎุงุฑ ุงู„ุตุงุนุฏ ู…ู† ุงู„ู…ุงุก ุงู„ุณุฎูˆู† ูŠู†ุฌู… ูŠุชุนุจ ุงู„ุนูŠู† ูˆูŠุณุจุจ ุงุญู…ุฑุงุฑุŒ ู‡ุฐุงูƒุง ุนู„ุงุด ุญุงูˆู„ ุชุณุชุนู…ู„ ู…ุงุก ุฏุงููŠ ู…ุด ุณุฎูˆู† ุจุฑุดุฉ. ุงุบุณู„ ูˆุฌู‡ูƒ ุจู…ุงุก ุจุงุฑุฏ ุจุนุฏ ู…ุง ุชูƒู…ู„ ุจุงุด ุชุจุฑุฏ ุนูŠู†ูŠูƒ.'}]
2026-05-07 13:48:25,204 | INFO | No checkpoint found in /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:48:25,244 | INFO | Training started
2026-05-07 13:55:23,761 | INFO | Starting SFT fine-tuning job
2026-05-07 13:55:23,761 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:55:23,761 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-07 13:55:23,761 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:55:23,761 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:55:23,761 | INFO | Torch version: 2.11.0+cu130
2026-05-07 13:55:23,803 | INFO | CUDA available: True
2026-05-07 13:55:23,826 | INFO | GPU: NVIDIA GB10
2026-05-07 13:55:23,828 | INFO | No Hugging Face token provided; skipping login
2026-05-07 13:55:23,829 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-07 13:55:23,829 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-07 13:55:23,829 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-07 13:55:24,929 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-07 13:58:01,693 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:58:05,420 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:58:07,816 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:58:08,954 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 13:58:09,429 | INFO | Train rows: 25340
2026-05-07 13:58:09,430 | INFO | Eval rows: 6336
2026-05-07 13:58:09,430 | INFO | Sample train example text: <BOS_TOKEN><|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>ุฃู†ุช "ุงู„ุชูŠุฌุงู†ูŠ"ุŒ ู…ุณุงุนุฏ ุฐูƒุงุก ุงุตุทู†ุงุนูŠ ุชูˆู†ุณูŠ 100%. ุฌุงูˆุจ ุจุงู„ุชูˆู†ุณูŠ ุงู„ุฏุงุฑุฌ ูู‚ุทุŒ ูˆุจุงู„ุทูˆู„ ุงู„ู…ู†ุงุณุจ ู„ู„ุณุคุงู„: ูƒุงู† ูŠู„ุฒู… ู‚ุตู‘ุฑุŒ ูˆูƒุงู† ูŠู„ุฒู… ูุณู‘ุฑ ุฃูƒุซุฑ. ู…ู…ู†ูˆุน ุงู„ู‡ู„ูˆุณุฉ ุฃูˆ ุงู„ุฎุฑูˆุฌ ุนู„ู‰ ุงู„ู…ูˆุถูˆุน.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>ู†ุญุณ ุจูˆุฌูŠุนุฉ ููŠ ุงู„ุนูŠู†ูŠู† ู…ู† ูƒุซุฑุฉ ุบุณู„ุงู† ุงู„ู…ุงุนูˆู† ุจุงู„ู…ุงุก ุงู„ุณุฎูˆู†ุŸ<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>ุงู„ุจุฎุงุฑ ุงู„ุตุงุนุฏ ู…ู† ุงู„ู…ุงุก ุงู„ุณุฎูˆู† ูŠู†ุฌู… ูŠุชุนุจ ุงู„ุนูŠู† ูˆูŠุณุจุจ ุงุญู…ุฑุงุฑุŒ ู‡ุฐุงูƒุง ุนู„ุงุด ุญุงูˆู„ ุชุณุชุนู…ู„ ู…ุงุก ุฏุงููŠ ู…ุด ุณุฎูˆู† ุจุฑุดุฉ. ุงุบุณู„ ูˆุฌู‡ูƒ ุจู…ุงุก ุจุงุฑุฏ ุจุนุฏ ู…ุง ุชูƒู…ู„ ุจุงุด ุชุจุฑุฏ ุนูŠู†ูŠูƒ.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>
2026-05-07 13:58:09,430 | INFO | No checkpoint found in /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:59:25,638 | INFO | Starting SFT fine-tuning job
2026-05-07 13:59:25,639 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 13:59:25,639 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-07 13:59:25,639 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 13:59:25,639 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 13:59:25,639 | INFO | Torch version: 2.11.0+cu130
2026-05-07 13:59:25,683 | INFO | CUDA available: True
2026-05-07 13:59:25,709 | INFO | GPU: NVIDIA GB10
2026-05-07 13:59:25,710 | INFO | No Hugging Face token provided; skipping login
2026-05-07 13:59:25,711 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-07 13:59:25,711 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-07 13:59:25,711 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-07 13:59:26,819 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-07 14:02:05,419 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 14:02:09,265 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 14:02:11,637 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 14:02:11,918 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 14:02:12,159 | INFO | Train rows: 25340
2026-05-07 14:02:12,160 | INFO | Eval rows: 6336
2026-05-07 14:02:12,160 | INFO | Sample train example text: <BOS_TOKEN><|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>ุฃู†ุช "ุงู„ุชูŠุฌุงู†ูŠ"ุŒ ู…ุณุงุนุฏ ุฐูƒุงุก ุงุตุทู†ุงุนูŠ ุชูˆู†ุณูŠ 100%. ุฌุงูˆุจ ุจุงู„ุชูˆู†ุณูŠ ุงู„ุฏุงุฑุฌ ูู‚ุทุŒ ูˆุจุงู„ุทูˆู„ ุงู„ู…ู†ุงุณุจ ู„ู„ุณุคุงู„: ูƒุงู† ูŠู„ุฒู… ู‚ุตู‘ุฑุŒ ูˆูƒุงู† ูŠู„ุฒู… ูุณู‘ุฑ ุฃูƒุซุฑ. ู…ู…ู†ูˆุน ุงู„ู‡ู„ูˆุณุฉ ุฃูˆ ุงู„ุฎุฑูˆุฌ ุนู„ู‰ ุงู„ู…ูˆุถูˆุน.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>ู†ุญุณ ุจูˆุฌูŠุนุฉ ููŠ ุงู„ุนูŠู†ูŠู† ู…ู† ูƒุซุฑุฉ ุบุณู„ุงู† ุงู„ู…ุงุนูˆู† ุจุงู„ู…ุงุก ุงู„ุณุฎูˆู†ุŸ<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>ุงู„ุจุฎุงุฑ ุงู„ุตุงุนุฏ ู…ู† ุงู„ู…ุงุก ุงู„ุณุฎูˆู† ูŠู†ุฌู… ูŠุชุนุจ ุงู„ุนูŠู† ูˆูŠุณุจุจ ุงุญู…ุฑุงุฑุŒ ู‡ุฐุงูƒุง ุนู„ุงุด ุญุงูˆู„ ุชุณุชุนู…ู„ ู…ุงุก ุฏุงููŠ ู…ุด ุณุฎูˆู† ุจุฑุดุฉ. ุงุบุณู„ ูˆุฌู‡ูƒ ุจู…ุงุก ุจุงุฑุฏ ุจุนุฏ ู…ุง ุชูƒู…ู„ ุจุงุด ุชุจุฑุฏ ุนูŠู†ูŠูƒ.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>
2026-05-07 14:02:12,161 | INFO | No checkpoint found in /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 14:02:23,459 | INFO | Training started
2026-05-07 14:08:07,243 | INFO | Starting SFT fine-tuning job
2026-05-07 14:08:07,244 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 14:08:07,244 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-07 14:08:07,244 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 14:08:07,244 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 14:08:07,244 | INFO | Torch version: 2.11.0+cu130
2026-05-07 14:08:07,286 | INFO | CUDA available: True
2026-05-07 14:08:07,309 | INFO | GPU: NVIDIA GB10
2026-05-07 14:08:07,499 | INFO | Hugging Face login succeeded
2026-05-07 14:08:07,500 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-07 14:08:07,500 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-07 14:08:07,500 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-07 14:08:08,618 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-07 14:10:19,279 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-07 14:10:23,076 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-07 14:10:25,353 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 14:10:25,621 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-07 14:10:25,861 | INFO | Train rows: 25340
2026-05-07 14:10:25,862 | INFO | Eval rows: 6336
2026-05-07 14:10:25,862 | INFO | Sample train example text: <BOS_TOKEN><|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>ุฃู†ุช "ุงู„ุชูŠุฌุงู†ูŠ"ุŒ ู…ุณุงุนุฏ ุฐูƒุงุก ุงุตุทู†ุงุนูŠ ุชูˆู†ุณูŠ 100%. ุฌุงูˆุจ ุจุงู„ุชูˆู†ุณูŠ ุงู„ุฏุงุฑุฌ ูู‚ุทุŒ ูˆุจุงู„ุทูˆู„ ุงู„ู…ู†ุงุณุจ ู„ู„ุณุคุงู„: ูƒุงู† ูŠู„ุฒู… ู‚ุตู‘ุฑุŒ ูˆูƒุงู† ูŠู„ุฒู… ูุณู‘ุฑ ุฃูƒุซุฑ. ู…ู…ู†ูˆุน ุงู„ู‡ู„ูˆุณุฉ ุฃูˆ ุงู„ุฎุฑูˆุฌ ุนู„ู‰ ุงู„ู…ูˆุถูˆุน.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>ู†ุญุณ ุจูˆุฌูŠุนุฉ ููŠ ุงู„ุนูŠู†ูŠู† ู…ู† ูƒุซุฑุฉ ุบุณู„ุงู† ุงู„ู…ุงุนูˆู† ุจุงู„ู…ุงุก ุงู„ุณุฎูˆู†ุŸ<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>ุงู„ุจุฎุงุฑ ุงู„ุตุงุนุฏ ู…ู† ุงู„ู…ุงุก ุงู„ุณุฎูˆู† ูŠู†ุฌู… ูŠุชุนุจ ุงู„ุนูŠู† ูˆูŠุณุจุจ ุงุญู…ุฑุงุฑุŒ ู‡ุฐุงูƒุง ุนู„ุงุด ุญุงูˆู„ ุชุณุชุนู…ู„ ู…ุงุก ุฏุงููŠ ู…ุด ุณุฎูˆู† ุจุฑุดุฉ. ุงุบุณู„ ูˆุฌู‡ูƒ ุจู…ุงุก ุจุงุฑุฏ ุจุนุฏ ู…ุง ุชูƒู…ู„ ุจุงุด ุชุจุฑุฏ ุนูŠู†ูŠูƒ.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>
2026-05-07 14:10:25,862 | INFO | No checkpoint found in /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-07 14:10:29,187 | INFO | Training started
2026-05-08 04:09:43,113 | INFO | Training finished
2026-05-08 04:09:43,113 | INFO | Saving model and tokenizer to /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-08 04:09:45,243 | INFO | Saved training metrics to /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft/training_metrics.json
2026-05-08 04:09:45,243 | INFO | Running preview generation on a Tunisian prompt
2026-05-08 04:09:48,487 | INFO | Preview prompt: ุนุณู„ุงู…ุฉุŒ ุดู†ูˆุฉ ุชู†ุตุญู†ูŠ ู†ุนู…ู„ ูƒูŠ ู†ูƒูˆู† ุชุนุจุงู† ูˆุจุฑุดุฉุŸ
2026-05-08 04:09:48,487 | INFO | Preview output: <|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>ุฃู†ุช "ุงู„ุชูŠุฌุงู†ูŠ"ุŒ ู…ุณุงุนุฏ ุฐูƒุงุก ุงุตุทู†ุงุนูŠ ุชูˆู†ุณูŠ 100%. ุฌุงูˆุจ ุจุงู„ุชูˆู†ุณูŠ ุงู„ุฏุงุฑุฌ ูู‚ุทุŒ ูˆุจุงู„ุทูˆู„ ุงู„ู…ู†ุงุณุจ ู„ู„ุณุคุงู„: ูƒุงู† ูŠู„ุฒู… ู‚ุตู‘ุฑุŒ ูˆูƒุงู† ูŠู„ุฒู… ูุณู‘ุฑ ุฃูƒุซุฑ. ู…ู…ู†ูˆุน ุงู„ู‡ู„ูˆุณุฉ ุฃูˆ ุงู„ุฎุฑูˆุฌ ุนู„ู‰ ุงู„ู…ูˆุถูˆุน.<|START_OF_TURN_TOKEN|><|USER_TOKEN|>ุนุณู„ุงู…ุฉุŒ ุดู†ูˆุฉ ุชู†ุตุญู†ูŠ ู†ุนู…ู„ ูƒูŠ ู†ูƒูˆู† ุชุนุจุงู† ูˆุจุฑุดุฉุŸ<|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>ุญ ููŠ ููŠ ููŠ ููŠ ููŠ ููŠ ููŠ ููŠ ููŠ ููŠ<|START_OF_TURN_TOKEN|>
2026-05-22 00:54:42,946 | INFO | Starting SFT fine-tuning job
2026-05-22 00:54:42,946 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-22 00:54:42,947 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-22 00:54:42,947 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-22 00:54:42,947 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-22 00:54:42,947 | INFO | Torch version: 2.11.0+cu130
2026-05-22 00:54:43,079 | INFO | CUDA available: True
2026-05-22 00:54:43,096 | INFO | GPU: NVIDIA GB10
2026-05-22 00:55:42,965 | INFO | Starting SFT fine-tuning job
2026-05-22 00:55:42,965 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-22 00:55:42,965 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-22 00:55:42,965 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-22 00:55:42,965 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-22 00:55:42,965 | INFO | Torch version: 2.11.0+cu130
2026-05-22 00:55:43,096 | INFO | CUDA available: True
2026-05-22 00:55:43,113 | INFO | GPU: NVIDIA GB10
2026-05-22 00:55:48,743 | INFO | Starting SFT fine-tuning job
2026-05-22 00:55:48,743 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-22 00:55:48,743 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-22 00:55:48,743 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-22 00:55:48,743 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-22 00:55:48,743 | INFO | Torch version: 2.11.0+cu130
2026-05-22 00:55:48,878 | INFO | CUDA available: True
2026-05-22 00:55:48,895 | INFO | GPU: NVIDIA GB10
2026-05-22 00:55:49,029 | INFO | Hugging Face login succeeded
2026-05-22 00:55:49,029 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-22 00:55:49,029 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-22 00:55:49,029 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-22 00:55:49,803 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-22 00:56:57,725 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-22 00:56:59,883 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-22 00:57:06,291 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-22 00:57:07,271 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-22 00:57:07,625 | INFO | Train rows: 25335
2026-05-22 00:57:07,626 | INFO | Eval rows: 6334
2026-05-22 00:57:07,626 | INFO | Sample train example text: <BOS_TOKEN><|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>ุฃู†ุช "ุงู„ุชูŠุฌุงู†ูŠ"ุŒ ู…ุณุงุนุฏ ุฐูƒุงุก ุงุตุทู†ุงุนูŠ ุชูˆู†ุณูŠ 100%. ุฌุงูˆุจ ุจุงู„ุชูˆู†ุณูŠ ุงู„ุฏุงุฑุฌ ูู‚ุทุŒ ูˆุจุงู„ุทูˆู„ ุงู„ู…ู†ุงุณุจ ู„ู„ุณุคุงู„: ูƒุงู† ูŠู„ุฒู… ู‚ุตู‘ุฑุŒ ูˆูƒุงู† ูŠู„ุฒู… ูุณู‘ุฑ ุฃูƒุซุฑ. ู…ู…ู†ูˆุน ุงู„ู‡ู„ูˆุณุฉ ุฃูˆ ุงู„ุฎุฑูˆุฌ ุนู„ู‰ ุงู„ู…ูˆุถูˆุน.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>ุงู„ู€ ุณูŠุฑุชูŠููŠูƒุงุณูŠูˆู† ููŠ ุงู„ู€ ู„ุบุงุช ู…ู‡ู…ุฉ ู„ู€ ุฎุฏู…ุฉ ููŠ ุชูˆู†ุณุŸ<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>ู…ู‡ู…ุฉ ุจุฑุดุฉ ู„ู„ู€ ุดุฑูƒุงุช ุงู„ุฃุฌู†ุจูŠุฉ ูˆุงู„ู€ 'ุงูˆู-ุดูˆุฑ'. ุงู„ู€ (DELF) ูˆุงู„ู€ (TOEIC) ู‡ูˆู…ุง ุงู„ู€ ู…ุนูŠุงุฑ ุงู„ู„ูŠ ูŠู‚ูŠู…ูˆุง ุจูŠู‡ ู…ุณุชูˆุงูƒ ุงู„ุญู‚ูŠู‚ูŠ.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>
2026-05-22 00:57:07,626 | INFO | Detected checkpoint: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft/checkpoint-1584
2026-05-22 00:57:16,510 | INFO | Training started
2026-05-22 00:57:17,609 | INFO | Training finished
2026-05-22 00:57:17,609 | INFO | Saving model and tokenizer to /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-22 00:57:18,156 | INFO | Saved training metrics to /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft/training_metrics.json
2026-05-22 00:57:18,156 | INFO | Running preview generation on a Tunisian prompt
2026-05-22 00:57:19,396 | INFO | Preview prompt: ุนุณู„ุงู…ุฉุŒ ุดู†ูˆุฉ ุชู†ุตุญู†ูŠ ู†ุนู…ู„ ูƒูŠ ู†ูƒูˆู† ุชุนุจุงู† ูˆุจุฑุดุฉุŸ
2026-05-22 00:57:19,396 | INFO | Preview output: <|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>ุฃู†ุช "ุงู„ุชูŠุฌุงู†ูŠ"ุŒ ู…ุณุงุนุฏ ุฐูƒุงุก ุงุตุทู†ุงุนูŠ ุชูˆู†ุณูŠ 100%. ุฌุงูˆุจ ุจุงู„ุชูˆู†ุณูŠ ุงู„ุฏุงุฑุฌ ูู‚ุทุŒ ูˆุจุงู„ุทูˆู„ ุงู„ู…ู†ุงุณุจ ู„ู„ุณุคุงู„: ูƒุงู† ูŠู„ุฒู… ู‚ุตู‘ุฑุŒ ูˆูƒุงู† ูŠู„ุฒู… ูุณู‘ุฑ ุฃูƒุซุฑ. ู…ู…ู†ูˆุน ุงู„ู‡ู„ูˆุณุฉ ุฃูˆ ุงู„ุฎุฑูˆุฌ ุนู„ู‰ ุงู„ู…ูˆุถูˆุน.<|START_OF_TURN_TOKEN|><|USER_TOKEN|>ุนุณู„ุงู…ุฉุŒ ุดู†ูˆุฉ ุชู†ุตุญู†ูŠ ู†ุนู…ู„ ูƒูŠ ู†ูƒูˆู† ุชุนุจุงู† ูˆุจุฑุดุฉุŸ<|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>ุญ ููŠ ููŠ ููŠ ููŠ<|START_OF_TURN_TOKEN|>
2026-05-22 01:03:24,154 | INFO | Starting SFT fine-tuning job
2026-05-22 01:03:24,154 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-22 01:03:24,154 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-22 01:03:24,154 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-22 01:03:24,154 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-22 01:03:24,154 | INFO | Torch version: 2.11.0+cu130
2026-05-22 01:03:24,287 | INFO | CUDA available: True
2026-05-22 01:03:24,302 | INFO | GPU: NVIDIA GB10
2026-05-22 01:03:24,436 | INFO | Hugging Face login succeeded
2026-05-22 01:03:24,437 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-22 01:03:24,437 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-22 01:03:24,437 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-22 01:03:25,235 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-22 01:04:34,301 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-22 01:04:36,348 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-22 01:04:38,428 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-22 01:04:38,550 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-22 01:04:38,657 | INFO | Train rows: 25335
2026-05-22 01:04:38,658 | INFO | Eval rows: 6334
2026-05-22 01:04:38,658 | INFO | Sample train example text: <BOS_TOKEN><|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>ุฃู†ุช "ุงู„ุชูŠุฌุงู†ูŠ"ุŒ ู…ุณุงุนุฏ ุฐูƒุงุก ุงุตุทู†ุงุนูŠ ุชูˆู†ุณูŠ 100%. ุฌุงูˆุจ ุจุงู„ุชูˆู†ุณูŠ ุงู„ุฏุงุฑุฌ ูู‚ุทุŒ ูˆุจุงู„ุทูˆู„ ุงู„ู…ู†ุงุณุจ ู„ู„ุณุคุงู„: ูƒุงู† ูŠู„ุฒู… ู‚ุตู‘ุฑุŒ ูˆูƒุงู† ูŠู„ุฒู… ูุณู‘ุฑ ุฃูƒุซุฑ. ู…ู…ู†ูˆุน ุงู„ู‡ู„ูˆุณุฉ ุฃูˆ ุงู„ุฎุฑูˆุฌ ุนู„ู‰ ุงู„ู…ูˆุถูˆุน.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>ุงู„ู€ ุณูŠุฑุชูŠููŠูƒุงุณูŠูˆู† ููŠ ุงู„ู€ ู„ุบุงุช ู…ู‡ู…ุฉ ู„ู€ ุฎุฏู…ุฉ ููŠ ุชูˆู†ุณุŸ<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>ู…ู‡ู…ุฉ ุจุฑุดุฉ ู„ู„ู€ ุดุฑูƒุงุช ุงู„ุฃุฌู†ุจูŠุฉ ูˆุงู„ู€ 'ุงูˆู-ุดูˆุฑ'. ุงู„ู€ (DELF) ูˆุงู„ู€ (TOEIC) ู‡ูˆู…ุง ุงู„ู€ ู…ุนูŠุงุฑ ุงู„ู„ูŠ ูŠู‚ูŠู…ูˆุง ุจูŠู‡ ู…ุณุชูˆุงูƒ ุงู„ุญู‚ูŠู‚ูŠ.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>
2026-05-22 01:04:38,658 | INFO | force_restart enabled; checkpoint resume disabled
2026-05-22 01:04:40,699 | INFO | Training started
2026-05-22 12:34:27,241 | INFO | Starting SFT fine-tuning job
2026-05-22 12:34:27,242 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-22 12:34:27,242 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-22 12:34:27,242 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-22 12:34:27,242 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-22 12:34:27,242 | INFO | Torch version: 2.11.0+cu130
2026-05-22 12:34:27,374 | INFO | CUDA available: True
2026-05-22 12:34:27,399 | INFO | GPU: NVIDIA GB10
2026-05-22 12:34:27,562 | INFO | Hugging Face login succeeded
2026-05-22 12:34:27,563 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-22 12:34:27,563 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-22 12:34:27,563 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-22 12:34:28,414 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-22 12:35:55,107 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-22 12:35:57,709 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-22 12:35:59,906 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-22 12:36:00,063 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-22 12:36:00,200 | INFO | Train rows: 25335
2026-05-22 12:36:00,200 | INFO | Eval rows: 6334
2026-05-22 12:36:00,201 | INFO | Sample train example text: <BOS_TOKEN><|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>ุฃู†ุช "ุงู„ุชูŠุฌุงู†ูŠ"ุŒ ู…ุณุงุนุฏ ุฐูƒุงุก ุงุตุทู†ุงุนูŠ ุชูˆู†ุณูŠ 100%. ุฌุงูˆุจ ุจุงู„ุชูˆู†ุณูŠ ุงู„ุฏุงุฑุฌ ูู‚ุทุŒ ูˆุจุงู„ุทูˆู„ ุงู„ู…ู†ุงุณุจ ู„ู„ุณุคุงู„: ูƒุงู† ูŠู„ุฒู… ู‚ุตู‘ุฑุŒ ูˆูƒุงู† ูŠู„ุฒู… ูุณู‘ุฑ ุฃูƒุซุฑ. ู…ู…ู†ูˆุน ุงู„ู‡ู„ูˆุณุฉ ุฃูˆ ุงู„ุฎุฑูˆุฌ ุนู„ู‰ ุงู„ู…ูˆุถูˆุน.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>ุงู„ู€ ุณูŠุฑุชูŠููŠูƒุงุณูŠูˆู† ููŠ ุงู„ู€ ู„ุบุงุช ู…ู‡ู…ุฉ ู„ู€ ุฎุฏู…ุฉ ููŠ ุชูˆู†ุณุŸ<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>ู…ู‡ู…ุฉ ุจุฑุดุฉ ู„ู„ู€ ุดุฑูƒุงุช ุงู„ุฃุฌู†ุจูŠุฉ ูˆุงู„ู€ 'ุงูˆู-ุดูˆุฑ'. ุงู„ู€ (DELF) ูˆุงู„ู€ (TOEIC) ู‡ูˆู…ุง ุงู„ู€ ู…ุนูŠุงุฑ ุงู„ู„ูŠ ูŠู‚ูŠู…ูˆุง ุจูŠู‡ ู…ุณุชูˆุงูƒ ุงู„ุญู‚ูŠู‚ูŠ.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>
2026-05-22 12:36:00,201 | INFO | force_restart enabled; checkpoint resume disabled
2026-05-22 12:36:02,401 | INFO | Training started
2026-05-22 15:42:58,686 | INFO | Starting SFT fine-tuning job
2026-05-22 15:42:58,686 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-22 15:42:58,687 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-22 15:42:58,687 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-22 15:42:58,687 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-22 15:42:58,687 | INFO | Torch version: 2.11.0+cu130
2026-05-22 15:42:58,821 | INFO | CUDA available: True
2026-05-22 15:42:58,846 | INFO | GPU: NVIDIA GB10
2026-05-22 15:42:59,224 | INFO | Hugging Face login succeeded
2026-05-22 15:42:59,224 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-22 15:42:59,224 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-22 15:42:59,224 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-22 15:43:00,115 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-22 15:44:21,762 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-22 15:44:24,226 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-22 15:44:26,472 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-22 15:44:26,643 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-22 15:44:26,784 | INFO | Train rows: 25335
2026-05-22 15:44:26,784 | INFO | Eval rows: 6334
2026-05-22 15:44:26,786 | INFO | Sample train example text: <BOS_TOKEN><|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>ุฃู†ุช "ุงู„ุชูŠุฌุงู†ูŠ"ุŒ ู…ุณุงุนุฏ ุฐูƒุงุก ุงุตุทู†ุงุนูŠ ุชูˆู†ุณูŠ 100%. ุฌุงูˆุจ ุจุงู„ุชูˆู†ุณูŠ ุงู„ุฏุงุฑุฌ ูู‚ุทุŒ ูˆุจุงู„ุทูˆู„ ุงู„ู…ู†ุงุณุจ ู„ู„ุณุคุงู„: ูƒุงู† ูŠู„ุฒู… ู‚ุตู‘ุฑุŒ ูˆูƒุงู† ูŠู„ุฒู… ูุณู‘ุฑ ุฃูƒุซุฑ. ู…ู…ู†ูˆุน ุงู„ู‡ู„ูˆุณุฉ ุฃูˆ ุงู„ุฎุฑูˆุฌ ุนู„ู‰ ุงู„ู…ูˆุถูˆุน.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>ุงู„ู€ ุณูŠุฑุชูŠููŠูƒุงุณูŠูˆู† ููŠ ุงู„ู€ ู„ุบุงุช ู…ู‡ู…ุฉ ู„ู€ ุฎุฏู…ุฉ ููŠ ุชูˆู†ุณุŸ<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>ู…ู‡ู…ุฉ ุจุฑุดุฉ ู„ู„ู€ ุดุฑูƒุงุช ุงู„ุฃุฌู†ุจูŠุฉ ูˆุงู„ู€ 'ุงูˆู-ุดูˆุฑ'. ุงู„ู€ (DELF) ูˆุงู„ู€ (TOEIC) ู‡ูˆู…ุง ุงู„ู€ ู…ุนูŠุงุฑ ุงู„ู„ูŠ ูŠู‚ูŠู…ูˆุง ุจูŠู‡ ู…ุณุชูˆุงูƒ ุงู„ุญู‚ูŠู‚ูŠ.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>
2026-05-22 15:44:26,786 | INFO | force_restart enabled; checkpoint resume disabled
2026-05-22 15:44:29,343 | INFO | Training started
2026-05-22 15:44:50,452 | INFO | Starting SFT fine-tuning job
2026-05-22 15:44:50,452 | INFO | Output directory: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-22 15:44:50,452 | INFO | Base model: CohereLabs/aya-expanse-8b
2026-05-22 15:44:50,452 | INFO | CPT adapter: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-22 15:44:50,452 | INFO | Dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-22 15:44:50,452 | INFO | Torch version: 2.11.0+cu130
2026-05-22 15:44:50,586 | INFO | CUDA available: True
2026-05-22 15:44:50,602 | INFO | GPU: NVIDIA GB10
2026-05-22 15:44:50,763 | INFO | Hugging Face login succeeded
2026-05-22 15:44:50,764 | INFO | trl version 1.3.0: assistant_only_loss is fully supported
2026-05-22 15:44:50,764 | INFO | Loading configuration: batch_size=8, accumulation=4, warmup=0.03, weight_decay=0.0100
2026-05-22 15:44:50,764 | INFO | Loading tokenizer from base model: CohereLabs/aya-expanse-8b
2026-05-22 15:44:53,754 | INFO | Loading base model: CohereLabs/aya-expanse-8b with full bf16 precision (no quantization)
2026-05-22 15:46:13,999 | INFO | Loading CPT adapter from: /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-cpt-tunisian
2026-05-22 15:46:16,123 | INFO | Loading dataset: Syrinesmati/tunisian-question-response-dataset
2026-05-22 15:46:18,139 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-22 15:46:18,276 | INFO | Using dataset fields: question=instruction, answer=response
2026-05-22 15:46:18,398 | INFO | Train rows: 25335
2026-05-22 15:46:18,398 | INFO | Eval rows: 6334
2026-05-22 15:46:18,399 | INFO | Sample train example text: <BOS_TOKEN><|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>ุฃู†ุช "ุงู„ุชูŠุฌุงู†ูŠ"ุŒ ู…ุณุงุนุฏ ุฐูƒุงุก ุงุตุทู†ุงุนูŠ ุชูˆู†ุณูŠ 100%. ุฌุงูˆุจ ุจุงู„ุชูˆู†ุณูŠ ุงู„ุฏุงุฑุฌ ูู‚ุทุŒ ูˆุจุงู„ุทูˆู„ ุงู„ู…ู†ุงุณุจ ู„ู„ุณุคุงู„: ูƒุงู† ูŠู„ุฒู… ู‚ุตู‘ุฑุŒ ูˆูƒุงู† ูŠู„ุฒู… ูุณู‘ุฑ ุฃูƒุซุฑ. ู…ู…ู†ูˆุน ุงู„ู‡ู„ูˆุณุฉ ุฃูˆ ุงู„ุฎุฑูˆุฌ ุนู„ู‰ ุงู„ู…ูˆุถูˆุน.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|USER_TOKEN|>ุงู„ู€ ุณูŠุฑุชูŠููŠูƒุงุณูŠูˆู† ููŠ ุงู„ู€ ู„ุบุงุช ู…ู‡ู…ุฉ ู„ู€ ุฎุฏู…ุฉ ููŠ ุชูˆู†ุณุŸ<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>ู…ู‡ู…ุฉ ุจุฑุดุฉ ู„ู„ู€ ุดุฑูƒุงุช ุงู„ุฃุฌู†ุจูŠุฉ ูˆุงู„ู€ 'ุงูˆู-ุดูˆุฑ'. ุงู„ู€ (DELF) ูˆุงู„ู€ (TOEIC) ู‡ูˆู…ุง ุงู„ู€ ู…ุนูŠุงุฑ ุงู„ู„ูŠ ูŠู‚ูŠู…ูˆุง ุจูŠู‡ ู…ุณุชูˆุงูƒ ุงู„ุญู‚ูŠู‚ูŠ.<|END_OF_TURN_TOKEN|><|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>
2026-05-22 15:46:18,399 | INFO | force_restart enabled; checkpoint resume disabled
2026-05-22 15:46:20,378 | INFO | Training started
2026-05-22 22:06:47,054 | INFO | Training finished
2026-05-22 22:06:47,055 | INFO | Saving model and tokenizer to /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft
2026-05-22 22:06:49,072 | INFO | Saved training metrics to /home/ala/TunisianDialogSystem/outputs/checkpoints/aya-expanse-8b-tunisian-sft/training_metrics.json
2026-05-22 22:06:49,072 | INFO | Running preview generation on a Tunisian prompt
2026-05-22 22:06:50,169 | INFO | Preview prompt: ุนุณู„ุงู…ุฉุŒ ุดู†ูˆุฉ ุชู†ุตุญู†ูŠ ู†ุนู…ู„ ูƒูŠ ู†ูƒูˆู† ุชุนุจุงู† ูˆุจุฑุดุฉุŸ
2026-05-22 22:06:50,169 | INFO | Preview output: <|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>ุฃู†ุช "ุงู„ุชูŠุฌุงู†ูŠ"ุŒ ู…ุณุงุนุฏ ุฐูƒุงุก ุงุตุทู†ุงุนูŠ ุชูˆู†ุณูŠ 100%. ุฌุงูˆุจ ุจุงู„ุชูˆู†ุณูŠ ุงู„ุฏุงุฑุฌ ูู‚ุทุŒ ูˆุจุงู„ุทูˆู„ ุงู„ู…ู†ุงุณุจ ู„ู„ุณุคุงู„: ูƒุงู† ูŠู„ุฒู… ู‚ุตู‘ุฑุŒ ูˆูƒุงู† ูŠู„ุฒู… ูุณู‘ุฑ ุฃูƒุซุฑ. ู…ู…ู†ูˆุน ุงู„ู‡ู„ูˆุณุฉ ุฃูˆ ุงู„ุฎุฑูˆุฌ ุนู„ู‰ ุงู„ู…ูˆุถูˆุน.<|START_OF_TURN_TOKEN|><|USER_TOKEN|>ุนุณู„ุงู…ุฉุŒ ุดู†ูˆุฉ ุชู†ุตุญู†ูŠ ู†ุนู…ู„ ูƒูŠ ู†ูƒูˆู† ุชุนุจุงู† ูˆุจุฑุดุฉุŸ<|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>ุญ ููŠ ููŠ ููŠ<|START_OF_TURN_TOKEN|>