[2025-09-29 16:16:56,308] [INFO] [axolotl.utils.data.sft._load_raw_datasets:320] [PID:23243] Loading raw datasets...
[2025-09-29 16:16:56,541] [INFO] [axolotl.utils.data.wrappers.get_dataset_wrapper:87] [PID:23243] Loading dataset: /workspace/outputs/training_data/ with base_type: chat_template and prompt_style: None
Dropping Long Sequences (>1024) (num_proc=192): 0%| | 0/1918 [00:00<?, ? examples/s]
Dropping Long Sequences (>1024) (num_proc=192): 1%|▏ | 10/1918 [00:02<08:04, 3.94 examples/s]
Dropping Long Sequences (>1024) (num_proc=192): 4%|▉ | 70/1918 [00:02<00:52, 35.45 examples/s]
Dropping Long Sequences (>1024) (num_proc=192): 8%|█▊ | 150/1918 [00:02<00:20, 85.95 examples/s]
Dropping Long Sequences (>1024) (num_proc=192): 10%|██▎ | 200/1918 [00:02<00:14, 122.09 examples/s]
Dropping Long Sequences (>1024) (num_proc=192): 14%|██▉ | 260/1918 [00:03<00:09, 174.73 examples/s]
Dropping Long Sequences (>1024) (num_proc=192): 17%|███▋ | 320/1918 [00:03<00:06, 231.88 examples/s]
Dropping Long Sequences (>1024) (num_proc=192): 20%|████▎ | 380/1918 [00:03<00:05, 290.99 examples/s]
Dropping Long Sequences (>1024) (num_proc=192): 23%|█████▏ | 450/1918 [00:03<00:04, 357.52 examples/s]
Dropping Long Sequences (>1024) (num_proc=192): 27%|█████▉ | 520/1918 [00:03<00:03, 417.39 examples/s]
Dropping Long Sequences (>1024) (num_proc=192): 30%|██████▋ | 580/1918 [00:03<00:02, 446.62 examples/s]
Dropping Long Sequences (>1024) (num_proc=192): 34%|███████▌ | 660/1918 [00:03<00:02, 515.42 examples/s]
Dropping Long Sequences (>1024) (num_proc=192): 38%|████████▎ | 730/1918 [00:03<00:02, 556.77 examples/s]
Dropping Long Sequences (>1024) (num_proc=192): 42%|█████████▏ | 800/1918 [00:03<00:01, 570.35 examples/s]
Dropping Long Sequences (>1024) (num_proc=192): 46%|██████████ | 880/1918 [00:04<00:01, 614.25 examples/s]
Dropping Long Sequences (>1024) (num_proc=192): 83%|████████████████▌ | 1590/1918 [00:04<00:00, 2349.82 examples/s]
Dropping Long Sequences (>1024) (num_proc=192): 100%|█████████████████████| 1918/1918 [00:04<00:00, 410.73 examples/s]
Drop Samples with Zero Trainable Tokens (num_proc=192): 0%| | 0/1918 [00:00