| 2026-06-11 06:29:54,615 | T4/P100 (Tesla T4 sm_75): Flash unavailable, using chunked attn |
| 2026-06-11 06:29:54,875 | HTTP Request: GET https://huggingface.co/api/whoami-v2 "HTTP/1.1 200 OK" |
| 2026-06-11 06:29:54,952 | HTTP Request: HEAD https://huggingface.co/GODELEV/TOK-4K/resolve/main/config.json "HTTP/1.1 404 Not Found" |
| 2026-06-11 06:29:55,021 | HTTP Request: HEAD https://huggingface.co/GODELEV/TOK-4K/resolve/main/config.json "HTTP/1.1 404 Not Found" |
| 2026-06-11 06:29:55,095 | HTTP Request: HEAD https://huggingface.co/GODELEV/TOK-4K/resolve/main/tokenizer_config.json "HTTP/1.1 307 Temporary Redirect" |
| 2026-06-11 06:29:55,111 | HTTP Request: HEAD https://huggingface.co/api/resolve-cache/models/GODELEV/TOK-4K/0a93937fbb072e0b839a0ae1902127e0d22b872f/tokenizer_config.json "HTTP/1.1 200 OK" |
| 2026-06-11 06:29:55,129 | HTTP Request: GET https://huggingface.co/api/resolve-cache/models/GODELEV/TOK-4K/0a93937fbb072e0b839a0ae1902127e0d22b872f/tokenizer_config.json "HTTP/1.1 200 OK" |
| 2026-06-11 06:29:55,213 | HTTP Request: GET https://huggingface.co/api/models/GODELEV/TOK-4K/tree/main/additional_chat_templates?recursive=false&expand=false "HTTP/1.1 404 Not Found" |
| 2026-06-11 06:29:55,276 | HTTP Request: GET https://huggingface.co/api/models/GODELEV/TOK-4K/tree/main?recursive=true&expand=false "HTTP/1.1 200 OK" |
| 2026-06-11 06:29:55,344 | HTTP Request: HEAD https://huggingface.co/GODELEV/TOK-4K/resolve/main/tokenizer.json "HTTP/1.1 307 Temporary Redirect" |
| 2026-06-11 06:29:55,361 | HTTP Request: HEAD https://huggingface.co/api/resolve-cache/models/GODELEV/TOK-4K/0a93937fbb072e0b839a0ae1902127e0d22b872f/tokenizer.json "HTTP/1.1 200 OK" |
| 2026-06-11 06:29:55,381 | HTTP Request: GET https://huggingface.co/api/resolve-cache/models/GODELEV/TOK-4K/0a93937fbb072e0b839a0ae1902127e0d22b872f/tokenizer.json "HTTP/1.1 200 OK" |
| 2026-06-11 06:29:55,461 | HTTP Request: HEAD https://huggingface.co/GODELEV/TOK-4K/resolve/main/tokenizer.model "HTTP/1.1 404 Not Found" |
| 2026-06-11 06:29:55,541 | HTTP Request: HEAD https://huggingface.co/GODELEV/TOK-4K/resolve/main/added_tokens.json "HTTP/1.1 404 Not Found" |
| 2026-06-11 06:29:55,610 | HTTP Request: HEAD https://huggingface.co/GODELEV/TOK-4K/resolve/main/special_tokens_map.json "HTTP/1.1 307 Temporary Redirect" |
| 2026-06-11 06:29:55,626 | HTTP Request: HEAD https://huggingface.co/api/resolve-cache/models/GODELEV/TOK-4K/0a93937fbb072e0b839a0ae1902127e0d22b872f/special_tokens_map.json "HTTP/1.1 200 OK" |
| 2026-06-11 06:29:55,644 | HTTP Request: GET https://huggingface.co/api/resolve-cache/models/GODELEV/TOK-4K/0a93937fbb072e0b839a0ae1902127e0d22b872f/special_tokens_map.json "HTTP/1.1 200 OK" |
| 2026-06-11 06:29:55,723 | HTTP Request: HEAD https://huggingface.co/GODELEV/TOK-4K/resolve/main/chat_template.jinja "HTTP/1.1 404 Not Found" |
| 2026-06-11 06:29:55,838 | HTTP Request: HEAD https://huggingface.co/datasets/GODELEV/Ant-5M-V2-T/resolve/main/README.md "HTTP/1.1 404 Not Found" |
| 2026-06-11 06:29:56,059 | HTTP Request: GET https://huggingface.co/api/datasets/GODELEV/Ant-5M-V2-T "HTTP/1.1 200 OK" |
| 2026-06-11 06:29:56,127 | HTTP Request: HEAD https://huggingface.co/datasets/GODELEV/Ant-5M-V2-T/resolve/bbdd76ffdd523212181b29971d8c6e834aa1c5ea/Ant-5M-V2-T.py "HTTP/1.1 404 Not Found" |
| 2026-06-11 06:29:56,250 | HTTP Request: HEAD https://s3.amazonaws.com/datasets.huggingface.co/datasets/datasets/GODELEV/Ant-5M-V2-T/GODELEV/Ant-5M-V2-T.py "HTTP/1.1 404 Not Found" |
| 2026-06-11 06:29:56,320 | HTTP Request: HEAD https://huggingface.co/datasets/GODELEV/Ant-5M-V2-T/resolve/bbdd76ffdd523212181b29971d8c6e834aa1c5ea/README.md "HTTP/1.1 404 Not Found" |
| 2026-06-11 06:29:56,387 | HTTP Request: GET https://huggingface.co/api/datasets/GODELEV/Ant-5M-V2-T/revision/bbdd76ffdd523212181b29971d8c6e834aa1c5ea "HTTP/1.1 200 OK" |
| 2026-06-11 06:29:56,452 | HTTP Request: HEAD https://huggingface.co/datasets/GODELEV/Ant-5M-V2-T/resolve/bbdd76ffdd523212181b29971d8c6e834aa1c5ea/.huggingface.yaml "HTTP/1.1 404 Not Found" |
| 2026-06-11 06:29:56,609 | HTTP Request: GET https://datasets-server.huggingface.co/info?dataset=GODELEV/Ant-5M-V2-T "HTTP/1.1 200 OK" |
| 2026-06-11 06:29:56,709 | HTTP Request: GET https://huggingface.co/api/datasets/GODELEV/Ant-5M-V2-T/tree/bbdd76ffdd523212181b29971d8c6e834aa1c5ea/data?recursive=true&expand=false "HTTP/1.1 200 OK" |
| 2026-06-11 06:29:56,787 | HTTP Request: GET https://huggingface.co/api/datasets/GODELEV/Ant-5M-V2-T/tree/bbdd76ffdd523212181b29971d8c6e834aa1c5ea?recursive=false&expand=false "HTTP/1.1 200 OK" |
| 2026-06-11 06:29:56,859 | HTTP Request: HEAD https://huggingface.co/datasets/GODELEV/Ant-5M-V2-T/resolve/bbdd76ffdd523212181b29971d8c6e834aa1c5ea/dataset_infos.json "HTTP/1.1 404 Not Found" |
| 2026-06-11 06:29:56,970 | HTTP Request: HEAD https://huggingface.co/datasets/GODELEV/Ant-5M-V2-T/resolve/bbdd76ffdd523212181b29971d8c6e834aa1c5ea/data/train-00000-of-00006.parquet "HTTP/1.1 302 Found" |
| 2026-06-11 06:29:57,076 | HTTP Request: GET https://huggingface.co/api/datasets/GODELEV/Ant-5M-V2-T/xet-read-token/bbdd76ffdd523212181b29971d8c6e834aa1c5ea "HTTP/1.1 200 OK" |
| 2026-06-11 06:30:00,130 | HTTP Request: HEAD https://huggingface.co/datasets/GODELEV/Ant-5M-V2-T/resolve/bbdd76ffdd523212181b29971d8c6e834aa1c5ea/data/train-00001-of-00006.parquet "HTTP/1.1 302 Found" |
| 2026-06-11 06:30:04,826 | HTTP Request: HEAD https://huggingface.co/datasets/GODELEV/Ant-5M-V2-T/resolve/bbdd76ffdd523212181b29971d8c6e834aa1c5ea/data/train-00002-of-00006.parquet "HTTP/1.1 302 Found" |
| 2026-06-11 06:30:07,920 | HTTP Request: HEAD https://huggingface.co/datasets/GODELEV/Ant-5M-V2-T/resolve/bbdd76ffdd523212181b29971d8c6e834aa1c5ea/data/train-00003-of-00006.parquet "HTTP/1.1 302 Found" |
| 2026-06-11 06:30:12,862 | HTTP Request: HEAD https://huggingface.co/datasets/GODELEV/Ant-5M-V2-T/resolve/bbdd76ffdd523212181b29971d8c6e834aa1c5ea/data/train-00004-of-00006.parquet "HTTP/1.1 302 Found" |
| 2026-06-11 06:30:15,761 | HTTP Request: HEAD https://huggingface.co/datasets/GODELEV/Ant-5M-V2-T/resolve/bbdd76ffdd523212181b29971d8c6e834aa1c5ea/data/train-00005-of-00006.parquet "HTTP/1.1 302 Found" |
| 2026-06-11 06:30:19,862 | HTTP Request: HEAD https://huggingface.co/datasets/GODELEV/Ant-5M-V2-T/resolve/bbdd76ffdd523212181b29971d8c6e834aa1c5ea/data/val-00000-of-00001.parquet "HTTP/1.1 302 Found" |
| 2026-06-11 06:31:27,680 | Data loaded train=2,910,746 val=20,000 steps_per_epoch=1,264 total_epochs=1.00 |
| 2026-06-11 06:31:31,643 | Model 9.902M params | dtype=float32 | amp=torch.float16 |
| 2026-06-11 06:31:31,877 | HTTP Request: HEAD https://huggingface.co/GODELEV/Experimenting/resolve/main/resume/latest_step.txt "HTTP/1.1 307 Temporary Redirect" |
| 2026-06-11 06:31:32,027 | HTTP Request: HEAD https://huggingface.co/api/resolve-cache/models/GODELEV/Experimenting/73dbdd890e76f238232c2d5fed8e4927a5b510f4/resume%2Flatest_step.txt "HTTP/1.1 200 OK" |
| 2026-06-11 06:31:32,131 | HTTP Request: GET https://huggingface.co/api/resolve-cache/models/GODELEV/Experimenting/73dbdd890e76f238232c2d5fed8e4927a5b510f4/resume%2Flatest_step.txt "HTTP/1.1 200 OK" |
| 2026-06-11 06:31:32,217 | HTTP Request: HEAD https://huggingface.co/GODELEV/Experimenting/resolve/main/resume/ckpt.pt "HTTP/1.1 302 Found" |
| 2026-06-11 06:31:32,288 | HTTP Request: GET https://huggingface.co/api/models/GODELEV/Experimenting/xet-read-token/73dbdd890e76f238232c2d5fed8e4927a5b510f4 "HTTP/1.1 200 OK" |
| 2026-06-11 06:31:39,439 | Resumed step=60 tokens=141419520 samples=138240 |
| 2026-06-11 06:42:05,961 | step= 80 | epoch=0.06 | loss=5.0837 | ppl=161.38 | lr=2.47e-04 | grad=2.096 | tok/s=300,996 | ETA=9:43:51 | VRAM=0.6GB | RAM=16% |
| 2026-06-11 06:52:00,604 | step= 100 | epoch=0.08 | loss=4.7250 | ppl=112.73 | lr=3.09e-04 | grad=1.062 | tok/s=193,023 | ETA=9:41:58 | VRAM=0.6GB | RAM=16% |
| 2026-06-11 07:02:01,626 | step= 120 | epoch=0.09 | loss=4.5171 | ppl=91.57 | lr=3.72e-04 | grad=0.569 | tok/s=155,225 | ETA=9:33:06 | VRAM=0.6GB | RAM=16% |
| 2026-06-11 07:12:03,556 | step= 140 | epoch=0.11 | loss=4.3554 | ppl=77.90 | lr=4.00e-04 | grad=1.029 | tok/s=136,127 | ETA=9:22:35 | VRAM=0.6GB | RAM=16% |
| 2026-06-11 07:22:04,493 | step= 160 | epoch=0.13 | loss=4.1771 | ppl=65.18 | lr=3.99e-04 | grad=0.967 | tok/s=124,668 | ETA=9:12:52 | VRAM=0.6GB | RAM=16% |
| 2026-06-11 07:32:07,371 | step= 180 | epoch=0.14 | loss=4.0030 | ppl=54.76 | lr=3.98e-04 | grad=1.533 | tok/s=116,944 | ETA=9:08:54 | VRAM=0.6GB | RAM=16% |
| 2026-06-11 07:42:11,025 | step= 200 | epoch=0.16 | loss=3.8110 | ppl=45.20 | lr=3.97e-04 | grad=0.806 | tok/s=111,402 | ETA=8:59:30 | VRAM=0.6GB | RAM=16% |
| 2026-06-11 07:42:55,395 | VAL step=200 epoch=0.16 loss=3.8175 ppl=45.49 BEST |
| 2026-06-11 07:42:55,474 | Saved safetensors: 110 tensors (tied=True, embed_key=model.embed_tokens.weight, lm_head omitted — HF will tie) |
| 2026-06-11 07:42:55,791 | HTTP Request: POST https://huggingface.co/api/repos/create "HTTP/1.1 409 Conflict" |
| 2026-06-11 07:42:55,853 | HTTP Request: POST https://huggingface.co/api/validate-yaml "HTTP/1.1 200 OK" |
| 2026-06-11 07:42:56,387 | HTTP Request: POST https://huggingface.co/api/validate-yaml "HTTP/1.1 200 OK" |
| 2026-06-11 07:42:56,528 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/preupload/main "HTTP/1.1 200 OK" |
| 2026-06-11 07:42:56,641 | HTTP Request: POST https://huggingface.co/GODELEV/Experimenting.git/info/lfs/objects/batch "HTTP/1.1 200 OK" |
| 2026-06-11 07:42:56,710 | HTTP Request: GET https://huggingface.co/api/models/GODELEV/Experimenting/xet-write-token/main "HTTP/1.1 200 OK" |
| 2026-06-11 07:43:04,316 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/commit/main "HTTP/1.1 200 OK" |
| 2026-06-11 07:43:04,319 | Hub push step=200 |
| 2026-06-11 07:52:59,184 | step= 220 | epoch=0.17 | loss=3.6513 | ppl=38.53 | lr=3.94e-04 | grad=1.093 | tok/s=106,265 | ETA=8:49:23 | VRAM=0.6GB | RAM=18% |
| 2026-06-11 08:03:07,634 | step= 240 | epoch=0.19 | loss=3.5250 | ppl=33.95 | lr=3.92e-04 | grad=1.079 | tok/s=103,073 | ETA=8:36:29 | VRAM=0.6GB | RAM=18% |
| 2026-06-11 08:13:14,936 | step= 260 | epoch=0.21 | loss=3.4222 | ppl=30.64 | lr=3.88e-04 | grad=1.176 | tok/s=100,537 | ETA=8:24:10 | VRAM=0.6GB | RAM=18% |
| 2026-06-11 08:23:20,546 | step= 280 | epoch=0.22 | loss=3.3389 | ppl=28.19 | lr=3.85e-04 | grad=1.165 | tok/s=98,486 | ETA=8:13:04 | VRAM=0.6GB | RAM=18% |
| 2026-06-11 08:33:27,224 | step= 300 | epoch=0.24 | loss=3.2471 | ppl=25.71 | lr=3.80e-04 | grad=1.034 | tok/s=96,760 | ETA=8:09:26 | VRAM=0.6GB | RAM=18% |
| 2026-06-11 08:43:33,425 | step= 320 | epoch=0.25 | loss=3.1869 | ppl=24.21 | lr=3.75e-04 | grad=1.081 | tok/s=95,305 | ETA=8:00:20 | VRAM=0.6GB | RAM=18% |
| 2026-06-11 08:53:41,161 | step= 340 | epoch=0.27 | loss=3.1030 | ppl=22.26 | lr=3.70e-04 | grad=1.101 | tok/s=94,040 | ETA=7:46:57 | VRAM=0.6GB | RAM=18% |
| 2026-06-11 09:03:50,023 | step= 360 | epoch=0.28 | loss=3.0533 | ppl=21.19 | lr=3.65e-04 | grad=1.082 | tok/s=92,932 | ETA=7:40:25 | VRAM=0.6GB | RAM=18% |
| 2026-06-11 09:13:59,503 | step= 380 | epoch=0.30 | loss=3.0083 | ppl=20.25 | lr=3.58e-04 | grad=0.962 | tok/s=91,957 | ETA=7:26:44 | VRAM=0.6GB | RAM=18% |
| 2026-06-11 09:24:06,735 | step= 400 | epoch=0.32 | loss=2.9510 | ppl=19.12 | lr=3.52e-04 | grad=0.867 | tok/s=91,116 | ETA=7:19:46 | VRAM=0.6GB | RAM=18% |
| 2026-06-11 09:24:17,996 | VAL step=400 epoch=0.32 loss=2.9669 ppl=19.43 BEST |
| 2026-06-11 09:24:18,177 | Checkpoint saved: step 400 |
| 2026-06-11 09:24:18,234 | Saved safetensors: 110 tensors (tied=True, embed_key=model.embed_tokens.weight, lm_head omitted — HF will tie) |
| 2026-06-11 09:24:18,610 | HTTP Request: POST https://huggingface.co/api/repos/create "HTTP/1.1 409 Conflict" |
| 2026-06-11 09:24:18,674 | HTTP Request: POST https://huggingface.co/api/validate-yaml "HTTP/1.1 200 OK" |
| 2026-06-11 09:24:19,160 | HTTP Request: POST https://huggingface.co/api/validate-yaml "HTTP/1.1 200 OK" |
| 2026-06-11 09:24:19,329 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/preupload/main "HTTP/1.1 200 OK" |
| 2026-06-11 09:24:19,407 | HTTP Request: POST https://huggingface.co/GODELEV/Experimenting.git/info/lfs/objects/batch "HTTP/1.1 200 OK" |
| 2026-06-11 09:24:19,468 | HTTP Request: GET https://huggingface.co/api/models/GODELEV/Experimenting/xet-write-token/main "HTTP/1.1 200 OK" |
| 2026-06-11 09:24:25,139 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/commit/main "HTTP/1.1 200 OK" |
| 2026-06-11 09:24:25,141 | Hub push step=400 |
| 2026-06-11 09:34:29,791 | step= 420 | epoch=0.33 | loss=2.9260 | ppl=18.65 | lr=3.45e-04 | grad=1.256 | tok/s=90,238 | ETA=7:08:34 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 09:44:37,407 | step= 440 | epoch=0.35 | loss=2.8942 | ppl=18.07 | lr=3.37e-04 | grad=1.043 | tok/s=89,574 | ETA=6:58:37 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 09:54:45,798 | step= 460 | epoch=0.36 | loss=2.8546 | ppl=17.37 | lr=3.30e-04 | grad=1.132 | tok/s=88,970 | ETA=6:49:23 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 10:04:56,517 | step= 480 | epoch=0.38 | loss=2.8238 | ppl=16.84 | lr=3.22e-04 | grad=1.166 | tok/s=88,408 | ETA=6:37:36 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 10:15:06,029 | step= 500 | epoch=0.40 | loss=2.8007 | ppl=16.46 | lr=3.13e-04 | grad=0.940 | tok/s=87,905 | ETA=6:28:57 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 10:25:12,965 | step= 520 | epoch=0.41 | loss=2.7780 | ppl=16.09 | lr=3.05e-04 | grad=1.025 | tok/s=87,461 | ETA=6:17:09 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 10:35:19,899 | step= 540 | epoch=0.43 | loss=2.7576 | ppl=15.76 | lr=2.96e-04 | grad=0.722 | tok/s=87,055 | ETA=6:06:13 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 10:45:24,846 | step= 560 | epoch=0.44 | loss=2.7592 | ppl=15.79 | lr=2.87e-04 | grad=1.001 | tok/s=86,692 | ETA=5:56:15 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 10:55:31,823 | step= 580 | epoch=0.46 | loss=2.7237 | ppl=15.24 | lr=2.77e-04 | grad=0.863 | tok/s=86,346 | ETA=5:48:34 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 11:05:44,507 | step= 600 | epoch=0.47 | loss=2.7174 | ppl=15.14 | lr=2.68e-04 | grad=1.031 | tok/s=85,995 | ETA=5:39:35 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 11:05:55,800 | VAL step=600 epoch=0.47 loss=2.7217 ppl=15.21 BEST |
| 2026-06-11 11:05:55,855 | Saved safetensors: 110 tensors (tied=True, embed_key=model.embed_tokens.weight, lm_head omitted — HF will tie) |
| 2026-06-11 11:05:56,230 | HTTP Request: POST https://huggingface.co/api/repos/create "HTTP/1.1 409 Conflict" |
| 2026-06-11 11:05:56,288 | HTTP Request: POST https://huggingface.co/api/validate-yaml "HTTP/1.1 200 OK" |
| 2026-06-11 11:05:56,771 | HTTP Request: POST https://huggingface.co/api/validate-yaml "HTTP/1.1 200 OK" |
| 2026-06-11 11:05:56,875 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/preupload/main "HTTP/1.1 200 OK" |
| 2026-06-11 11:05:56,950 | HTTP Request: POST https://huggingface.co/GODELEV/Experimenting.git/info/lfs/objects/batch "HTTP/1.1 200 OK" |
| 2026-06-11 11:05:57,011 | HTTP Request: GET https://huggingface.co/api/models/GODELEV/Experimenting/xet-write-token/main "HTTP/1.1 200 OK" |
| 2026-06-11 11:06:02,661 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/commit/main "HTTP/1.1 200 OK" |
| 2026-06-11 11:06:02,663 | Hub push step=600 |
| 2026-06-11 11:16:06,339 | step= 620 | epoch=0.49 | loss=2.7078 | ppl=15.00 | lr=2.58e-04 | grad=0.951 | tok/s=85,624 | ETA=5:24:04 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 11:26:11,866 | step= 640 | epoch=0.51 | loss=2.6989 | ppl=14.86 | lr=2.48e-04 | grad=1.173 | tok/s=85,358 | ETA=5:13:28 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 11:36:17,585 | step= 660 | epoch=0.52 | loss=2.6667 | ppl=14.39 | lr=2.38e-04 | grad=0.870 | tok/s=85,108 | ETA=5:06:39 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 11:46:22,399 | step= 680 | epoch=0.54 | loss=2.6486 | ppl=14.13 | lr=2.28e-04 | grad=0.638 | tok/s=84,879 | ETA=4:52:39 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 11:56:27,607 | step= 700 | epoch=0.55 | loss=2.6595 | ppl=14.29 | lr=2.19e-04 | grad=0.835 | tok/s=84,662 | ETA=4:43:15 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 12:06:33,643 | step= 720 | epoch=0.57 | loss=2.6418 | ppl=14.04 | lr=2.09e-04 | grad=0.751 | tok/s=84,454 | ETA=4:35:23 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 12:16:39,113 | step= 740 | epoch=0.59 | loss=2.6338 | ppl=13.93 | lr=1.99e-04 | grad=0.775 | tok/s=84,261 | ETA=4:25:31 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 12:26:45,316 | step= 760 | epoch=0.60 | loss=2.6075 | ppl=13.57 | lr=1.89e-04 | grad=0.730 | tok/s=84,076 | ETA=4:12:41 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 12:36:51,989 | step= 780 | epoch=0.62 | loss=2.6104 | ppl=13.60 | lr=1.79e-04 | grad=0.578 | tok/s=83,900 | ETA=4:05:36 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 12:46:59,512 | step= 800 | epoch=0.63 | loss=2.6066 | ppl=13.55 | lr=1.69e-04 | grad=0.807 | tok/s=83,730 | ETA=3:52:38 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 12:47:10,645 | VAL step=800 epoch=0.63 loss=2.6120 ppl=13.63 BEST |
| 2026-06-11 12:47:10,814 | Checkpoint saved: step 800 |
| 2026-06-11 12:47:10,869 | Saved safetensors: 110 tensors (tied=True, embed_key=model.embed_tokens.weight, lm_head omitted — HF will tie) |
| 2026-06-11 12:47:11,244 | HTTP Request: POST https://huggingface.co/api/repos/create "HTTP/1.1 409 Conflict" |
| 2026-06-11 12:47:11,308 | HTTP Request: POST https://huggingface.co/api/validate-yaml "HTTP/1.1 200 OK" |
| 2026-06-11 12:47:11,801 | HTTP Request: POST https://huggingface.co/api/validate-yaml "HTTP/1.1 200 OK" |
| 2026-06-11 12:47:11,911 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/preupload/main "HTTP/1.1 200 OK" |
| 2026-06-11 12:47:12,016 | HTTP Request: POST https://huggingface.co/GODELEV/Experimenting.git/info/lfs/objects/batch "HTTP/1.1 200 OK" |
| 2026-06-11 12:47:12,079 | HTTP Request: GET https://huggingface.co/api/models/GODELEV/Experimenting/xet-write-token/main "HTTP/1.1 200 OK" |
| 2026-06-11 12:47:18,595 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/commit/main "HTTP/1.1 200 OK" |
| 2026-06-11 12:47:18,598 | Hub push step=800 |
| 2026-06-11 12:57:22,752 | step= 820 | epoch=0.65 | loss=2.6016 | ppl=13.48 | lr=1.60e-04 | grad=0.774 | tok/s=83,512 | ETA=3:43:39 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 13:07:29,946 | step= 840 | epoch=0.66 | loss=2.5871 | ppl=13.29 | lr=1.51e-04 | grad=0.765 | tok/s=83,362 | ETA=3:38:05 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 13:17:34,836 | step= 860 | epoch=0.68 | loss=2.5837 | ppl=13.25 | lr=1.42e-04 | grad=0.741 | tok/s=83,227 | ETA=3:22:39 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 13:27:37,756 | step= 880 | epoch=0.70 | loss=2.5803 | ppl=13.20 | lr=1.33e-04 | grad=0.883 | tok/s=83,105 | ETA=3:12:57 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 13:37:44,484 | step= 900 | epoch=0.71 | loss=2.5702 | ppl=13.07 | lr=1.24e-04 | grad=0.568 | tok/s=82,977 | ETA=3:02:22 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 13:47:50,604 | step= 920 | epoch=0.73 | loss=2.5767 | ppl=13.15 | lr=1.16e-04 | grad=0.533 | tok/s=82,856 | ETA=2:53:22 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 13:57:58,661 | step= 940 | epoch=0.74 | loss=2.5578 | ppl=12.91 | lr=1.08e-04 | grad=0.750 | tok/s=82,735 | ETA=2:44:59 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 14:08:11,735 | step= 960 | epoch=0.76 | loss=2.5631 | ppl=12.98 | lr=1.00e-04 | grad=0.541 | tok/s=82,604 | ETA=2:33:55 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 14:18:17,443 | step= 980 | epoch=0.78 | loss=2.5458 | ppl=12.75 | lr=9.31e-05 | grad=0.588 | tok/s=82,501 | ETA=2:23:23 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 14:28:22,853 | step= 1000 | epoch=0.79 | loss=2.5536 | ppl=12.85 | lr=8.62e-05 | grad=0.608 | tok/s=82,403 | ETA=2:12:14 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 14:28:33,984 | VAL step=1000 epoch=0.79 loss=2.5559 ppl=12.88 BEST |
| 2026-06-11 14:28:34,058 | Saved safetensors: 110 tensors (tied=True, embed_key=model.embed_tokens.weight, lm_head omitted — HF will tie) |
| 2026-06-11 14:28:34,436 | HTTP Request: POST https://huggingface.co/api/repos/create "HTTP/1.1 409 Conflict" |
| 2026-06-11 14:28:34,497 | HTTP Request: POST https://huggingface.co/api/validate-yaml "HTTP/1.1 200 OK" |
| 2026-06-11 14:28:35,006 | HTTP Request: POST https://huggingface.co/api/validate-yaml "HTTP/1.1 200 OK" |
| 2026-06-11 14:28:35,122 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/preupload/main "HTTP/1.1 200 OK" |
| 2026-06-11 14:28:35,213 | HTTP Request: POST https://huggingface.co/GODELEV/Experimenting.git/info/lfs/objects/batch "HTTP/1.1 200 OK" |
| 2026-06-11 14:28:35,276 | HTTP Request: GET https://huggingface.co/api/models/GODELEV/Experimenting/xet-write-token/main "HTTP/1.1 200 OK" |
| 2026-06-11 14:28:41,949 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/commit/main "HTTP/1.1 200 OK" |
| 2026-06-11 14:28:41,952 | Hub push step=1000 |
| 2026-06-11 14:38:45,331 | step= 1020 | epoch=0.81 | loss=2.3078 | ppl=10.05 | lr=7.98e-05 | grad=1.918 | tok/s=82,261 | ETA=2:03:07 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 14:48:51,304 | step= 1040 | epoch=0.82 | loss=2.2168 | ppl=9.18 | lr=7.37e-05 | grad=0.789 | tok/s=82,170 | ETA=1:53:16 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 14:59:00,762 | step= 1060 | epoch=0.84 | loss=2.5697 | ppl=13.06 | lr=6.82e-05 | grad=0.509 | tok/s=82,073 | ETA=1:44:28 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 15:09:09,586 | step= 1080 | epoch=0.85 | loss=2.5579 | ppl=12.91 | lr=6.30e-05 | grad=0.467 | tok/s=81,982 | ETA=1:33:40 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 15:19:16,212 | step= 1100 | epoch=0.87 | loss=2.5473 | ppl=12.77 | lr=5.84e-05 | grad=0.446 | tok/s=81,900 | ETA=1:23:24 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 15:29:22,702 | step= 1120 | epoch=0.89 | loss=2.5299 | ppl=12.55 | lr=5.43e-05 | grad=0.433 | tok/s=81,822 | ETA=1:12:55 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 15:39:30,420 | step= 1140 | epoch=0.90 | loss=2.5377 | ppl=12.65 | lr=5.06e-05 | grad=0.392 | tok/s=81,743 | ETA=1:02:07 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 15:49:36,534 | step= 1160 | epoch=0.92 | loss=2.5377 | ppl=12.65 | lr=4.75e-05 | grad=0.429 | tok/s=81,671 | ETA=0:52:05 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 15:59:42,638 | step= 1180 | epoch=0.93 | loss=2.5383 | ppl=12.66 | lr=4.50e-05 | grad=0.472 | tok/s=81,602 | ETA=0:42:47 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 16:09:54,897 | step= 1200 | epoch=0.95 | loss=2.5304 | ppl=12.56 | lr=4.29e-05 | grad=0.319 | tok/s=81,521 | ETA=0:32:44 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 16:10:06,127 | VAL step=1200 epoch=0.95 loss=2.5314 ppl=12.57 BEST |
| 2026-06-11 16:10:06,313 | Checkpoint saved: step 1200 |
| 2026-06-11 16:10:06,379 | Saved safetensors: 110 tensors (tied=True, embed_key=model.embed_tokens.weight, lm_head omitted — HF will tie) |
| 2026-06-11 16:10:06,753 | HTTP Request: POST https://huggingface.co/api/repos/create "HTTP/1.1 409 Conflict" |
| 2026-06-11 16:10:06,814 | HTTP Request: POST https://huggingface.co/api/validate-yaml "HTTP/1.1 200 OK" |
| 2026-06-11 16:10:07,303 | HTTP Request: POST https://huggingface.co/api/validate-yaml "HTTP/1.1 200 OK" |
| 2026-06-11 16:10:07,409 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/preupload/main "HTTP/1.1 200 OK" |
| 2026-06-11 16:10:07,490 | HTTP Request: POST https://huggingface.co/GODELEV/Experimenting.git/info/lfs/objects/batch "HTTP/1.1 200 OK" |
| 2026-06-11 16:10:07,553 | HTTP Request: GET https://huggingface.co/api/models/GODELEV/Experimenting/xet-write-token/main "HTTP/1.1 200 OK" |
| 2026-06-11 16:10:16,698 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/commit/main "HTTP/1.1 200 OK" |
| 2026-06-11 16:10:16,701 | Hub push step=1200 |
| 2026-06-11 16:20:20,220 | step= 1220 | epoch=0.97 | loss=2.5229 | ppl=12.47 | lr=4.14e-05 | grad=0.402 | tok/s=81,412 | ETA=0:22:03 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 16:30:28,308 | step= 1240 | epoch=0.98 | loss=2.5312 | ppl=12.57 | lr=4.04e-05 | grad=0.388 | tok/s=81,346 | ETA=0:12:11 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 16:40:34,691 | step= 1260 | epoch=1.00 | loss=2.5229 | ppl=12.46 | lr=4.00e-05 | grad=0.403 | tok/s=81,286 | ETA=0:02:00 | VRAM=0.6GB | RAM=19% |
| 2026-06-11 16:46:41,058 | Final eval loss=2.5222 ppl=12.46 |
| 2026-06-11 16:46:41,255 | Checkpoint saved: step 1264 |
| 2026-06-11 16:46:41,311 | Saved safetensors: 110 tensors (tied=True, embed_key=model.embed_tokens.weight, lm_head omitted — HF will tie) |
| 2026-06-11 16:46:41,699 | HTTP Request: POST https://huggingface.co/api/repos/create "HTTP/1.1 409 Conflict" |
| 2026-06-11 16:46:41,759 | HTTP Request: POST https://huggingface.co/api/validate-yaml "HTTP/1.1 200 OK" |
| 2026-06-11 16:46:42,253 | HTTP Request: POST https://huggingface.co/api/validate-yaml "HTTP/1.1 200 OK" |
| 2026-06-11 16:46:42,414 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/preupload/main "HTTP/1.1 200 OK" |
| 2026-06-11 16:46:42,503 | HTTP Request: POST https://huggingface.co/GODELEV/Experimenting.git/info/lfs/objects/batch "HTTP/1.1 200 OK" |
| 2026-06-11 16:46:42,569 | HTTP Request: GET https://huggingface.co/api/models/GODELEV/Experimenting/xet-write-token/main "HTTP/1.1 200 OK" |
| 2026-06-11 16:46:49,206 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/commit/main "HTTP/1.1 200 OK" |
| 2026-06-11 16:46:49,209 | Hub push step=1264 |
| 2026-06-11 16:46:49,306 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/preupload/main "HTTP/1.1 200 OK" |
| 2026-06-11 16:46:50,104 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/commit/main "HTTP/1.1 200 OK" |
| 2026-06-11 16:46:50,203 | HTTP Request: POST https://huggingface.co/api/models/GODELEV/Experimenting/preupload/main "HTTP/1.1 200 OK" |
|
|