Image-Text-to-Text
Transformers
Safetensors
qwen3_5
llama-factory
full
Generated from Trainer
conversational
Instructions to use furproxy/9b-114 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use furproxy/9b-114 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="furproxy/9b-114")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)

# Load model directly
from transformers import AutoProcessor, AutoModelForImageTextToText

processor = AutoProcessor.from_pretrained("furproxy/9b-114")
model = AutoModelForImageTextToText.from_pretrained("furproxy/9b-114")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use furproxy/9b-114 with vLLM:
Install from pip and serve model
# Install vLLM from pip:
pip install vllm

# Start the vLLM server:
vllm serve "furproxy/9b-114"

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "furproxy/9b-114",
        "messages": [
            {
                "role": "user",
                "content": [
                    {
                        "type": "text",
                        "text": "Describe this image in one sentence."
                    },
                    {
                        "type": "image_url",
                        "image_url": {
                            "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
                        }
                    }
                ]
            }
        ]
    }'
Use Docker
docker model run hf.co/furproxy/9b-114
- SGLang
How to use furproxy/9b-114 with SGLang:
Install from pip and serve model
# Install SGLang from pip:
pip install sglang

# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "furproxy/9b-114" \
    --host 0.0.0.0 \
    --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "furproxy/9b-114",
        "messages": [
            {
                "role": "user",
                "content": [
                    {
                        "type": "text",
                        "text": "Describe this image in one sentence."
                    },
                    {
                        "type": "image_url",
                        "image_url": {
                            "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
                        }
                    }
                ]
            }
        ]
    }'
Use Docker images
docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "furproxy/9b-114" \
        --host 0.0.0.0 \
        --port 30000

# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "furproxy/9b-114",
        "messages": [
            {
                "role": "user",
                "content": [
                    {
                        "type": "text",
                        "text": "Describe this image in one sentence."
                    },
                    {
                        "type": "image_url",
                        "image_url": {
                            "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
                        }
                    }
                ]
            }
        ]
    }'
- Docker Model Runner
How to use furproxy/9b-114 with Docker Model Runner:
docker model run hf.co/furproxy/9b-114
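The curl calls shown for vLLM and SGLang can also be issued from Python. The following is a minimal sketch using only the standard library. It assumes a server is already running at the given base URL (port 8000 for vLLM, 30000 for SGLang, as in the commands above) and that the response follows the usual OpenAI chat-completions shape. The helper names `build_payload` and `chat` are hypothetical, introduced only for illustration.

```python
import json
import urllib.request


def build_payload(prompt, image_url, model="furproxy/9b-114"):
    """Build the same JSON body as the curl examples: one user turn
    holding a text part and an image_url part."""
    return {
        "model": model,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {"type": "image_url", "image_url": {"url": image_url}},
                ],
            }
        ],
    }


def chat(base_url, payload, timeout=120):
    """POST the payload to an OpenAI-compatible /v1/chat/completions
    endpoint and return the text of the first choice."""
    request = urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]


# Example call (requires a running server, so it is left commented out):
# payload = build_payload(
#     "Describe this image in one sentence.",
#     "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg",
# )
# print(chat("http://localhost:8000", payload))
```

Keeping payload construction separate from the network call makes it possible to inspect or log the request body before sending it.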
| {"current_steps": 2, "total_steps": 2457, "loss": 1.9268020391464233, "lr": 8.130081300813009e-08, "epoch": 0.002442002442002442, "percentage": 0.08, "elapsed_time": "0:00:10", "remaining_time": "3:27:51"} | |
| {"current_steps": 4, "total_steps": 2457, "loss": 2.098186492919922, "lr": 2.439024390243903e-07, "epoch": 0.004884004884004884, "percentage": 0.16, "elapsed_time": "0:00:17", "remaining_time": "2:57:00"} | |
| {"current_steps": 6, "total_steps": 2457, "loss": 2.4138333797454834, "lr": 4.0650406504065046e-07, "epoch": 0.007326007326007326, "percentage": 0.24, "elapsed_time": "0:00:23", "remaining_time": "2:43:02"} | |
| {"current_steps": 8, "total_steps": 2457, "loss": 1.953867793083191, "lr": 5.691056910569106e-07, "epoch": 0.009768009768009768, "percentage": 0.33, "elapsed_time": "0:00:32", "remaining_time": "2:45:06"} | |
| {"current_steps": 10, "total_steps": 2457, "loss": 2.1249871253967285, "lr": 7.317073170731707e-07, "epoch": 0.01221001221001221, "percentage": 0.41, "elapsed_time": "0:00:39", "remaining_time": "2:39:15"} | |
| {"current_steps": 12, "total_steps": 2457, "loss": 1.870603322982788, "lr": 8.94308943089431e-07, "epoch": 0.014652014652014652, "percentage": 0.49, "elapsed_time": "0:00:44", "remaining_time": "2:32:25"} | |
| {"current_steps": 14, "total_steps": 2457, "loss": 1.646697998046875, "lr": 1.0569105691056912e-06, "epoch": 0.017094017094017096, "percentage": 0.57, "elapsed_time": "0:00:51", "remaining_time": "2:29:02"} | |
| {"current_steps": 16, "total_steps": 2457, "loss": 1.6898235082626343, "lr": 1.2195121951219514e-06, "epoch": 0.019536019536019536, "percentage": 0.65, "elapsed_time": "0:01:00", "remaining_time": "2:34:46"} | |
| {"current_steps": 18, "total_steps": 2457, "loss": 1.8439620733261108, "lr": 1.3821138211382116e-06, "epoch": 0.02197802197802198, "percentage": 0.73, "elapsed_time": "0:01:12", "remaining_time": "2:43:36"} | |
| {"current_steps": 20, "total_steps": 2457, "loss": 1.6882305145263672, "lr": 1.5447154471544717e-06, "epoch": 0.02442002442002442, "percentage": 0.81, "elapsed_time": "0:01:23", "remaining_time": "2:50:08"} | |
| {"current_steps": 22, "total_steps": 2457, "loss": 1.4086613655090332, "lr": 1.707317073170732e-06, "epoch": 0.026862026862026864, "percentage": 0.9, "elapsed_time": "0:01:30", "remaining_time": "2:47:10"} | |
| {"current_steps": 24, "total_steps": 2457, "loss": 1.22359037399292, "lr": 1.8699186991869919e-06, "epoch": 0.029304029304029304, "percentage": 0.98, "elapsed_time": "0:01:40", "remaining_time": "2:50:12"} | |
| {"current_steps": 26, "total_steps": 2457, "loss": 1.6794222593307495, "lr": 2.0325203252032523e-06, "epoch": 0.031746031746031744, "percentage": 1.06, "elapsed_time": "0:01:53", "remaining_time": "2:56:36"} | |
| {"current_steps": 28, "total_steps": 2457, "loss": 1.7425767183303833, "lr": 2.1951219512195125e-06, "epoch": 0.03418803418803419, "percentage": 1.14, "elapsed_time": "0:02:02", "remaining_time": "2:56:53"} | |
| {"current_steps": 30, "total_steps": 2457, "loss": 1.1433881521224976, "lr": 2.3577235772357727e-06, "epoch": 0.03663003663003663, "percentage": 1.22, "elapsed_time": "0:02:10", "remaining_time": "2:56:04"} | |
| {"current_steps": 32, "total_steps": 2457, "loss": 1.0014692544937134, "lr": 2.5203252032520324e-06, "epoch": 0.03907203907203907, "percentage": 1.3, "elapsed_time": "0:02:18", "remaining_time": "2:54:56"} | |
| {"current_steps": 34, "total_steps": 2457, "loss": 1.558118224143982, "lr": 2.682926829268293e-06, "epoch": 0.04151404151404151, "percentage": 1.38, "elapsed_time": "0:02:26", "remaining_time": "2:54:10"} | |
| {"current_steps": 36, "total_steps": 2457, "loss": 1.2752659320831299, "lr": 2.845528455284553e-06, "epoch": 0.04395604395604396, "percentage": 1.47, "elapsed_time": "0:02:37", "remaining_time": "2:56:52"} | |
| {"current_steps": 38, "total_steps": 2457, "loss": 1.5238615274429321, "lr": 3.0081300813008134e-06, "epoch": 0.0463980463980464, "percentage": 1.55, "elapsed_time": "0:02:46", "remaining_time": "2:56:48"} | |
| {"current_steps": 40, "total_steps": 2457, "loss": 1.3837251663208008, "lr": 3.1707317073170736e-06, "epoch": 0.04884004884004884, "percentage": 1.63, "elapsed_time": "0:02:54", "remaining_time": "2:56:02"} | |
| {"current_steps": 42, "total_steps": 2457, "loss": 1.4677042961120605, "lr": 3.3333333333333333e-06, "epoch": 0.05128205128205128, "percentage": 1.71, "elapsed_time": "0:03:02", "remaining_time": "2:55:06"} | |
| {"current_steps": 44, "total_steps": 2457, "loss": 1.4204754829406738, "lr": 3.495934959349594e-06, "epoch": 0.05372405372405373, "percentage": 1.79, "elapsed_time": "0:03:10", "remaining_time": "2:53:57"} | |
| {"current_steps": 46, "total_steps": 2457, "loss": 1.3837416172027588, "lr": 3.6585365853658537e-06, "epoch": 0.05616605616605617, "percentage": 1.87, "elapsed_time": "0:03:20", "remaining_time": "2:55:03"} | |
| {"current_steps": 48, "total_steps": 2457, "loss": 1.0739210844039917, "lr": 3.821138211382115e-06, "epoch": 0.05860805860805861, "percentage": 1.95, "elapsed_time": "0:03:29", "remaining_time": "2:55:06"} | |
| {"current_steps": 50, "total_steps": 2457, "loss": 0.9827917814254761, "lr": 3.983739837398374e-06, "epoch": 0.06105006105006105, "percentage": 2.04, "elapsed_time": "0:03:37", "remaining_time": "2:54:13"} | |
| {"current_steps": 52, "total_steps": 2457, "loss": 1.3439877033233643, "lr": 4.146341463414634e-06, "epoch": 0.06349206349206349, "percentage": 2.12, "elapsed_time": "0:03:44", "remaining_time": "2:53:10"} | |
| {"current_steps": 54, "total_steps": 2457, "loss": 1.0445363521575928, "lr": 4.308943089430894e-06, "epoch": 0.06593406593406594, "percentage": 2.2, "elapsed_time": "0:03:51", "remaining_time": "2:51:56"} | |
| {"current_steps": 56, "total_steps": 2457, "loss": 1.3443750143051147, "lr": 4.471544715447155e-06, "epoch": 0.06837606837606838, "percentage": 2.28, "elapsed_time": "0:04:00", "remaining_time": "2:52:12"} | |
| {"current_steps": 58, "total_steps": 2457, "loss": 1.3475321531295776, "lr": 4.634146341463416e-06, "epoch": 0.07081807081807082, "percentage": 2.36, "elapsed_time": "0:04:09", "remaining_time": "2:52:12"} | |
| {"current_steps": 60, "total_steps": 2457, "loss": 1.4037724733352661, "lr": 4.796747967479675e-06, "epoch": 0.07326007326007326, "percentage": 2.44, "elapsed_time": "0:04:16", "remaining_time": "2:50:46"} | |
| {"current_steps": 62, "total_steps": 2457, "loss": 1.0292410850524902, "lr": 4.959349593495935e-06, "epoch": 0.0757020757020757, "percentage": 2.52, "elapsed_time": "0:04:23", "remaining_time": "2:49:20"} | |
| {"current_steps": 64, "total_steps": 2457, "loss": 1.6453887224197388, "lr": 5.121951219512195e-06, "epoch": 0.07814407814407814, "percentage": 2.6, "elapsed_time": "0:04:30", "remaining_time": "2:48:41"} | |
| {"current_steps": 66, "total_steps": 2457, "loss": 1.1829452514648438, "lr": 5.2845528455284555e-06, "epoch": 0.08058608058608059, "percentage": 2.69, "elapsed_time": "0:04:38", "remaining_time": "2:48:05"} | |
| {"current_steps": 68, "total_steps": 2457, "loss": 1.1011936664581299, "lr": 5.447154471544716e-06, "epoch": 0.08302808302808302, "percentage": 2.77, "elapsed_time": "0:04:45", "remaining_time": "2:47:24"} | |
| {"current_steps": 70, "total_steps": 2457, "loss": 1.1616472005844116, "lr": 5.609756097560977e-06, "epoch": 0.08547008547008547, "percentage": 2.85, "elapsed_time": "0:04:53", "remaining_time": "2:46:36"} | |
| {"current_steps": 72, "total_steps": 2457, "loss": 1.3839240074157715, "lr": 5.772357723577237e-06, "epoch": 0.08791208791208792, "percentage": 2.93, "elapsed_time": "0:05:04", "remaining_time": "2:48:13"} | |
| {"current_steps": 74, "total_steps": 2457, "loss": 1.4121818542480469, "lr": 5.934959349593496e-06, "epoch": 0.09035409035409035, "percentage": 3.01, "elapsed_time": "0:05:12", "remaining_time": "2:47:46"} | |
| {"current_steps": 76, "total_steps": 2457, "loss": 1.416529893875122, "lr": 6.0975609756097564e-06, "epoch": 0.0927960927960928, "percentage": 3.09, "elapsed_time": "0:05:20", "remaining_time": "2:47:32"} | |
| {"current_steps": 78, "total_steps": 2457, "loss": 1.222318410873413, "lr": 6.260162601626017e-06, "epoch": 0.09523809523809523, "percentage": 3.17, "elapsed_time": "0:05:29", "remaining_time": "2:47:19"} | |
| {"current_steps": 80, "total_steps": 2457, "loss": 1.1652414798736572, "lr": 6.422764227642278e-06, "epoch": 0.09768009768009768, "percentage": 3.26, "elapsed_time": "0:05:37", "remaining_time": "2:46:56"} | |
| {"current_steps": 82, "total_steps": 2457, "loss": 1.0345308780670166, "lr": 6.585365853658538e-06, "epoch": 0.10012210012210013, "percentage": 3.34, "elapsed_time": "0:05:48", "remaining_time": "2:48:10"} | |
| {"current_steps": 84, "total_steps": 2457, "loss": 1.0803658962249756, "lr": 6.747967479674797e-06, "epoch": 0.10256410256410256, "percentage": 3.42, "elapsed_time": "0:05:58", "remaining_time": "2:48:43"} | |
| {"current_steps": 86, "total_steps": 2457, "loss": 1.249168038368225, "lr": 6.910569105691057e-06, "epoch": 0.10500610500610501, "percentage": 3.5, "elapsed_time": "0:06:04", "remaining_time": "2:47:33"} | |
| {"current_steps": 88, "total_steps": 2457, "loss": 1.4062129259109497, "lr": 7.0731707317073175e-06, "epoch": 0.10744810744810745, "percentage": 3.58, "elapsed_time": "0:06:11", "remaining_time": "2:46:47"} | |
| {"current_steps": 90, "total_steps": 2457, "loss": 1.1516574621200562, "lr": 7.2357723577235786e-06, "epoch": 0.10989010989010989, "percentage": 3.66, "elapsed_time": "0:06:20", "remaining_time": "2:46:54"} | |
| {"current_steps": 92, "total_steps": 2457, "loss": 1.4968205690383911, "lr": 7.398373983739838e-06, "epoch": 0.11233211233211234, "percentage": 3.74, "elapsed_time": "0:06:27", "remaining_time": "2:45:53"} | |
| {"current_steps": 94, "total_steps": 2457, "loss": 1.1234135627746582, "lr": 7.560975609756098e-06, "epoch": 0.11477411477411477, "percentage": 3.83, "elapsed_time": "0:06:34", "remaining_time": "2:45:06"} | |
| {"current_steps": 96, "total_steps": 2457, "loss": 1.4027729034423828, "lr": 7.723577235772358e-06, "epoch": 0.11721611721611722, "percentage": 3.91, "elapsed_time": "0:06:41", "remaining_time": "2:44:32"} | |
| {"current_steps": 98, "total_steps": 2457, "loss": 1.3487744331359863, "lr": 7.886178861788618e-06, "epoch": 0.11965811965811966, "percentage": 3.99, "elapsed_time": "0:06:47", "remaining_time": "2:43:30"} | |
| {"current_steps": 100, "total_steps": 2457, "loss": 1.074942708015442, "lr": 8.048780487804879e-06, "epoch": 0.1221001221001221, "percentage": 4.07, "elapsed_time": "0:06:54", "remaining_time": "2:42:44"} | |
| {"current_steps": 102, "total_steps": 2457, "loss": 1.0268707275390625, "lr": 8.21138211382114e-06, "epoch": 0.12454212454212454, "percentage": 4.15, "elapsed_time": "0:06:59", "remaining_time": "2:41:30"} | |
| {"current_steps": 104, "total_steps": 2457, "loss": 0.9993240833282471, "lr": 8.373983739837399e-06, "epoch": 0.12698412698412698, "percentage": 4.23, "elapsed_time": "0:07:05", "remaining_time": "2:40:29"} | |
| {"current_steps": 106, "total_steps": 2457, "loss": 0.9525761604309082, "lr": 8.536585365853658e-06, "epoch": 0.12942612942612944, "percentage": 4.31, "elapsed_time": "0:07:10", "remaining_time": "2:39:15"} | |
| {"current_steps": 108, "total_steps": 2457, "loss": 1.3531205654144287, "lr": 8.69918699186992e-06, "epoch": 0.13186813186813187, "percentage": 4.4, "elapsed_time": "0:07:18", "remaining_time": "2:38:51"} | |
| {"current_steps": 110, "total_steps": 2457, "loss": 1.6010525226593018, "lr": 8.86178861788618e-06, "epoch": 0.1343101343101343, "percentage": 4.48, "elapsed_time": "0:07:25", "remaining_time": "2:38:25"} | |
| {"current_steps": 112, "total_steps": 2457, "loss": 1.41323721408844, "lr": 9.02439024390244e-06, "epoch": 0.13675213675213677, "percentage": 4.56, "elapsed_time": "0:07:32", "remaining_time": "2:37:55"} | |
| {"current_steps": 114, "total_steps": 2457, "loss": 1.1879425048828125, "lr": 9.1869918699187e-06, "epoch": 0.1391941391941392, "percentage": 4.64, "elapsed_time": "0:07:38", "remaining_time": "2:37:11"} | |
| {"current_steps": 116, "total_steps": 2457, "loss": 1.3165570497512817, "lr": 9.34959349593496e-06, "epoch": 0.14163614163614163, "percentage": 4.72, "elapsed_time": "0:07:46", "remaining_time": "2:36:48"} | |
| {"current_steps": 118, "total_steps": 2457, "loss": 1.2242571115493774, "lr": 9.51219512195122e-06, "epoch": 0.14407814407814407, "percentage": 4.8, "elapsed_time": "0:07:56", "remaining_time": "2:37:33"} | |
| {"current_steps": 120, "total_steps": 2457, "loss": 1.3259317874908447, "lr": 9.67479674796748e-06, "epoch": 0.14652014652014653, "percentage": 4.88, "elapsed_time": "0:08:05", "remaining_time": "2:37:35"} | |
| {"current_steps": 122, "total_steps": 2457, "loss": 1.3270224332809448, "lr": 9.837398373983741e-06, "epoch": 0.14896214896214896, "percentage": 4.97, "elapsed_time": "0:08:13", "remaining_time": "2:37:23"} | |
| {"current_steps": 124, "total_steps": 2457, "loss": 1.3085572719573975, "lr": 1e-05, "epoch": 0.1514041514041514, "percentage": 5.05, "elapsed_time": "0:08:25", "remaining_time": "2:38:24"} | |
| {"current_steps": 126, "total_steps": 2457, "loss": 1.1765004396438599, "lr": 1.0162601626016262e-05, "epoch": 0.15384615384615385, "percentage": 5.13, "elapsed_time": "0:08:33", "remaining_time": "2:38:12"} | |
| {"current_steps": 128, "total_steps": 2457, "loss": 1.4069057703018188, "lr": 1.0325203252032521e-05, "epoch": 0.1562881562881563, "percentage": 5.21, "elapsed_time": "0:08:44", "remaining_time": "2:38:56"} | |
| {"current_steps": 130, "total_steps": 2457, "loss": 1.2615665197372437, "lr": 1.0487804878048782e-05, "epoch": 0.15873015873015872, "percentage": 5.29, "elapsed_time": "0:08:52", "remaining_time": "2:38:49"} | |
| {"current_steps": 132, "total_steps": 2457, "loss": 1.0511776208877563, "lr": 1.065040650406504e-05, "epoch": 0.16117216117216118, "percentage": 5.37, "elapsed_time": "0:08:59", "remaining_time": "2:38:28"} | |
| {"current_steps": 134, "total_steps": 2457, "loss": 1.1562919616699219, "lr": 1.0813008130081301e-05, "epoch": 0.16361416361416362, "percentage": 5.45, "elapsed_time": "0:09:10", "remaining_time": "2:39:11"} | |
| {"current_steps": 136, "total_steps": 2457, "loss": 1.1669853925704956, "lr": 1.0975609756097562e-05, "epoch": 0.16605616605616605, "percentage": 5.54, "elapsed_time": "0:09:17", "remaining_time": "2:38:27"} | |
| {"current_steps": 138, "total_steps": 2457, "loss": 1.132803201675415, "lr": 1.1138211382113821e-05, "epoch": 0.1684981684981685, "percentage": 5.62, "elapsed_time": "0:09:23", "remaining_time": "2:37:50"} | |
| {"current_steps": 140, "total_steps": 2457, "loss": 1.078572392463684, "lr": 1.1300813008130082e-05, "epoch": 0.17094017094017094, "percentage": 5.7, "elapsed_time": "0:09:29", "remaining_time": "2:37:10"} | |
| {"current_steps": 142, "total_steps": 2457, "loss": 1.3616305589675903, "lr": 1.1463414634146342e-05, "epoch": 0.17338217338217338, "percentage": 5.78, "elapsed_time": "0:09:36", "remaining_time": "2:36:31"} | |
| {"current_steps": 144, "total_steps": 2457, "loss": 1.1173185110092163, "lr": 1.1626016260162603e-05, "epoch": 0.17582417582417584, "percentage": 5.86, "elapsed_time": "0:09:41", "remaining_time": "2:35:46"} | |
| {"current_steps": 146, "total_steps": 2457, "loss": 0.9900561571121216, "lr": 1.1788617886178864e-05, "epoch": 0.17826617826617827, "percentage": 5.94, "elapsed_time": "0:09:49", "remaining_time": "2:35:36"} | |
| {"current_steps": 148, "total_steps": 2457, "loss": 0.9898566007614136, "lr": 1.1951219512195123e-05, "epoch": 0.1807081807081807, "percentage": 6.02, "elapsed_time": "0:09:56", "remaining_time": "2:35:08"} | |
| {"current_steps": 150, "total_steps": 2457, "loss": 1.3455116748809814, "lr": 1.2113821138211384e-05, "epoch": 0.18315018315018314, "percentage": 6.11, "elapsed_time": "0:10:02", "remaining_time": "2:34:31"} | |
| {"current_steps": 152, "total_steps": 2457, "loss": 1.4696322679519653, "lr": 1.2276422764227642e-05, "epoch": 0.1855921855921856, "percentage": 6.19, "elapsed_time": "0:10:09", "remaining_time": "2:33:56"} | |
| {"current_steps": 154, "total_steps": 2457, "loss": 1.343530297279358, "lr": 1.2439024390243903e-05, "epoch": 0.18803418803418803, "percentage": 6.27, "elapsed_time": "0:10:15", "remaining_time": "2:33:17"} | |
| {"current_steps": 156, "total_steps": 2457, "loss": 1.6466281414031982, "lr": 1.2601626016260164e-05, "epoch": 0.19047619047619047, "percentage": 6.35, "elapsed_time": "0:10:21", "remaining_time": "2:32:40"} | |
| {"current_steps": 158, "total_steps": 2457, "loss": 1.1969726085662842, "lr": 1.2764227642276423e-05, "epoch": 0.19291819291819293, "percentage": 6.43, "elapsed_time": "0:10:27", "remaining_time": "2:32:06"} | |
| {"current_steps": 160, "total_steps": 2457, "loss": 0.961052656173706, "lr": 1.2926829268292684e-05, "epoch": 0.19536019536019536, "percentage": 6.51, "elapsed_time": "0:10:33", "remaining_time": "2:31:34"} | |
| {"current_steps": 162, "total_steps": 2457, "loss": 1.4117612838745117, "lr": 1.3089430894308943e-05, "epoch": 0.1978021978021978, "percentage": 6.59, "elapsed_time": "0:10:39", "remaining_time": "2:31:04"} | |
| {"current_steps": 164, "total_steps": 2457, "loss": 1.319150447845459, "lr": 1.3252032520325204e-05, "epoch": 0.20024420024420025, "percentage": 6.67, "elapsed_time": "0:10:47", "remaining_time": "2:30:49"} | |
| {"current_steps": 166, "total_steps": 2457, "loss": 1.3318078517913818, "lr": 1.3414634146341466e-05, "epoch": 0.2026862026862027, "percentage": 6.76, "elapsed_time": "0:10:54", "remaining_time": "2:30:29"} | |
| {"current_steps": 168, "total_steps": 2457, "loss": 1.1935322284698486, "lr": 1.3577235772357725e-05, "epoch": 0.20512820512820512, "percentage": 6.84, "elapsed_time": "0:11:00", "remaining_time": "2:30:03"} | |
| {"current_steps": 170, "total_steps": 2457, "loss": 0.9753493666648865, "lr": 1.3739837398373986e-05, "epoch": 0.20757020757020758, "percentage": 6.92, "elapsed_time": "0:11:07", "remaining_time": "2:29:34"} | |
| {"current_steps": 172, "total_steps": 2457, "loss": 1.0886809825897217, "lr": 1.3902439024390244e-05, "epoch": 0.21001221001221002, "percentage": 7.0, "elapsed_time": "0:11:13", "remaining_time": "2:29:05"} | |
| {"current_steps": 174, "total_steps": 2457, "loss": 1.3587074279785156, "lr": 1.4065040650406505e-05, "epoch": 0.21245421245421245, "percentage": 7.08, "elapsed_time": "0:11:19", "remaining_time": "2:28:33"} | |
| {"current_steps": 176, "total_steps": 2457, "loss": 1.2677640914916992, "lr": 1.4227642276422766e-05, "epoch": 0.2148962148962149, "percentage": 7.16, "elapsed_time": "0:11:25", "remaining_time": "2:28:01"} | |
| {"current_steps": 178, "total_steps": 2457, "loss": 1.3696472644805908, "lr": 1.4390243902439025e-05, "epoch": 0.21733821733821734, "percentage": 7.24, "elapsed_time": "0:11:31", "remaining_time": "2:27:32"} | |
| {"current_steps": 180, "total_steps": 2457, "loss": 1.0324885845184326, "lr": 1.4552845528455286e-05, "epoch": 0.21978021978021978, "percentage": 7.33, "elapsed_time": "0:11:39", "remaining_time": "2:27:29"} | |
| {"current_steps": 182, "total_steps": 2457, "loss": 0.9206986427307129, "lr": 1.4715447154471545e-05, "epoch": 0.2222222222222222, "percentage": 7.41, "elapsed_time": "0:11:46", "remaining_time": "2:27:11"} | |
| {"current_steps": 184, "total_steps": 2457, "loss": 1.4569449424743652, "lr": 1.4878048780487806e-05, "epoch": 0.22466422466422467, "percentage": 7.49, "elapsed_time": "0:11:52", "remaining_time": "2:26:46"} | |
| {"current_steps": 186, "total_steps": 2457, "loss": 1.3317943811416626, "lr": 1.5040650406504067e-05, "epoch": 0.2271062271062271, "percentage": 7.57, "elapsed_time": "0:11:59", "remaining_time": "2:26:19"} | |
| {"current_steps": 188, "total_steps": 2457, "loss": 1.4440950155258179, "lr": 1.5203252032520327e-05, "epoch": 0.22954822954822954, "percentage": 7.65, "elapsed_time": "0:12:05", "remaining_time": "2:25:52"} | |
| {"current_steps": 190, "total_steps": 2457, "loss": 1.4389160871505737, "lr": 1.5365853658536586e-05, "epoch": 0.231990231990232, "percentage": 7.73, "elapsed_time": "0:12:11", "remaining_time": "2:25:32"} | |
| {"current_steps": 192, "total_steps": 2457, "loss": 1.383222222328186, "lr": 1.5528455284552847e-05, "epoch": 0.23443223443223443, "percentage": 7.81, "elapsed_time": "0:12:17", "remaining_time": "2:25:03"} | |
| {"current_steps": 194, "total_steps": 2457, "loss": 1.3772218227386475, "lr": 1.5691056910569108e-05, "epoch": 0.23687423687423687, "percentage": 7.9, "elapsed_time": "0:12:25", "remaining_time": "2:24:55"} | |
| {"current_steps": 196, "total_steps": 2457, "loss": 1.152698040008545, "lr": 1.585365853658537e-05, "epoch": 0.23931623931623933, "percentage": 7.98, "elapsed_time": "0:12:32", "remaining_time": "2:24:44"} | |
| {"current_steps": 198, "total_steps": 2457, "loss": 1.3445426225662231, "lr": 1.6016260162601627e-05, "epoch": 0.24175824175824176, "percentage": 8.06, "elapsed_time": "0:12:39", "remaining_time": "2:24:20"} | |
| {"current_steps": 200, "total_steps": 2457, "loss": 1.4353071451187134, "lr": 1.6178861788617888e-05, "epoch": 0.2442002442002442, "percentage": 8.14, "elapsed_time": "0:12:45", "remaining_time": "2:23:56"} | |
| {"current_steps": 202, "total_steps": 2457, "loss": 1.3451241254806519, "lr": 1.6341463414634145e-05, "epoch": 0.24664224664224665, "percentage": 8.22, "elapsed_time": "0:12:52", "remaining_time": "2:23:39"} | |
| {"current_steps": 204, "total_steps": 2457, "loss": 1.0413107872009277, "lr": 1.6504065040650406e-05, "epoch": 0.2490842490842491, "percentage": 8.3, "elapsed_time": "0:12:59", "remaining_time": "2:23:32"} | |
| {"current_steps": 206, "total_steps": 2457, "loss": 1.319067120552063, "lr": 1.6666666666666667e-05, "epoch": 0.2515262515262515, "percentage": 8.38, "elapsed_time": "0:13:08", "remaining_time": "2:23:40"} | |
| {"current_steps": 208, "total_steps": 2457, "loss": 1.1813554763793945, "lr": 1.682926829268293e-05, "epoch": 0.25396825396825395, "percentage": 8.47, "elapsed_time": "0:13:19", "remaining_time": "2:24:00"} | |
| {"current_steps": 210, "total_steps": 2457, "loss": 0.9516135454177856, "lr": 1.699186991869919e-05, "epoch": 0.2564102564102564, "percentage": 8.55, "elapsed_time": "0:13:26", "remaining_time": "2:23:54"} | |
| {"current_steps": 212, "total_steps": 2457, "loss": 0.7080205678939819, "lr": 1.7154471544715447e-05, "epoch": 0.2588522588522589, "percentage": 8.63, "elapsed_time": "0:13:33", "remaining_time": "2:23:31"} | |
| {"current_steps": 214, "total_steps": 2457, "loss": 1.2077349424362183, "lr": 1.7317073170731708e-05, "epoch": 0.2612942612942613, "percentage": 8.71, "elapsed_time": "0:13:39", "remaining_time": "2:23:06"} | |
| {"current_steps": 216, "total_steps": 2457, "loss": 1.340700626373291, "lr": 1.747967479674797e-05, "epoch": 0.26373626373626374, "percentage": 8.79, "elapsed_time": "0:13:45", "remaining_time": "2:22:45"} | |
| {"current_steps": 218, "total_steps": 2457, "loss": 1.2717061042785645, "lr": 1.7642276422764227e-05, "epoch": 0.2661782661782662, "percentage": 8.87, "elapsed_time": "0:13:52", "remaining_time": "2:22:26"} | |
| {"current_steps": 220, "total_steps": 2457, "loss": 1.3473409414291382, "lr": 1.7804878048780488e-05, "epoch": 0.2686202686202686, "percentage": 8.95, "elapsed_time": "0:14:00", "remaining_time": "2:22:31"} | |
| {"current_steps": 222, "total_steps": 2457, "loss": 1.0149593353271484, "lr": 1.796747967479675e-05, "epoch": 0.27106227106227104, "percentage": 9.04, "elapsed_time": "0:14:11", "remaining_time": "2:22:51"} | |
| {"current_steps": 224, "total_steps": 2457, "loss": 1.0153287649154663, "lr": 1.813008130081301e-05, "epoch": 0.27350427350427353, "percentage": 9.12, "elapsed_time": "0:14:20", "remaining_time": "2:22:58"} | |
| {"current_steps": 226, "total_steps": 2457, "loss": 1.3861629962921143, "lr": 1.829268292682927e-05, "epoch": 0.27594627594627597, "percentage": 9.2, "elapsed_time": "0:14:29", "remaining_time": "2:22:59"} | |
| {"current_steps": 228, "total_steps": 2457, "loss": 1.4393969774246216, "lr": 1.845528455284553e-05, "epoch": 0.2783882783882784, "percentage": 9.28, "elapsed_time": "0:14:38", "remaining_time": "2:23:05"} | |
| {"current_steps": 230, "total_steps": 2457, "loss": 1.3899247646331787, "lr": 1.861788617886179e-05, "epoch": 0.28083028083028083, "percentage": 9.36, "elapsed_time": "0:14:46", "remaining_time": "2:23:01"} | |
| {"current_steps": 232, "total_steps": 2457, "loss": 1.5776143074035645, "lr": 1.878048780487805e-05, "epoch": 0.28327228327228327, "percentage": 9.44, "elapsed_time": "0:14:54", "remaining_time": "2:22:57"} | |
| {"current_steps": 234, "total_steps": 2457, "loss": 1.1832704544067383, "lr": 1.8943089430894312e-05, "epoch": 0.2857142857142857, "percentage": 9.52, "elapsed_time": "0:15:04", "remaining_time": "2:23:08"} | |
| {"current_steps": 236, "total_steps": 2457, "loss": 1.4064499139785767, "lr": 1.9105691056910573e-05, "epoch": 0.28815628815628813, "percentage": 9.61, "elapsed_time": "0:15:11", "remaining_time": "2:22:53"} | |
| {"current_steps": 238, "total_steps": 2457, "loss": 1.3489106893539429, "lr": 1.926829268292683e-05, "epoch": 0.2905982905982906, "percentage": 9.69, "elapsed_time": "0:15:17", "remaining_time": "2:22:30"} | |
| {"current_steps": 240, "total_steps": 2457, "loss": 1.2338833808898926, "lr": 1.943089430894309e-05, "epoch": 0.29304029304029305, "percentage": 9.77, "elapsed_time": "0:15:22", "remaining_time": "2:22:01"} | |
| {"current_steps": 242, "total_steps": 2457, "loss": 1.1017166376113892, "lr": 1.959349593495935e-05, "epoch": 0.2954822954822955, "percentage": 9.85, "elapsed_time": "0:15:28", "remaining_time": "2:21:40"} | |
| {"current_steps": 244, "total_steps": 2457, "loss": 1.4224822521209717, "lr": 1.975609756097561e-05, "epoch": 0.2979242979242979, "percentage": 9.93, "elapsed_time": "0:15:34", "remaining_time": "2:21:19"} | |
| {"current_steps": 246, "total_steps": 2457, "loss": 0.9105122089385986, "lr": 1.991869918699187e-05, "epoch": 0.30036630036630035, "percentage": 10.01, "elapsed_time": "0:15:40", "remaining_time": "2:20:50"} | |
| {"current_steps": 248, "total_steps": 2457, "loss": 1.3537715673446655, "lr": 1.9999990914795638e-05, "epoch": 0.3028083028083028, "percentage": 10.09, "elapsed_time": "0:15:48", "remaining_time": "2:20:46"} | |
| {"current_steps": 250, "total_steps": 2457, "loss": 1.1235604286193848, "lr": 1.9999918233270764e-05, "epoch": 0.3052503052503053, "percentage": 10.18, "elapsed_time": "0:15:55", "remaining_time": "2:20:32"} | |
| {"current_steps": 252, "total_steps": 2457, "loss": 1.2547414302825928, "lr": 1.999977287080797e-05, "epoch": 0.3076923076923077, "percentage": 10.26, "elapsed_time": "0:16:01", "remaining_time": "2:20:08"} | |
| {"current_steps": 254, "total_steps": 2457, "loss": 1.4373202323913574, "lr": 1.9999554828581173e-05, "epoch": 0.31013431013431014, "percentage": 10.34, "elapsed_time": "0:16:07", "remaining_time": "2:19:54"} | |
| {"current_steps": 256, "total_steps": 2457, "loss": 1.3956284523010254, "lr": 1.9999264108351216e-05, "epoch": 0.3125763125763126, "percentage": 10.42, "elapsed_time": "0:16:14", "remaining_time": "2:19:39"} | |
| {"current_steps": 258, "total_steps": 2457, "loss": 1.4139020442962646, "lr": 1.999890071246588e-05, "epoch": 0.315018315018315, "percentage": 10.5, "elapsed_time": "0:16:22", "remaining_time": "2:19:32"} | |
| {"current_steps": 260, "total_steps": 2457, "loss": 1.3567984104156494, "lr": 1.9998464643859853e-05, "epoch": 0.31746031746031744, "percentage": 10.58, "elapsed_time": "0:16:28", "remaining_time": "2:19:16"} | |
| {"current_steps": 262, "total_steps": 2457, "loss": 1.6041795015335083, "lr": 1.999795590605471e-05, "epoch": 0.3199023199023199, "percentage": 10.66, "elapsed_time": "0:16:37", "remaining_time": "2:19:19"} | |
| {"current_steps": 264, "total_steps": 2457, "loss": 0.9505234956741333, "lr": 1.9997374503158877e-05, "epoch": 0.32234432234432236, "percentage": 10.74, "elapsed_time": "0:16:44", "remaining_time": "2:19:07"} | |
| {"current_steps": 266, "total_steps": 2457, "loss": 1.1375908851623535, "lr": 1.9996720439867617e-05, "epoch": 0.3247863247863248, "percentage": 10.83, "elapsed_time": "0:16:51", "remaining_time": "2:18:52"} | |
| {"current_steps": 268, "total_steps": 2457, "loss": 1.5744917392730713, "lr": 1.9995993721462966e-05, "epoch": 0.32722832722832723, "percentage": 10.91, "elapsed_time": "0:17:00", "remaining_time": "2:18:53"} | |
| {"current_steps": 270, "total_steps": 2457, "loss": 1.1887890100479126, "lr": 1.9995194353813707e-05, "epoch": 0.32967032967032966, "percentage": 10.99, "elapsed_time": "0:17:07", "remaining_time": "2:18:45"} | |
| {"current_steps": 272, "total_steps": 2457, "loss": 1.4438523054122925, "lr": 1.999432234337532e-05, "epoch": 0.3321123321123321, "percentage": 11.07, "elapsed_time": "0:17:15", "remaining_time": "2:18:39"} | |
| {"current_steps": 274, "total_steps": 2457, "loss": 1.2220399379730225, "lr": 1.999337769718993e-05, "epoch": 0.33455433455433453, "percentage": 11.15, "elapsed_time": "0:17:23", "remaining_time": "2:18:36"} | |
| {"current_steps": 276, "total_steps": 2457, "loss": 1.1481637954711914, "lr": 1.9992360422886246e-05, "epoch": 0.336996336996337, "percentage": 11.23, "elapsed_time": "0:17:35", "remaining_time": "2:18:58"} | |
| {"current_steps": 278, "total_steps": 2457, "loss": 1.5834959745407104, "lr": 1.9991270528679508e-05, "epoch": 0.33943833943833945, "percentage": 11.31, "elapsed_time": "0:17:43", "remaining_time": "2:18:56"} | |
| {"current_steps": 280, "total_steps": 2457, "loss": 1.4441936016082764, "lr": 1.9990108023371403e-05, "epoch": 0.3418803418803419, "percentage": 11.4, "elapsed_time": "0:17:51", "remaining_time": "2:18:54"} | |
| {"current_steps": 282, "total_steps": 2457, "loss": 1.376705288887024, "lr": 1.9988872916350022e-05, "epoch": 0.3443223443223443, "percentage": 11.48, "elapsed_time": "0:17:59", "remaining_time": "2:18:49"} | |
| {"current_steps": 284, "total_steps": 2457, "loss": 1.4534231424331665, "lr": 1.9987565217589756e-05, "epoch": 0.34676434676434675, "percentage": 11.56, "elapsed_time": "0:18:05", "remaining_time": "2:18:27"} | |
| {"current_steps": 286, "total_steps": 2457, "loss": 1.2641198635101318, "lr": 1.9986184937651227e-05, "epoch": 0.3492063492063492, "percentage": 11.64, "elapsed_time": "0:18:15", "remaining_time": "2:18:37"} | |
| {"current_steps": 288, "total_steps": 2457, "loss": 1.834381341934204, "lr": 1.9984732087681215e-05, "epoch": 0.3516483516483517, "percentage": 11.72, "elapsed_time": "0:18:24", "remaining_time": "2:18:39"} | |
| {"current_steps": 290, "total_steps": 2457, "loss": 1.1039708852767944, "lr": 1.9983206679412542e-05, "epoch": 0.3540903540903541, "percentage": 11.8, "elapsed_time": "0:18:32", "remaining_time": "2:18:34"} | |
| {"current_steps": 292, "total_steps": 2457, "loss": 1.4267356395721436, "lr": 1.9981608725164002e-05, "epoch": 0.35653235653235654, "percentage": 11.88, "elapsed_time": "0:18:41", "remaining_time": "2:18:31"} | |
| {"current_steps": 294, "total_steps": 2457, "loss": 1.198704481124878, "lr": 1.9979938237840247e-05, "epoch": 0.358974358974359, "percentage": 11.97, "elapsed_time": "0:18:49", "remaining_time": "2:18:27"} | |
| {"current_steps": 296, "total_steps": 2457, "loss": 1.1538225412368774, "lr": 1.9978195230931686e-05, "epoch": 0.3614163614163614, "percentage": 12.05, "elapsed_time": "0:18:55", "remaining_time": "2:18:10"} | |
| {"current_steps": 298, "total_steps": 2457, "loss": 1.5473830699920654, "lr": 1.997637971851438e-05, "epoch": 0.36385836385836384, "percentage": 12.13, "elapsed_time": "0:19:04", "remaining_time": "2:18:09"} | |
| {"current_steps": 300, "total_steps": 2457, "loss": 1.357876181602478, "lr": 1.9974491715249917e-05, "epoch": 0.3663003663003663, "percentage": 12.21, "elapsed_time": "0:19:11", "remaining_time": "2:17:59"} | |
| {"current_steps": 302, "total_steps": 2457, "loss": 1.0178381204605103, "lr": 1.9972531236385314e-05, "epoch": 0.36874236874236876, "percentage": 12.29, "elapsed_time": "0:19:18", "remaining_time": "2:17:43"} | |
| {"current_steps": 304, "total_steps": 2457, "loss": 1.327938199043274, "lr": 1.997049829775287e-05, "epoch": 0.3711843711843712, "percentage": 12.37, "elapsed_time": "0:19:24", "remaining_time": "2:17:30"} | |
| {"current_steps": 306, "total_steps": 2457, "loss": 1.4819612503051758, "lr": 1.996839291577006e-05, "epoch": 0.37362637362637363, "percentage": 12.45, "elapsed_time": "0:19:30", "remaining_time": "2:17:06"} | |
| {"current_steps": 308, "total_steps": 2457, "loss": 1.418102741241455, "lr": 1.996621510743938e-05, "epoch": 0.37606837606837606, "percentage": 12.54, "elapsed_time": "0:19:36", "remaining_time": "2:16:50"} | |
| {"current_steps": 310, "total_steps": 2457, "loss": 1.4227708578109741, "lr": 1.9963964890348236e-05, "epoch": 0.3785103785103785, "percentage": 12.62, "elapsed_time": "0:19:42", "remaining_time": "2:16:33"} | |
| {"current_steps": 312, "total_steps": 2457, "loss": 1.1034045219421387, "lr": 1.9961642282668776e-05, "epoch": 0.38095238095238093, "percentage": 12.7, "elapsed_time": "0:19:49", "remaining_time": "2:16:14"} | |
| {"current_steps": 314, "total_steps": 2457, "loss": 1.4926037788391113, "lr": 1.9959247303157763e-05, "epoch": 0.3833943833943834, "percentage": 12.78, "elapsed_time": "0:19:56", "remaining_time": "2:16:08"} | |
| {"current_steps": 316, "total_steps": 2457, "loss": 0.8862283229827881, "lr": 1.995677997115641e-05, "epoch": 0.38583638583638585, "percentage": 12.86, "elapsed_time": "0:20:02", "remaining_time": "2:15:48"} | |
| {"current_steps": 318, "total_steps": 2457, "loss": 1.15045166015625, "lr": 1.9954240306590235e-05, "epoch": 0.3882783882783883, "percentage": 12.94, "elapsed_time": "0:20:08", "remaining_time": "2:15:29"} | |
| {"current_steps": 320, "total_steps": 2457, "loss": 1.4402953386306763, "lr": 1.9951628329968885e-05, "epoch": 0.3907203907203907, "percentage": 13.02, "elapsed_time": "0:20:14", "remaining_time": "2:15:12"} | |
| {"current_steps": 322, "total_steps": 2457, "loss": 1.456636667251587, "lr": 1.9948944062385994e-05, "epoch": 0.39316239316239315, "percentage": 13.11, "elapsed_time": "0:20:20", "remaining_time": "2:14:55"} | |
| {"current_steps": 324, "total_steps": 2457, "loss": 1.4146589040756226, "lr": 1.9946187525518986e-05, "epoch": 0.3956043956043956, "percentage": 13.19, "elapsed_time": "0:20:28", "remaining_time": "2:14:50"} | |
| {"current_steps": 326, "total_steps": 2457, "loss": 1.3673632144927979, "lr": 1.994335874162892e-05, "epoch": 0.398046398046398, "percentage": 13.27, "elapsed_time": "0:20:37", "remaining_time": "2:14:52"} | |
| {"current_steps": 328, "total_steps": 2457, "loss": 1.3601889610290527, "lr": 1.9940457733560293e-05, "epoch": 0.4004884004884005, "percentage": 13.35, "elapsed_time": "0:20:49", "remaining_time": "2:15:07"} | |
| {"current_steps": 330, "total_steps": 2457, "loss": 0.9897390007972717, "lr": 1.993748452474088e-05, "epoch": 0.40293040293040294, "percentage": 13.43, "elapsed_time": "0:20:57", "remaining_time": "2:15:08"} | |
| {"current_steps": 332, "total_steps": 2457, "loss": 0.6906993389129639, "lr": 1.9934439139181516e-05, "epoch": 0.4053724053724054, "percentage": 13.51, "elapsed_time": "0:21:06", "remaining_time": "2:15:05"} | |
| {"current_steps": 334, "total_steps": 2457, "loss": 1.1328214406967163, "lr": 1.993132160147593e-05, "epoch": 0.4078144078144078, "percentage": 13.59, "elapsed_time": "0:21:13", "remaining_time": "2:14:53"} | |
| {"current_steps": 336, "total_steps": 2457, "loss": 1.4789706468582153, "lr": 1.9928131936800514e-05, "epoch": 0.41025641025641024, "percentage": 13.68, "elapsed_time": "0:21:19", "remaining_time": "2:14:36"} | |
| {"current_steps": 338, "total_steps": 2457, "loss": 1.0828137397766113, "lr": 1.9924870170914157e-05, "epoch": 0.4126984126984127, "percentage": 13.76, "elapsed_time": "0:21:27", "remaining_time": "2:14:30"} | |
| {"current_steps": 340, "total_steps": 2457, "loss": 1.1599012613296509, "lr": 1.9921536330158007e-05, "epoch": 0.41514041514041516, "percentage": 13.84, "elapsed_time": "0:21:34", "remaining_time": "2:14:18"} | |
| {"current_steps": 342, "total_steps": 2457, "loss": 1.6682945489883423, "lr": 1.9918130441455273e-05, "epoch": 0.4175824175824176, "percentage": 13.92, "elapsed_time": "0:21:41", "remaining_time": "2:14:10"} | |
| {"current_steps": 344, "total_steps": 2457, "loss": 0.9947870969772339, "lr": 1.9914652532311005e-05, "epoch": 0.42002442002442003, "percentage": 14.0, "elapsed_time": "0:21:48", "remaining_time": "2:13:59"} | |
| {"current_steps": 346, "total_steps": 2457, "loss": 1.315640926361084, "lr": 1.991110263081186e-05, "epoch": 0.42246642246642246, "percentage": 14.08, "elapsed_time": "0:21:54", "remaining_time": "2:13:38"} | |
| {"current_steps": 348, "total_steps": 2457, "loss": 1.39967679977417, "lr": 1.9907480765625906e-05, "epoch": 0.4249084249084249, "percentage": 14.16, "elapsed_time": "0:22:00", "remaining_time": "2:13:23"} | |
| {"current_steps": 350, "total_steps": 2457, "loss": 0.9204920530319214, "lr": 1.9903786966002352e-05, "epoch": 0.42735042735042733, "percentage": 14.25, "elapsed_time": "0:22:06", "remaining_time": "2:13:08"} | |
| {"current_steps": 352, "total_steps": 2457, "loss": 1.1823644638061523, "lr": 1.9900021261771348e-05, "epoch": 0.4297924297924298, "percentage": 14.33, "elapsed_time": "0:22:12", "remaining_time": "2:12:50"} | |
| {"current_steps": 354, "total_steps": 2457, "loss": 1.3596951961517334, "lr": 1.9896183683343706e-05, "epoch": 0.43223443223443225, "percentage": 14.41, "elapsed_time": "0:22:20", "remaining_time": "2:12:42"} | |
| {"current_steps": 356, "total_steps": 2457, "loss": 1.03623628616333, "lr": 1.989227426171069e-05, "epoch": 0.4346764346764347, "percentage": 14.49, "elapsed_time": "0:22:27", "remaining_time": "2:12:30"} | |
| {"current_steps": 358, "total_steps": 2457, "loss": 1.240249514579773, "lr": 1.9888293028443747e-05, "epoch": 0.4371184371184371, "percentage": 14.57, "elapsed_time": "0:22:33", "remaining_time": "2:12:16"} | |
| {"current_steps": 360, "total_steps": 2457, "loss": 1.281577467918396, "lr": 1.9884240015694248e-05, "epoch": 0.43956043956043955, "percentage": 14.65, "elapsed_time": "0:22:40", "remaining_time": "2:12:03"} | |
| {"current_steps": 362, "total_steps": 2457, "loss": 1.1424391269683838, "lr": 1.988011525619325e-05, "epoch": 0.442002442002442, "percentage": 14.73, "elapsed_time": "0:22:46", "remaining_time": "2:11:48"} | |
| {"current_steps": 364, "total_steps": 2457, "loss": 1.2371528148651123, "lr": 1.9875918783251207e-05, "epoch": 0.4444444444444444, "percentage": 14.81, "elapsed_time": "0:22:51", "remaining_time": "2:11:28"} | |
| {"current_steps": 366, "total_steps": 2457, "loss": 1.4550820589065552, "lr": 1.9871650630757716e-05, "epoch": 0.4468864468864469, "percentage": 14.9, "elapsed_time": "0:22:58", "remaining_time": "2:11:17"} | |
| {"current_steps": 368, "total_steps": 2457, "loss": 1.1890130043029785, "lr": 1.9867310833181234e-05, "epoch": 0.44932844932844934, "percentage": 14.98, "elapsed_time": "0:23:04", "remaining_time": "2:10:59"} | |
| {"current_steps": 370, "total_steps": 2457, "loss": 1.2029908895492554, "lr": 1.986289942556881e-05, "epoch": 0.4517704517704518, "percentage": 15.06, "elapsed_time": "0:23:09", "remaining_time": "2:10:39"} | |
| {"current_steps": 372, "total_steps": 2457, "loss": 1.3851736783981323, "lr": 1.9858416443545794e-05, "epoch": 0.4542124542124542, "percentage": 15.14, "elapsed_time": "0:23:15", "remaining_time": "2:10:23"} | |
| {"current_steps": 374, "total_steps": 2457, "loss": 1.0434424877166748, "lr": 1.9853861923315555e-05, "epoch": 0.45665445665445664, "percentage": 15.22, "elapsed_time": "0:23:22", "remaining_time": "2:10:11"} | |
| {"current_steps": 376, "total_steps": 2457, "loss": 1.301484227180481, "lr": 1.984923590165918e-05, "epoch": 0.4590964590964591, "percentage": 15.3, "elapsed_time": "0:23:30", "remaining_time": "2:10:07"} | |
| {"current_steps": 378, "total_steps": 2457, "loss": 1.0400949716567993, "lr": 1.9844538415935187e-05, "epoch": 0.46153846153846156, "percentage": 15.38, "elapsed_time": "0:23:36", "remaining_time": "2:09:50"} | |
| {"current_steps": 380, "total_steps": 2457, "loss": 0.9666699767112732, "lr": 1.983976950407922e-05, "epoch": 0.463980463980464, "percentage": 15.47, "elapsed_time": "0:23:42", "remaining_time": "2:09:35"} | |
| {"current_steps": 382, "total_steps": 2457, "loss": 1.3446414470672607, "lr": 1.983492920460373e-05, "epoch": 0.46642246642246643, "percentage": 15.55, "elapsed_time": "0:23:48", "remaining_time": "2:09:18"} | |
| {"current_steps": 384, "total_steps": 2457, "loss": 1.2357232570648193, "lr": 1.983001755659769e-05, "epoch": 0.46886446886446886, "percentage": 15.63, "elapsed_time": "0:23:53", "remaining_time": "2:08:59"} | |
| {"current_steps": 386, "total_steps": 2457, "loss": 1.2619645595550537, "lr": 1.9825034599726263e-05, "epoch": 0.4713064713064713, "percentage": 15.71, "elapsed_time": "0:23:59", "remaining_time": "2:08:42"} | |
| {"current_steps": 388, "total_steps": 2457, "loss": 1.6904096603393555, "lr": 1.9819980374230468e-05, "epoch": 0.47374847374847373, "percentage": 15.79, "elapsed_time": "0:24:06", "remaining_time": "2:08:31"} | |
| {"current_steps": 390, "total_steps": 2457, "loss": 0.9965710639953613, "lr": 1.981485492092689e-05, "epoch": 0.47619047619047616, "percentage": 15.87, "elapsed_time": "0:24:14", "remaining_time": "2:08:30"} | |
| {"current_steps": 392, "total_steps": 2457, "loss": 0.9120445251464844, "lr": 1.9809658281207318e-05, "epoch": 0.47863247863247865, "percentage": 15.95, "elapsed_time": "0:24:20", "remaining_time": "2:08:14"} | |
| {"current_steps": 394, "total_steps": 2457, "loss": 1.0203512907028198, "lr": 1.980439049703843e-05, "epoch": 0.4810744810744811, "percentage": 16.04, "elapsed_time": "0:24:26", "remaining_time": "2:07:58"} | |
| {"current_steps": 396, "total_steps": 2457, "loss": 1.3058192729949951, "lr": 1.979905161096144e-05, "epoch": 0.4835164835164835, "percentage": 16.12, "elapsed_time": "0:24:32", "remaining_time": "2:07:45"} | |
| {"current_steps": 398, "total_steps": 2457, "loss": 1.3444452285766602, "lr": 1.9793641666091773e-05, "epoch": 0.48595848595848595, "percentage": 16.2, "elapsed_time": "0:24:39", "remaining_time": "2:07:32"} | |
| {"current_steps": 400, "total_steps": 2457, "loss": 0.6673938035964966, "lr": 1.9788160706118698e-05, "epoch": 0.4884004884004884, "percentage": 16.28, "elapsed_time": "0:24:45", "remaining_time": "2:07:20"} | |
| {"current_steps": 402, "total_steps": 2457, "loss": 1.3050227165222168, "lr": 1.978260877530499e-05, "epoch": 0.4908424908424908, "percentage": 16.36, "elapsed_time": "0:24:53", "remaining_time": "2:07:12"} | |
| {"current_steps": 404, "total_steps": 2457, "loss": 1.4215201139450073, "lr": 1.9776985918486552e-05, "epoch": 0.4932844932844933, "percentage": 16.44, "elapsed_time": "0:25:00", "remaining_time": "2:07:02"} | |
| {"current_steps": 406, "total_steps": 2457, "loss": 0.8944355845451355, "lr": 1.9771292181072076e-05, "epoch": 0.49572649572649574, "percentage": 16.52, "elapsed_time": "0:25:05", "remaining_time": "2:06:46"} | |
| {"current_steps": 408, "total_steps": 2457, "loss": 1.0254771709442139, "lr": 1.9765527609042676e-05, "epoch": 0.4981684981684982, "percentage": 16.61, "elapsed_time": "0:25:12", "remaining_time": "2:06:34"} | |
| {"current_steps": 410, "total_steps": 2457, "loss": 1.3571816682815552, "lr": 1.9759692248951482e-05, "epoch": 0.5006105006105006, "percentage": 16.69, "elapsed_time": "0:25:20", "remaining_time": "2:06:29"} | |
| {"current_steps": 412, "total_steps": 2457, "loss": 0.6523332595825195, "lr": 1.975378614792332e-05, "epoch": 0.503052503052503, "percentage": 16.77, "elapsed_time": "0:25:27", "remaining_time": "2:06:22"} | |
| {"current_steps": 414, "total_steps": 2457, "loss": 1.3964738845825195, "lr": 1.9747809353654276e-05, "epoch": 0.5054945054945055, "percentage": 16.85, "elapsed_time": "0:25:35", "remaining_time": "2:06:15"} | |
| {"current_steps": 416, "total_steps": 2457, "loss": 1.3599458932876587, "lr": 1.974176191441135e-05, "epoch": 0.5079365079365079, "percentage": 16.93, "elapsed_time": "0:25:43", "remaining_time": "2:06:11"} | |
| {"current_steps": 418, "total_steps": 2457, "loss": 1.1259132623672485, "lr": 1.973564387903204e-05, "epoch": 0.5103785103785103, "percentage": 17.01, "elapsed_time": "0:25:52", "remaining_time": "2:06:15"} | |
| {"current_steps": 420, "total_steps": 2457, "loss": 1.3250101804733276, "lr": 1.972945529692398e-05, "epoch": 0.5128205128205128, "percentage": 17.09, "elapsed_time": "0:26:03", "remaining_time": "2:06:23"} | |
| {"current_steps": 422, "total_steps": 2457, "loss": 1.3246148824691772, "lr": 1.97231962180645e-05, "epoch": 0.5152625152625152, "percentage": 17.18, "elapsed_time": "0:26:11", "remaining_time": "2:06:16"} | |
| {"current_steps": 424, "total_steps": 2457, "loss": 1.3295143842697144, "lr": 1.9716866693000248e-05, "epoch": 0.5177045177045178, "percentage": 17.26, "elapsed_time": "0:26:18", "remaining_time": "2:06:10"} | |
| {"current_steps": 426, "total_steps": 2457, "loss": 1.1310526132583618, "lr": 1.9710466772846784e-05, "epoch": 0.5201465201465202, "percentage": 17.34, "elapsed_time": "0:26:27", "remaining_time": "2:06:07"} | |
| {"current_steps": 428, "total_steps": 2457, "loss": 1.341339111328125, "lr": 1.9703996509288153e-05, "epoch": 0.5225885225885226, "percentage": 17.42, "elapsed_time": "0:26:37", "remaining_time": "2:06:13"} | |
| {"current_steps": 430, "total_steps": 2457, "loss": 0.984380841255188, "lr": 1.9697455954576478e-05, "epoch": 0.525030525030525, "percentage": 17.5, "elapsed_time": "0:26:48", "remaining_time": "2:06:22"} | |
| {"current_steps": 432, "total_steps": 2457, "loss": 0.6374328136444092, "lr": 1.9690845161531532e-05, "epoch": 0.5274725274725275, "percentage": 17.58, "elapsed_time": "0:26:56", "remaining_time": "2:06:16"} | |
| {"current_steps": 434, "total_steps": 2457, "loss": 1.363136887550354, "lr": 1.968416418354032e-05, "epoch": 0.5299145299145299, "percentage": 17.66, "elapsed_time": "0:27:04", "remaining_time": "2:06:11"} | |
| {"current_steps": 436, "total_steps": 2457, "loss": 1.3728197813034058, "lr": 1.967741307455663e-05, "epoch": 0.5323565323565324, "percentage": 17.75, "elapsed_time": "0:27:12", "remaining_time": "2:06:09"} | |
| {"current_steps": 438, "total_steps": 2457, "loss": 1.3319021463394165, "lr": 1.967059188910062e-05, "epoch": 0.5347985347985348, "percentage": 17.83, "elapsed_time": "0:27:21", "remaining_time": "2:06:07"} | |
| {"current_steps": 440, "total_steps": 2457, "loss": 1.299553394317627, "lr": 1.9663700682258367e-05, "epoch": 0.5372405372405372, "percentage": 17.91, "elapsed_time": "0:27:33", "remaining_time": "2:06:19"} | |
| {"current_steps": 442, "total_steps": 2457, "loss": 1.1493945121765137, "lr": 1.9656739509681413e-05, "epoch": 0.5396825396825397, "percentage": 17.99, "elapsed_time": "0:27:41", "remaining_time": "2:06:13"} | |
| {"current_steps": 444, "total_steps": 2457, "loss": 1.0136598348617554, "lr": 1.9649708427586333e-05, "epoch": 0.5421245421245421, "percentage": 18.07, "elapsed_time": "0:27:50", "remaining_time": "2:06:12"} | |
| {"current_steps": 446, "total_steps": 2457, "loss": 1.1629705429077148, "lr": 1.964260749275427e-05, "epoch": 0.5445665445665445, "percentage": 18.15, "elapsed_time": "0:27:57", "remaining_time": "2:06:04"} | |
| {"current_steps": 448, "total_steps": 2457, "loss": 1.1858645677566528, "lr": 1.963543676253048e-05, "epoch": 0.5470085470085471, "percentage": 18.23, "elapsed_time": "0:28:05", "remaining_time": "2:05:58"} | |
| {"current_steps": 450, "total_steps": 2457, "loss": 1.1235462427139282, "lr": 1.962819629482386e-05, "epoch": 0.5494505494505495, "percentage": 18.32, "elapsed_time": "0:28:14", "remaining_time": "2:05:58"} | |
| {"current_steps": 452, "total_steps": 2457, "loss": 0.9178623557090759, "lr": 1.9620886148106498e-05, "epoch": 0.5518925518925519, "percentage": 18.4, "elapsed_time": "0:28:23", "remaining_time": "2:05:57"} | |
| {"current_steps": 454, "total_steps": 2457, "loss": 1.377665400505066, "lr": 1.9613506381413194e-05, "epoch": 0.5543345543345544, "percentage": 18.48, "elapsed_time": "0:28:30", "remaining_time": "2:05:46"} | |
| {"current_steps": 456, "total_steps": 2457, "loss": 1.3081351518630981, "lr": 1.960605705434097e-05, "epoch": 0.5567765567765568, "percentage": 18.56, "elapsed_time": "0:28:38", "remaining_time": "2:05:38"} | |
| {"current_steps": 458, "total_steps": 2457, "loss": 0.8939856290817261, "lr": 1.95985382270486e-05, "epoch": 0.5592185592185592, "percentage": 18.64, "elapsed_time": "0:28:45", "remaining_time": "2:05:31"} | |
| {"current_steps": 460, "total_steps": 2457, "loss": 1.266584873199463, "lr": 1.9590949960256132e-05, "epoch": 0.5616605616605617, "percentage": 18.72, "elapsed_time": "0:28:52", "remaining_time": "2:05:23"} | |
| {"current_steps": 462, "total_steps": 2457, "loss": 1.2569012641906738, "lr": 1.9583292315244383e-05, "epoch": 0.5641025641025641, "percentage": 18.8, "elapsed_time": "0:29:02", "remaining_time": "2:05:22"} | |
| {"current_steps": 464, "total_steps": 2457, "loss": 0.641703724861145, "lr": 1.9575565353854448e-05, "epoch": 0.5665445665445665, "percentage": 18.88, "elapsed_time": "0:29:12", "remaining_time": "2:05:25"} | |
| {"current_steps": 466, "total_steps": 2457, "loss": 1.567794680595398, "lr": 1.9567769138487208e-05, "epoch": 0.568986568986569, "percentage": 18.97, "elapsed_time": "0:29:21", "remaining_time": "2:05:25"} | |
| {"current_steps": 468, "total_steps": 2457, "loss": 1.3980201482772827, "lr": 1.955990373210281e-05, "epoch": 0.5714285714285714, "percentage": 19.05, "elapsed_time": "0:29:29", "remaining_time": "2:05:20"} | |
| {"current_steps": 470, "total_steps": 2457, "loss": 1.1457037925720215, "lr": 1.9551969198220188e-05, "epoch": 0.5738705738705738, "percentage": 19.13, "elapsed_time": "0:29:38", "remaining_time": "2:05:20"} | |
| {"current_steps": 472, "total_steps": 2457, "loss": 1.344892144203186, "lr": 1.954396560091652e-05, "epoch": 0.5763125763125763, "percentage": 19.21, "elapsed_time": "0:29:47", "remaining_time": "2:05:16"} | |
| {"current_steps": 474, "total_steps": 2457, "loss": 0.9534360766410828, "lr": 1.953589300482671e-05, "epoch": 0.5787545787545788, "percentage": 19.29, "elapsed_time": "0:29:57", "remaining_time": "2:05:20"} | |
| {"current_steps": 476, "total_steps": 2457, "loss": 1.0838558673858643, "lr": 1.9527751475142904e-05, "epoch": 0.5811965811965812, "percentage": 19.37, "elapsed_time": "0:30:06", "remaining_time": "2:05:19"} | |
| {"current_steps": 478, "total_steps": 2457, "loss": 1.2320207357406616, "lr": 1.951954107761391e-05, "epoch": 0.5836385836385837, "percentage": 19.45, "elapsed_time": "0:30:16", "remaining_time": "2:05:20"} | |
| {"current_steps": 480, "total_steps": 2457, "loss": 1.3821120262145996, "lr": 1.9511261878544715e-05, "epoch": 0.5860805860805861, "percentage": 19.54, "elapsed_time": "0:30:24", "remaining_time": "2:05:15"} | |
| {"current_steps": 482, "total_steps": 2457, "loss": 0.5741876363754272, "lr": 1.950291394479592e-05, "epoch": 0.5885225885225885, "percentage": 19.62, "elapsed_time": "0:30:32", "remaining_time": "2:05:10"} | |
| {"current_steps": 484, "total_steps": 2457, "loss": 1.1259833574295044, "lr": 1.9494497343783212e-05, "epoch": 0.590964590964591, "percentage": 19.7, "elapsed_time": "0:30:42", "remaining_time": "2:05:12"} | |
| {"current_steps": 486, "total_steps": 2457, "loss": 1.1523076295852661, "lr": 1.9486012143476813e-05, "epoch": 0.5934065934065934, "percentage": 19.78, "elapsed_time": "0:30:51", "remaining_time": "2:05:08"} | |
| {"current_steps": 488, "total_steps": 2457, "loss": 1.0496693849563599, "lr": 1.9477458412400934e-05, "epoch": 0.5958485958485958, "percentage": 19.86, "elapsed_time": "0:30:57", "remaining_time": "2:04:54"} | |
| {"current_steps": 490, "total_steps": 2457, "loss": 1.1105148792266846, "lr": 1.946883621963323e-05, "epoch": 0.5982905982905983, "percentage": 19.94, "elapsed_time": "0:31:03", "remaining_time": "2:04:39"} | |
| {"current_steps": 492, "total_steps": 2457, "loss": 0.9300603866577148, "lr": 1.946014563480422e-05, "epoch": 0.6007326007326007, "percentage": 20.02, "elapsed_time": "0:31:08", "remaining_time": "2:04:21"} | |
| {"current_steps": 494, "total_steps": 2457, "loss": 1.0661330223083496, "lr": 1.9451386728096758e-05, "epoch": 0.6031746031746031, "percentage": 20.11, "elapsed_time": "0:31:14", "remaining_time": "2:04:07"} | |
| {"current_steps": 496, "total_steps": 2457, "loss": 1.304194450378418, "lr": 1.9442559570245433e-05, "epoch": 0.6056166056166056, "percentage": 20.19, "elapsed_time": "0:31:20", "remaining_time": "2:03:53"} | |
| {"current_steps": 498, "total_steps": 2457, "loss": 0.6469916105270386, "lr": 1.9433664232536014e-05, "epoch": 0.608058608058608, "percentage": 20.27, "elapsed_time": "0:31:26", "remaining_time": "2:03:41"} | |
| {"current_steps": 500, "total_steps": 2457, "loss": 0.9863432049751282, "lr": 1.9424700786804877e-05, "epoch": 0.6105006105006106, "percentage": 20.35, "elapsed_time": "0:31:33", "remaining_time": "2:03:32"} | |
| {"current_steps": 502, "total_steps": 2457, "loss": 1.2856956720352173, "lr": 1.9415669305438413e-05, "epoch": 0.612942612942613, "percentage": 20.43, "elapsed_time": "0:31:41", "remaining_time": "2:03:26"} | |
| {"current_steps": 504, "total_steps": 2457, "loss": 1.3286441564559937, "lr": 1.9406569861372466e-05, "epoch": 0.6153846153846154, "percentage": 20.51, "elapsed_time": "0:31:48", "remaining_time": "2:03:14"} | |
| {"current_steps": 506, "total_steps": 2457, "loss": 1.3130193948745728, "lr": 1.9397402528091707e-05, "epoch": 0.6178266178266179, "percentage": 20.59, "elapsed_time": "0:31:54", "remaining_time": "2:03:01"} | |
| {"current_steps": 508, "total_steps": 2457, "loss": 1.380988597869873, "lr": 1.9388167379629076e-05, "epoch": 0.6202686202686203, "percentage": 20.68, "elapsed_time": "0:31:59", "remaining_time": "2:02:46"} | |
| {"current_steps": 510, "total_steps": 2457, "loss": 1.3338630199432373, "lr": 1.9378864490565172e-05, "epoch": 0.6227106227106227, "percentage": 20.76, "elapsed_time": "0:32:07", "remaining_time": "2:02:38"} | |
| {"current_steps": 512, "total_steps": 2457, "loss": 1.2690256834030151, "lr": 1.9369493936027642e-05, "epoch": 0.6251526251526252, "percentage": 20.84, "elapsed_time": "0:32:14", "remaining_time": "2:02:29"} | |
| {"current_steps": 514, "total_steps": 2457, "loss": 1.1770192384719849, "lr": 1.9360055791690584e-05, "epoch": 0.6275946275946276, "percentage": 20.92, "elapsed_time": "0:32:22", "remaining_time": "2:02:24"} | |
| {"current_steps": 516, "total_steps": 2457, "loss": 1.119304895401001, "lr": 1.935055013377393e-05, "epoch": 0.63003663003663, "percentage": 21.0, "elapsed_time": "0:32:33", "remaining_time": "2:02:29"} | |
| {"current_steps": 518, "total_steps": 2457, "loss": 1.34721040725708, "lr": 1.934097703904284e-05, "epoch": 0.6324786324786325, "percentage": 21.08, "elapsed_time": "0:32:44", "remaining_time": "2:02:33"} | |
| {"current_steps": 520, "total_steps": 2457, "loss": 0.9806722402572632, "lr": 1.933133658480707e-05, "epoch": 0.6349206349206349, "percentage": 21.16, "elapsed_time": "0:32:52", "remaining_time": "2:02:27"} | |
| {"current_steps": 522, "total_steps": 2457, "loss": 1.0333569049835205, "lr": 1.9321628848920358e-05, "epoch": 0.6373626373626373, "percentage": 21.25, "elapsed_time": "0:33:00", "remaining_time": "2:02:22"} | |
| {"current_steps": 524, "total_steps": 2457, "loss": 1.087817907333374, "lr": 1.9311853909779785e-05, "epoch": 0.6398046398046398, "percentage": 21.33, "elapsed_time": "0:33:09", "remaining_time": "2:02:17"} | |
| {"current_steps": 526, "total_steps": 2457, "loss": 1.3438972234725952, "lr": 1.9302011846325156e-05, "epoch": 0.6422466422466423, "percentage": 21.41, "elapsed_time": "0:33:20", "remaining_time": "2:02:25"} | |
| {"current_steps": 528, "total_steps": 2457, "loss": 1.38664972782135, "lr": 1.9292102738038347e-05, "epoch": 0.6446886446886447, "percentage": 21.49, "elapsed_time": "0:33:30", "remaining_time": "2:02:24"} | |
| {"current_steps": 530, "total_steps": 2457, "loss": 1.1136956214904785, "lr": 1.9282126664942667e-05, "epoch": 0.6471306471306472, "percentage": 21.57, "elapsed_time": "0:33:38", "remaining_time": "2:02:18"} | |
| {"current_steps": 532, "total_steps": 2457, "loss": 1.0266146659851074, "lr": 1.927208370760223e-05, "epoch": 0.6495726495726496, "percentage": 21.65, "elapsed_time": "0:33:45", "remaining_time": "2:02:09"} | |
| {"current_steps": 534, "total_steps": 2457, "loss": 1.6666396856307983, "lr": 1.9261973947121273e-05, "epoch": 0.652014652014652, "percentage": 21.73, "elapsed_time": "0:33:53", "remaining_time": "2:02:02"} | |
| {"current_steps": 536, "total_steps": 2457, "loss": 0.9882057309150696, "lr": 1.925179746514352e-05, "epoch": 0.6544566544566545, "percentage": 21.82, "elapsed_time": "0:34:02", "remaining_time": "2:02:01"} | |
| {"current_steps": 538, "total_steps": 2457, "loss": 1.368809461593628, "lr": 1.9241554343851537e-05, "epoch": 0.6568986568986569, "percentage": 21.9, "elapsed_time": "0:34:10", "remaining_time": "2:01:55"} | |
| {"current_steps": 540, "total_steps": 2457, "loss": 1.3585935831069946, "lr": 1.923124466596602e-05, "epoch": 0.6593406593406593, "percentage": 21.98, "elapsed_time": "0:34:18", "remaining_time": "2:01:46"} | |
| {"current_steps": 542, "total_steps": 2457, "loss": 1.0160579681396484, "lr": 1.922086851474519e-05, "epoch": 0.6617826617826618, "percentage": 22.06, "elapsed_time": "0:34:25", "remaining_time": "2:01:38"} | |
| {"current_steps": 544, "total_steps": 2457, "loss": 1.3244247436523438, "lr": 1.9210425973984074e-05, "epoch": 0.6642246642246642, "percentage": 22.14, "elapsed_time": "0:34:35", "remaining_time": "2:01:36"} | |
| {"current_steps": 546, "total_steps": 2457, "loss": 1.2471184730529785, "lr": 1.9199917128013836e-05, "epoch": 0.6666666666666666, "percentage": 22.22, "elapsed_time": "0:34:43", "remaining_time": "2:01:30"} | |
| {"current_steps": 548, "total_steps": 2457, "loss": 1.3621915578842163, "lr": 1.918934206170112e-05, "epoch": 0.6691086691086691, "percentage": 22.3, "elapsed_time": "0:34:50", "remaining_time": "2:01:23"} | |
| {"current_steps": 550, "total_steps": 2457, "loss": 1.230018973350525, "lr": 1.917870086044734e-05, "epoch": 0.6715506715506715, "percentage": 22.39, "elapsed_time": "0:35:00", "remaining_time": "2:01:23"} | |
| {"current_steps": 552, "total_steps": 2457, "loss": 1.0613629817962646, "lr": 1.9167993610187988e-05, "epoch": 0.673992673992674, "percentage": 22.47, "elapsed_time": "0:35:09", "remaining_time": "2:01:21"} | |
| {"current_steps": 554, "total_steps": 2457, "loss": 1.1644939184188843, "lr": 1.915722039739197e-05, "epoch": 0.6764346764346765, "percentage": 22.55, "elapsed_time": "0:35:17", "remaining_time": "2:01:12"} | |
| {"current_steps": 556, "total_steps": 2457, "loss": 0.9099707007408142, "lr": 1.9146381309060874e-05, "epoch": 0.6788766788766789, "percentage": 22.63, "elapsed_time": "0:35:24", "remaining_time": "2:01:05"} | |
| {"current_steps": 558, "total_steps": 2457, "loss": 1.228736400604248, "lr": 1.913547643272828e-05, "epoch": 0.6813186813186813, "percentage": 22.71, "elapsed_time": "0:35:33", "remaining_time": "2:01:00"} | |
| {"current_steps": 560, "total_steps": 2457, "loss": 1.3034601211547852, "lr": 1.912450585645907e-05, "epoch": 0.6837606837606838, "percentage": 22.79, "elapsed_time": "0:35:42", "remaining_time": "2:00:56"} | |
| {"current_steps": 562, "total_steps": 2457, "loss": 1.072668433189392, "lr": 1.9113469668848675e-05, "epoch": 0.6862026862026862, "percentage": 22.87, "elapsed_time": "0:35:54", "remaining_time": "2:01:04"} | |
| {"current_steps": 564, "total_steps": 2457, "loss": 1.3628251552581787, "lr": 1.9102367959022417e-05, "epoch": 0.6886446886446886, "percentage": 22.95, "elapsed_time": "0:36:02", "remaining_time": "2:00:57"} | |
| {"current_steps": 566, "total_steps": 2457, "loss": 1.1910985708236694, "lr": 1.909120081663473e-05, "epoch": 0.6910866910866911, "percentage": 23.04, "elapsed_time": "0:36:10", "remaining_time": "2:00:51"} | |
| {"current_steps": 568, "total_steps": 2457, "loss": 1.4165751934051514, "lr": 1.9079968331868487e-05, "epoch": 0.6935286935286935, "percentage": 23.12, "elapsed_time": "0:36:16", "remaining_time": "2:00:37"} | |
| {"current_steps": 570, "total_steps": 2457, "loss": 1.1330338716506958, "lr": 1.9068670595434228e-05, "epoch": 0.6959706959706959, "percentage": 23.2, "elapsed_time": "0:36:22", "remaining_time": "2:00:24"} | |
| {"current_steps": 572, "total_steps": 2457, "loss": 1.0612688064575195, "lr": 1.9057307698569458e-05, "epoch": 0.6984126984126984, "percentage": 23.28, "elapsed_time": "0:36:29", "remaining_time": "2:00:15"} | |
| {"current_steps": 574, "total_steps": 2457, "loss": 1.4824306964874268, "lr": 1.9045879733037907e-05, "epoch": 0.7008547008547008, "percentage": 23.36, "elapsed_time": "0:36:37", "remaining_time": "2:00:08"} | |
| {"current_steps": 576, "total_steps": 2457, "loss": 1.28273606300354, "lr": 1.9034386791128766e-05, "epoch": 0.7032967032967034, "percentage": 23.44, "elapsed_time": "0:36:43", "remaining_time": "1:59:57"} | |
| {"current_steps": 578, "total_steps": 2457, "loss": 1.2495508193969727, "lr": 1.9022828965655975e-05, "epoch": 0.7057387057387058, "percentage": 23.52, "elapsed_time": "0:36:50", "remaining_time": "1:59:45"} | |
| {"current_steps": 580, "total_steps": 2457, "loss": 1.2048630714416504, "lr": 1.9011206349957444e-05, "epoch": 0.7081807081807082, "percentage": 23.61, "elapsed_time": "0:36:56", "remaining_time": "1:59:33"} | |
| {"current_steps": 582, "total_steps": 2457, "loss": 1.2845754623413086, "lr": 1.899951903789431e-05, "epoch": 0.7106227106227107, "percentage": 23.69, "elapsed_time": "0:37:02", "remaining_time": "1:59:19"} | |
| {"current_steps": 584, "total_steps": 2457, "loss": 1.2032135725021362, "lr": 1.8987767123850197e-05, "epoch": 0.7130647130647131, "percentage": 23.77, "elapsed_time": "0:37:07", "remaining_time": "1:59:04"} | |
| {"current_steps": 586, "total_steps": 2457, "loss": 1.375983715057373, "lr": 1.8975950702730425e-05, "epoch": 0.7155067155067155, "percentage": 23.85, "elapsed_time": "0:37:14", "remaining_time": "1:58:54"} | |
| {"current_steps": 588, "total_steps": 2457, "loss": 1.1112651824951172, "lr": 1.8964069869961254e-05, "epoch": 0.717948717948718, "percentage": 23.93, "elapsed_time": "0:37:23", "remaining_time": "1:58:50"} | |
| {"current_steps": 590, "total_steps": 2457, "loss": 1.0283359289169312, "lr": 1.8952124721489115e-05, "epoch": 0.7203907203907204, "percentage": 24.01, "elapsed_time": "0:37:30", "remaining_time": "1:58:42"} | |
| {"current_steps": 592, "total_steps": 2457, "loss": 0.9025493860244751, "lr": 1.8940115353779847e-05, "epoch": 0.7228327228327228, "percentage": 24.09, "elapsed_time": "0:37:37", "remaining_time": "1:58:30"} | |
| {"current_steps": 594, "total_steps": 2457, "loss": 1.2699706554412842, "lr": 1.8928041863817896e-05, "epoch": 0.7252747252747253, "percentage": 24.18, "elapsed_time": "0:37:43", "remaining_time": "1:58:19"} | |
| {"current_steps": 596, "total_steps": 2457, "loss": 1.0194693803787231, "lr": 1.891590434910554e-05, "epoch": 0.7277167277167277, "percentage": 24.26, "elapsed_time": "0:37:49", "remaining_time": "1:58:07"} | |
| {"current_steps": 598, "total_steps": 2457, "loss": 1.160589337348938, "lr": 1.890370290766212e-05, "epoch": 0.7301587301587301, "percentage": 24.34, "elapsed_time": "0:37:54", "remaining_time": "1:57:51"} | |
| {"current_steps": 600, "total_steps": 2457, "loss": 1.2648638486862183, "lr": 1.8891437638023212e-05, "epoch": 0.7326007326007326, "percentage": 24.42, "elapsed_time": "0:38:00", "remaining_time": "1:57:39"} | |
| {"current_steps": 602, "total_steps": 2457, "loss": 1.3810834884643555, "lr": 1.8879108639239864e-05, "epoch": 0.7350427350427351, "percentage": 24.5, "elapsed_time": "0:38:07", "remaining_time": "1:57:29"} | |
| {"current_steps": 604, "total_steps": 2457, "loss": 1.2209972143173218, "lr": 1.8866716010877774e-05, "epoch": 0.7374847374847375, "percentage": 24.58, "elapsed_time": "0:38:14", "remaining_time": "1:57:17"} | |
| {"current_steps": 606, "total_steps": 2457, "loss": 1.510741949081421, "lr": 1.885425985301651e-05, "epoch": 0.73992673992674, "percentage": 24.66, "elapsed_time": "0:38:21", "remaining_time": "1:57:09"} | |
| {"current_steps": 608, "total_steps": 2457, "loss": 1.3180582523345947, "lr": 1.884174026624868e-05, "epoch": 0.7423687423687424, "percentage": 24.75, "elapsed_time": "0:38:28", "remaining_time": "1:57:01"} | |
| {"current_steps": 610, "total_steps": 2457, "loss": 0.9663639664649963, "lr": 1.8829157351679116e-05, "epoch": 0.7448107448107448, "percentage": 24.83, "elapsed_time": "0:38:34", "remaining_time": "1:56:48"} | |
| {"current_steps": 612, "total_steps": 2457, "loss": 1.2966718673706055, "lr": 1.881651121092408e-05, "epoch": 0.7472527472527473, "percentage": 24.91, "elapsed_time": "0:38:41", "remaining_time": "1:56:38"} | |
| {"current_steps": 614, "total_steps": 2457, "loss": 1.2717726230621338, "lr": 1.880380194611044e-05, "epoch": 0.7496947496947497, "percentage": 24.99, "elapsed_time": "0:38:47", "remaining_time": "1:56:27"} | |
| {"current_steps": 616, "total_steps": 2457, "loss": 1.0650262832641602, "lr": 1.8791029659874817e-05, "epoch": 0.7521367521367521, "percentage": 25.07, "elapsed_time": "0:38:52", "remaining_time": "1:56:10"} | |
| {"current_steps": 618, "total_steps": 2457, "loss": 1.6179522275924683, "lr": 1.877819445536279e-05, "epoch": 0.7545787545787546, "percentage": 25.15, "elapsed_time": "0:38:58", "remaining_time": "1:55:59"} | |
| {"current_steps": 620, "total_steps": 2457, "loss": 1.1963871717453003, "lr": 1.8765296436228043e-05, "epoch": 0.757020757020757, "percentage": 25.23, "elapsed_time": "0:39:05", "remaining_time": "1:55:48"} | |
| {"current_steps": 622, "total_steps": 2457, "loss": 0.9286983013153076, "lr": 1.875233570663154e-05, "epoch": 0.7594627594627594, "percentage": 25.32, "elapsed_time": "0:39:12", "remaining_time": "1:55:41"} | |
| {"current_steps": 624, "total_steps": 2457, "loss": 1.2990517616271973, "lr": 1.8739312371240678e-05, "epoch": 0.7619047619047619, "percentage": 25.4, "elapsed_time": "0:39:19", "remaining_time": "1:55:31"} | |
| {"current_steps": 626, "total_steps": 2457, "loss": 1.352059006690979, "lr": 1.8726226535228425e-05, "epoch": 0.7643467643467643, "percentage": 25.48, "elapsed_time": "0:39:26", "remaining_time": "1:55:20"} | |
| {"current_steps": 628, "total_steps": 2457, "loss": 1.1491894721984863, "lr": 1.871307830427251e-05, "epoch": 0.7667887667887668, "percentage": 25.56, "elapsed_time": "0:39:32", "remaining_time": "1:55:08"} | |
| {"current_steps": 630, "total_steps": 2457, "loss": 1.3350757360458374, "lr": 1.8699867784554537e-05, "epoch": 0.7692307692307693, "percentage": 25.64, "elapsed_time": "0:39:38", "remaining_time": "1:54:56"} | |
| {"current_steps": 632, "total_steps": 2457, "loss": 1.0210474729537964, "lr": 1.868659508275914e-05, "epoch": 0.7716727716727717, "percentage": 25.72, "elapsed_time": "0:39:43", "remaining_time": "1:54:43"} | |
| {"current_steps": 634, "total_steps": 2457, "loss": 1.0034987926483154, "lr": 1.867326030607311e-05, "epoch": 0.7741147741147741, "percentage": 25.8, "elapsed_time": "0:39:51", "remaining_time": "1:54:36"} | |
| {"current_steps": 636, "total_steps": 2457, "loss": 1.3230623006820679, "lr": 1.8659863562184552e-05, "epoch": 0.7765567765567766, "percentage": 25.89, "elapsed_time": "0:39:58", "remaining_time": "1:54:28"} | |
| {"current_steps": 638, "total_steps": 2457, "loss": 1.3143547773361206, "lr": 1.8646404959281986e-05, "epoch": 0.778998778998779, "percentage": 25.97, "elapsed_time": "0:40:04", "remaining_time": "1:54:16"} | |
| {"current_steps": 640, "total_steps": 2457, "loss": 0.9751634001731873, "lr": 1.8632884606053506e-05, "epoch": 0.7814407814407814, "percentage": 26.05, "elapsed_time": "0:40:11", "remaining_time": "1:54:05"} | |
| {"current_steps": 642, "total_steps": 2457, "loss": 1.1349761486053467, "lr": 1.861930261168587e-05, "epoch": 0.7838827838827839, "percentage": 26.13, "elapsed_time": "0:40:16", "remaining_time": "1:53:51"} | |
| {"current_steps": 644, "total_steps": 2457, "loss": 1.2226810455322266, "lr": 1.860565908586365e-05, "epoch": 0.7863247863247863, "percentage": 26.21, "elapsed_time": "0:40:23", "remaining_time": "1:53:42"} | |
| {"current_steps": 646, "total_steps": 2457, "loss": 1.0119144916534424, "lr": 1.859195413876831e-05, "epoch": 0.7887667887667887, "percentage": 26.29, "elapsed_time": "0:40:30", "remaining_time": "1:53:32"} | |
| {"current_steps": 648, "total_steps": 2457, "loss": 1.26012122631073, "lr": 1.857818788107734e-05, "epoch": 0.7912087912087912, "percentage": 26.37, "elapsed_time": "0:40:36", "remaining_time": "1:53:21"} | |
| {"current_steps": 650, "total_steps": 2457, "loss": 0.5898873209953308, "lr": 1.856436042396338e-05, "epoch": 0.7936507936507936, "percentage": 26.46, "elapsed_time": "0:40:42", "remaining_time": "1:53:09"} | |
| {"current_steps": 652, "total_steps": 2457, "loss": 0.8887655138969421, "lr": 1.8550471879093275e-05, "epoch": 0.796092796092796, "percentage": 26.54, "elapsed_time": "0:40:48", "remaining_time": "1:52:58"} | |
| {"current_steps": 654, "total_steps": 2457, "loss": 1.2602205276489258, "lr": 1.8536522358627205e-05, "epoch": 0.7985347985347986, "percentage": 26.62, "elapsed_time": "0:40:55", "remaining_time": "1:52:49"} | |
| {"current_steps": 656, "total_steps": 2457, "loss": 1.2750191688537598, "lr": 1.852251197521778e-05, "epoch": 0.800976800976801, "percentage": 26.7, "elapsed_time": "0:41:02", "remaining_time": "1:52:40"} | |
| {"current_steps": 658, "total_steps": 2457, "loss": 0.5839018225669861, "lr": 1.8508440842009113e-05, "epoch": 0.8034188034188035, "percentage": 26.78, "elapsed_time": "0:41:08", "remaining_time": "1:52:29"} | |
| {"current_steps": 660, "total_steps": 2457, "loss": 1.297167181968689, "lr": 1.849430907263592e-05, "epoch": 0.8058608058608059, "percentage": 26.86, "elapsed_time": "0:41:16", "remaining_time": "1:52:22"} | |
| {"current_steps": 662, "total_steps": 2457, "loss": 1.2555423974990845, "lr": 1.8480116781222604e-05, "epoch": 0.8083028083028083, "percentage": 26.94, "elapsed_time": "0:41:23", "remaining_time": "1:52:13"} | |
| {"current_steps": 664, "total_steps": 2457, "loss": 1.3545968532562256, "lr": 1.846586408238232e-05, "epoch": 0.8107448107448108, "percentage": 27.02, "elapsed_time": "0:41:31", "remaining_time": "1:52:06"} | |
| {"current_steps": 666, "total_steps": 2457, "loss": 0.9384480118751526, "lr": 1.8451551091216064e-05, "epoch": 0.8131868131868132, "percentage": 27.11, "elapsed_time": "0:41:41", "remaining_time": "1:52:07"} | |
| {"current_steps": 668, "total_steps": 2457, "loss": 1.0872721672058105, "lr": 1.8437177923311728e-05, "epoch": 0.8156288156288156, "percentage": 27.19, "elapsed_time": "0:41:49", "remaining_time": "1:51:59"} | |
| {"current_steps": 670, "total_steps": 2457, "loss": 1.4501525163650513, "lr": 1.842274469474318e-05, "epoch": 0.818070818070818, "percentage": 27.27, "elapsed_time": "0:41:55", "remaining_time": "1:51:48"} | |
| {"current_steps": 672, "total_steps": 2457, "loss": 1.296190857887268, "lr": 1.8408251522069323e-05, "epoch": 0.8205128205128205, "percentage": 27.35, "elapsed_time": "0:42:01", "remaining_time": "1:51:37"} | |
| {"current_steps": 674, "total_steps": 2457, "loss": 1.076781153678894, "lr": 1.8393698522333158e-05, "epoch": 0.8229548229548229, "percentage": 27.43, "elapsed_time": "0:42:07", "remaining_time": "1:51:25"} | |
| {"current_steps": 676, "total_steps": 2457, "loss": 0.963850200176239, "lr": 1.837908581306082e-05, "epoch": 0.8253968253968254, "percentage": 27.51, "elapsed_time": "0:42:13", "remaining_time": "1:51:14"} | |
| {"current_steps": 678, "total_steps": 2457, "loss": 1.2688353061676025, "lr": 1.8364413512260656e-05, "epoch": 0.8278388278388278, "percentage": 27.59, "elapsed_time": "0:42:20", "remaining_time": "1:51:04"} | |
| {"current_steps": 680, "total_steps": 2457, "loss": 1.3245513439178467, "lr": 1.8349681738422245e-05, "epoch": 0.8302808302808303, "percentage": 27.68, "elapsed_time": "0:42:27", "remaining_time": "1:50:56"} | |
| {"current_steps": 682, "total_steps": 2457, "loss": 1.2618424892425537, "lr": 1.8334890610515465e-05, "epoch": 0.8327228327228328, "percentage": 27.76, "elapsed_time": "0:42:34", "remaining_time": "1:50:49"} | |
| {"current_steps": 684, "total_steps": 2457, "loss": 0.9116923213005066, "lr": 1.8320040247989516e-05, "epoch": 0.8351648351648352, "percentage": 27.84, "elapsed_time": "0:42:39", "remaining_time": "1:50:35"} | |
| {"current_steps": 686, "total_steps": 2457, "loss": 1.4006067514419556, "lr": 1.8305130770771966e-05, "epoch": 0.8376068376068376, "percentage": 27.92, "elapsed_time": "0:42:46", "remaining_time": "1:50:25"} | |
| {"current_steps": 688, "total_steps": 2457, "loss": 1.3707760572433472, "lr": 1.829016229926777e-05, "epoch": 0.8400488400488401, "percentage": 28.0, "elapsed_time": "0:42:55", "remaining_time": "1:50:22"} | |
| {"current_steps": 690, "total_steps": 2457, "loss": 1.0350643396377563, "lr": 1.827513495435831e-05, "epoch": 0.8424908424908425, "percentage": 28.08, "elapsed_time": "0:43:03", "remaining_time": "1:50:16"} | |
| {"current_steps": 692, "total_steps": 2457, "loss": 1.3101565837860107, "lr": 1.826004885740042e-05, "epoch": 0.8449328449328449, "percentage": 28.16, "elapsed_time": "0:43:13", "remaining_time": "1:50:13"} | |
| {"current_steps": 694, "total_steps": 2457, "loss": 1.1183477640151978, "lr": 1.8244904130225383e-05, "epoch": 0.8473748473748474, "percentage": 28.25, "elapsed_time": "0:43:20", "remaining_time": "1:50:07"} | |
| {"current_steps": 696, "total_steps": 2457, "loss": 1.2185040712356567, "lr": 1.8229700895137977e-05, "epoch": 0.8498168498168498, "percentage": 28.33, "elapsed_time": "0:43:28", "remaining_time": "1:49:59"} | |
| {"current_steps": 698, "total_steps": 2457, "loss": 1.0439921617507935, "lr": 1.821443927491548e-05, "epoch": 0.8522588522588522, "percentage": 28.41, "elapsed_time": "0:43:34", "remaining_time": "1:49:49"} | |
| {"current_steps": 700, "total_steps": 2457, "loss": 1.179707646369934, "lr": 1.819911939280665e-05, "epoch": 0.8547008547008547, "percentage": 28.49, "elapsed_time": "0:43:40", "remaining_time": "1:49:37"} | |
| {"current_steps": 702, "total_steps": 2457, "loss": 1.1061705350875854, "lr": 1.8183741372530778e-05, "epoch": 0.8571428571428571, "percentage": 28.57, "elapsed_time": "0:43:47", "remaining_time": "1:49:28"} | |
| {"current_steps": 704, "total_steps": 2457, "loss": 1.0052831172943115, "lr": 1.816830533827665e-05, "epoch": 0.8595848595848596, "percentage": 28.65, "elapsed_time": "0:43:54", "remaining_time": "1:49:20"} | |
| {"current_steps": 706, "total_steps": 2457, "loss": 0.5395532250404358, "lr": 1.815281141470155e-05, "epoch": 0.8620268620268621, "percentage": 28.73, "elapsed_time": "0:44:01", "remaining_time": "1:49:11"} | |
| {"current_steps": 708, "total_steps": 2457, "loss": 1.2419100999832153, "lr": 1.8137259726930283e-05, "epoch": 0.8644688644688645, "percentage": 28.82, "elapsed_time": "0:44:11", "remaining_time": "1:49:10"} | |
| {"current_steps": 710, "total_steps": 2457, "loss": 0.9318399429321289, "lr": 1.8121650400554125e-05, "epoch": 0.8669108669108669, "percentage": 28.9, "elapsed_time": "0:44:20", "remaining_time": "1:49:07"} | |
| {"current_steps": 712, "total_steps": 2457, "loss": 1.4534571170806885, "lr": 1.8105983561629827e-05, "epoch": 0.8693528693528694, "percentage": 28.98, "elapsed_time": "0:44:28", "remaining_time": "1:49:01"} | |
| {"current_steps": 714, "total_steps": 2457, "loss": 1.6200733184814453, "lr": 1.8090259336678598e-05, "epoch": 0.8717948717948718, "percentage": 29.06, "elapsed_time": "0:44:36", "remaining_time": "1:48:54"} | |
| {"current_steps": 716, "total_steps": 2457, "loss": 1.4871742725372314, "lr": 1.8074477852685088e-05, "epoch": 0.8742368742368742, "percentage": 29.14, "elapsed_time": "0:44:46", "remaining_time": "1:48:52"} | |
| {"current_steps": 718, "total_steps": 2457, "loss": 1.0001909732818604, "lr": 1.805863923709635e-05, "epoch": 0.8766788766788767, "percentage": 29.22, "elapsed_time": "0:44:54", "remaining_time": "1:48:46"} | |
| {"current_steps": 720, "total_steps": 2457, "loss": 1.2416490316390991, "lr": 1.8042743617820814e-05, "epoch": 0.8791208791208791, "percentage": 29.3, "elapsed_time": "0:45:04", "remaining_time": "1:48:44"} | |
| {"current_steps": 722, "total_steps": 2457, "loss": 0.8903718590736389, "lr": 1.8026791123227255e-05, "epoch": 0.8815628815628815, "percentage": 29.39, "elapsed_time": "0:45:10", "remaining_time": "1:48:34"} | |
| {"current_steps": 724, "total_steps": 2457, "loss": 1.285760521888733, "lr": 1.8010781882143773e-05, "epoch": 0.884004884004884, "percentage": 29.47, "elapsed_time": "0:45:19", "remaining_time": "1:48:30"} | |
| {"current_steps": 726, "total_steps": 2457, "loss": 1.2185858488082886, "lr": 1.799471602385672e-05, "epoch": 0.8864468864468864, "percentage": 29.55, "elapsed_time": "0:45:27", "remaining_time": "1:48:23"} | |
| {"current_steps": 728, "total_steps": 2457, "loss": 1.2078474760055542, "lr": 1.797859367810968e-05, "epoch": 0.8888888888888888, "percentage": 29.63, "elapsed_time": "0:45:35", "remaining_time": "1:48:16"} | |
| {"current_steps": 730, "total_steps": 2457, "loss": 1.4831866025924683, "lr": 1.7962414975102416e-05, "epoch": 0.8913308913308914, "percentage": 29.71, "elapsed_time": "0:45:40", "remaining_time": "1:48:04"} | |
| {"current_steps": 732, "total_steps": 2457, "loss": 1.2522797584533691, "lr": 1.794618004548982e-05, "epoch": 0.8937728937728938, "percentage": 29.79, "elapsed_time": "0:45:48", "remaining_time": "1:47:57"} | |
| {"current_steps": 734, "total_steps": 2457, "loss": 1.0359210968017578, "lr": 1.7929889020380842e-05, "epoch": 0.8962148962148963, "percentage": 29.87, "elapsed_time": "0:45:56", "remaining_time": "1:47:49"} | |
| {"current_steps": 736, "total_steps": 2457, "loss": 0.8198949098587036, "lr": 1.791354203133746e-05, "epoch": 0.8986568986568987, "percentage": 29.96, "elapsed_time": "0:46:01", "remaining_time": "1:47:36"} | |
| {"current_steps": 738, "total_steps": 2457, "loss": 0.9690486788749695, "lr": 1.7897139210373594e-05, "epoch": 0.9010989010989011, "percentage": 30.04, "elapsed_time": "0:46:07", "remaining_time": "1:47:26"} | |
| {"current_steps": 740, "total_steps": 2457, "loss": 1.0706011056900024, "lr": 1.7880680689954047e-05, "epoch": 0.9035409035409036, "percentage": 30.12, "elapsed_time": "0:46:13", "remaining_time": "1:47:15"} | |
| {"current_steps": 742, "total_steps": 2457, "loss": 0.9173503518104553, "lr": 1.786416660299344e-05, "epoch": 0.905982905982906, "percentage": 30.2, "elapsed_time": "0:46:19", "remaining_time": "1:47:04"} | |
| {"current_steps": 744, "total_steps": 2457, "loss": 0.9544399976730347, "lr": 1.7847597082855133e-05, "epoch": 0.9084249084249084, "percentage": 30.28, "elapsed_time": "0:46:25", "remaining_time": "1:46:53"} | |
| {"current_steps": 746, "total_steps": 2457, "loss": 1.2056411504745483, "lr": 1.7830972263350142e-05, "epoch": 0.9108669108669109, "percentage": 30.36, "elapsed_time": "0:46:34", "remaining_time": "1:46:48"} | |
| {"current_steps": 748, "total_steps": 2457, "loss": 0.9109166264533997, "lr": 1.7814292278736084e-05, "epoch": 0.9133089133089133, "percentage": 30.44, "elapsed_time": "0:46:40", "remaining_time": "1:46:39"} | |
| {"current_steps": 750, "total_steps": 2457, "loss": 1.401995301246643, "lr": 1.7797557263716054e-05, "epoch": 0.9157509157509157, "percentage": 30.53, "elapsed_time": "0:46:47", "remaining_time": "1:46:28"} | |
| {"current_steps": 752, "total_steps": 2457, "loss": 1.2727299928665161, "lr": 1.7780767353437573e-05, "epoch": 0.9181929181929182, "percentage": 30.61, "elapsed_time": "0:46:52", "remaining_time": "1:46:17"} | |
| {"current_steps": 754, "total_steps": 2457, "loss": 1.2869514226913452, "lr": 1.7763922683491476e-05, "epoch": 0.9206349206349206, "percentage": 30.69, "elapsed_time": "0:46:58", "remaining_time": "1:46:05"} | |
| {"current_steps": 756, "total_steps": 2457, "loss": 1.2656826972961426, "lr": 1.7747023389910815e-05, "epoch": 0.9230769230769231, "percentage": 30.77, "elapsed_time": "0:47:05", "remaining_time": "1:45:56"} | |
| {"current_steps": 758, "total_steps": 2457, "loss": 1.3375307321548462, "lr": 1.773006960916978e-05, "epoch": 0.9255189255189256, "percentage": 30.85, "elapsed_time": "0:47:11", "remaining_time": "1:45:46"} | |
| {"current_steps": 760, "total_steps": 2457, "loss": 0.8308702111244202, "lr": 1.7713061478182582e-05, "epoch": 0.927960927960928, "percentage": 30.93, "elapsed_time": "0:47:17", "remaining_time": "1:45:36"} | |
| {"current_steps": 762, "total_steps": 2457, "loss": 1.2227895259857178, "lr": 1.7695999134302348e-05, "epoch": 0.9304029304029304, "percentage": 31.01, "elapsed_time": "0:47:22", "remaining_time": "1:45:23"} | |
| {"current_steps": 764, "total_steps": 2457, "loss": 0.9452077150344849, "lr": 1.767888271532001e-05, "epoch": 0.9328449328449329, "percentage": 31.09, "elapsed_time": "0:47:29", "remaining_time": "1:45:13"} | |
| {"current_steps": 766, "total_steps": 2457, "loss": 0.6139346957206726, "lr": 1.7661712359463202e-05, "epoch": 0.9352869352869353, "percentage": 31.18, "elapsed_time": "0:47:36", "remaining_time": "1:45:06"} | |
| {"current_steps": 768, "total_steps": 2457, "loss": 0.9175626039505005, "lr": 1.7644488205395136e-05, "epoch": 0.9377289377289377, "percentage": 31.26, "elapsed_time": "0:47:44", "remaining_time": "1:44:58"} | |
| {"current_steps": 770, "total_steps": 2457, "loss": 0.7235321402549744, "lr": 1.7627210392213484e-05, "epoch": 0.9401709401709402, "percentage": 31.34, "elapsed_time": "0:47:49", "remaining_time": "1:44:45"} | |
| {"current_steps": 772, "total_steps": 2457, "loss": 1.1240880489349365, "lr": 1.7609879059449256e-05, "epoch": 0.9426129426129426, "percentage": 31.42, "elapsed_time": "0:47:54", "remaining_time": "1:44:34"} | |
| {"current_steps": 774, "total_steps": 2457, "loss": 1.3139581680297852, "lr": 1.7592494347065667e-05, "epoch": 0.945054945054945, "percentage": 31.5, "elapsed_time": "0:48:01", "remaining_time": "1:44:25"} | |
| {"current_steps": 776, "total_steps": 2457, "loss": 1.2285006046295166, "lr": 1.7575056395457017e-05, "epoch": 0.9474969474969475, "percentage": 31.58, "elapsed_time": "0:48:07", "remaining_time": "1:44:15"} | |
| {"current_steps": 778, "total_steps": 2457, "loss": 0.9121115207672119, "lr": 1.7557565345447548e-05, "epoch": 0.9499389499389499, "percentage": 31.66, "elapsed_time": "0:48:13", "remaining_time": "1:44:04"} | |
| {"current_steps": 780, "total_steps": 2457, "loss": 1.1289280652999878, "lr": 1.754002133829031e-05, "epoch": 0.9523809523809523, "percentage": 31.75, "elapsed_time": "0:48:21", "remaining_time": "1:43:58"} | |
| {"current_steps": 782, "total_steps": 2457, "loss": 1.1398252248764038, "lr": 1.752242451566603e-05, "epoch": 0.9548229548229549, "percentage": 31.83, "elapsed_time": "0:48:28", "remaining_time": "1:43:50"} | |
| {"current_steps": 784, "total_steps": 2457, "loss": 1.263461709022522, "lr": 1.7504775019681946e-05, "epoch": 0.9572649572649573, "percentage": 31.91, "elapsed_time": "0:48:34", "remaining_time": "1:43:40"} | |
| {"current_steps": 786, "total_steps": 2457, "loss": 1.2938859462738037, "lr": 1.7487072992870683e-05, "epoch": 0.9597069597069597, "percentage": 31.99, "elapsed_time": "0:48:41", "remaining_time": "1:43:29"} | |
| {"current_steps": 788, "total_steps": 2457, "loss": 1.3971589803695679, "lr": 1.746931857818908e-05, "epoch": 0.9621489621489622, "percentage": 32.07, "elapsed_time": "0:48:47", "remaining_time": "1:43:20"} | |
| {"current_steps": 790, "total_steps": 2457, "loss": 1.341101884841919, "lr": 1.7451511919017054e-05, "epoch": 0.9645909645909646, "percentage": 32.15, "elapsed_time": "0:48:54", "remaining_time": "1:43:12"} | |
| {"current_steps": 792, "total_steps": 2457, "loss": 1.0966370105743408, "lr": 1.743365315915643e-05, "epoch": 0.967032967032967, "percentage": 32.23, "elapsed_time": "0:49:01", "remaining_time": "1:43:03"} | |
| {"current_steps": 794, "total_steps": 2457, "loss": 1.3368990421295166, "lr": 1.7415742442829792e-05, "epoch": 0.9694749694749695, "percentage": 32.32, "elapsed_time": "0:49:08", "remaining_time": "1:42:55"} | |
| {"current_steps": 796, "total_steps": 2457, "loss": 1.2155550718307495, "lr": 1.7397779914679303e-05, "epoch": 0.9719169719169719, "percentage": 32.4, "elapsed_time": "0:49:14", "remaining_time": "1:42:45"} | |
| {"current_steps": 798, "total_steps": 2457, "loss": 1.2150750160217285, "lr": 1.7379765719765542e-05, "epoch": 0.9743589743589743, "percentage": 32.48, "elapsed_time": "0:49:20", "remaining_time": "1:42:34"} | |
| {"current_steps": 800, "total_steps": 2457, "loss": 1.2871735095977783, "lr": 1.7361700003566348e-05, "epoch": 0.9768009768009768, "percentage": 32.56, "elapsed_time": "0:49:26", "remaining_time": "1:42:24"} | |
| {"current_steps": 802, "total_steps": 2457, "loss": 0.9395040273666382, "lr": 1.734358291197562e-05, "epoch": 0.9792429792429792, "percentage": 32.64, "elapsed_time": "0:49:33", "remaining_time": "1:42:16"} | |
| {"current_steps": 804, "total_steps": 2457, "loss": 1.1477895975112915, "lr": 1.732541459130215e-05, "epoch": 0.9816849816849816, "percentage": 32.72, "elapsed_time": "0:49:39", "remaining_time": "1:42:06"} | |
| {"current_steps": 806, "total_steps": 2457, "loss": 1.573718547821045, "lr": 1.730719518826846e-05, "epoch": 0.9841269841269841, "percentage": 32.8, "elapsed_time": "0:49:45", "remaining_time": "1:41:56"} | |
| {"current_steps": 808, "total_steps": 2457, "loss": 0.9391233325004578, "lr": 1.7288924850009576e-05, "epoch": 0.9865689865689866, "percentage": 32.89, "elapsed_time": "0:49:52", "remaining_time": "1:41:47"} | |
| {"current_steps": 810, "total_steps": 2457, "loss": 1.364790916442871, "lr": 1.7270603724071876e-05, "epoch": 0.989010989010989, "percentage": 32.97, "elapsed_time": "0:49:59", "remaining_time": "1:41:38"} | |
| {"current_steps": 812, "total_steps": 2457, "loss": 1.2704541683197021, "lr": 1.725223195841189e-05, "epoch": 0.9914529914529915, "percentage": 33.05, "elapsed_time": "0:50:07", "remaining_time": "1:41:32"} | |
| {"current_steps": 814, "total_steps": 2457, "loss": 1.35564386844635, "lr": 1.7233809701395087e-05, "epoch": 0.9938949938949939, "percentage": 33.13, "elapsed_time": "0:50:14", "remaining_time": "1:41:23"} | |
| {"current_steps": 816, "total_steps": 2457, "loss": 1.233031153678894, "lr": 1.72153371017947e-05, "epoch": 0.9963369963369964, "percentage": 33.21, "elapsed_time": "0:50:20", "remaining_time": "1:41:13"} | |
| {"current_steps": 818, "total_steps": 2457, "loss": 1.1463748216629028, "lr": 1.7196814308790516e-05, "epoch": 0.9987789987789988, "percentage": 33.29, "elapsed_time": "0:50:26", "remaining_time": "1:41:04"} | |
| {"current_steps": 820, "total_steps": 2457, "loss": 1.007127285003662, "lr": 1.717824147196767e-05, "epoch": 1.0012210012210012, "percentage": 33.37, "elapsed_time": "0:50:33", "remaining_time": "1:40:55"} | |
| {"current_steps": 822, "total_steps": 2457, "loss": 1.0883307456970215, "lr": 1.7159618741315433e-05, "epoch": 1.0036630036630036, "percentage": 33.46, "elapsed_time": "0:50:39", "remaining_time": "1:40:46"} | |
| {"current_steps": 824, "total_steps": 2457, "loss": 0.4619407653808594, "lr": 1.7140946267226006e-05, "epoch": 1.006105006105006, "percentage": 33.54, "elapsed_time": "0:50:46", "remaining_time": "1:40:37"} | |
| {"current_steps": 826, "total_steps": 2457, "loss": 0.8937675356864929, "lr": 1.712222420049331e-05, "epoch": 1.0085470085470085, "percentage": 33.62, "elapsed_time": "0:50:55", "remaining_time": "1:40:32"} | |
| {"current_steps": 828, "total_steps": 2457, "loss": 0.7834187150001526, "lr": 1.7103452692311756e-05, "epoch": 1.010989010989011, "percentage": 33.7, "elapsed_time": "0:51:01", "remaining_time": "1:40:22"} | |
| {"current_steps": 830, "total_steps": 2457, "loss": 0.7017002105712891, "lr": 1.708463189427504e-05, "epoch": 1.0134310134310134, "percentage": 33.78, "elapsed_time": "0:51:06", "remaining_time": "1:40:11"} | |
| {"current_steps": 832, "total_steps": 2457, "loss": 0.9201502203941345, "lr": 1.7065761958374905e-05, "epoch": 1.0158730158730158, "percentage": 33.86, "elapsed_time": "0:51:12", "remaining_time": "1:40:01"} | |
| {"current_steps": 834, "total_steps": 2457, "loss": 0.9217178821563721, "lr": 1.7046843036999912e-05, "epoch": 1.0183150183150182, "percentage": 33.94, "elapsed_time": "0:51:18", "remaining_time": "1:39:51"} | |
| {"current_steps": 836, "total_steps": 2457, "loss": 1.00894033908844, "lr": 1.7027875282934224e-05, "epoch": 1.0207570207570207, "percentage": 34.03, "elapsed_time": "0:51:26", "remaining_time": "1:39:44"} | |
| {"current_steps": 838, "total_steps": 2457, "loss": 1.0666855573654175, "lr": 1.7008858849356363e-05, "epoch": 1.0231990231990231, "percentage": 34.11, "elapsed_time": "0:51:33", "remaining_time": "1:39:37"} | |
| {"current_steps": 840, "total_steps": 2457, "loss": 0.7795441746711731, "lr": 1.6989793889837966e-05, "epoch": 1.0256410256410255, "percentage": 34.19, "elapsed_time": "0:51:41", "remaining_time": "1:39:29"} | |
| {"current_steps": 842, "total_steps": 2457, "loss": 0.7524101734161377, "lr": 1.6970680558342566e-05, "epoch": 1.028083028083028, "percentage": 34.27, "elapsed_time": "0:51:47", "remaining_time": "1:39:20"} | |
| {"current_steps": 844, "total_steps": 2457, "loss": 0.9602640271186829, "lr": 1.695151900922432e-05, "epoch": 1.0305250305250304, "percentage": 34.35, "elapsed_time": "0:51:54", "remaining_time": "1:39:11"} | |
| {"current_steps": 846, "total_steps": 2457, "loss": 0.8459327816963196, "lr": 1.6932309397226792e-05, "epoch": 1.032967032967033, "percentage": 34.43, "elapsed_time": "0:52:02", "remaining_time": "1:39:06"} | |
| {"current_steps": 848, "total_steps": 2457, "loss": 1.1561813354492188, "lr": 1.6913051877481676e-05, "epoch": 1.0354090354090355, "percentage": 34.51, "elapsed_time": "0:52:09", "remaining_time": "1:38:57"} | |
| {"current_steps": 850, "total_steps": 2457, "loss": 0.7689896821975708, "lr": 1.6893746605507567e-05, "epoch": 1.037851037851038, "percentage": 34.6, "elapsed_time": "0:52:16", "remaining_time": "1:38:49"} | |
| {"current_steps": 852, "total_steps": 2457, "loss": 0.5241991281509399, "lr": 1.6874393737208688e-05, "epoch": 1.0402930402930404, "percentage": 34.68, "elapsed_time": "0:52:22", "remaining_time": "1:38:39"} | |
| {"current_steps": 854, "total_steps": 2457, "loss": 1.0428876876831055, "lr": 1.685499342887364e-05, "epoch": 1.0427350427350428, "percentage": 34.76, "elapsed_time": "0:52:28", "remaining_time": "1:38:30"} | |
| {"current_steps": 856, "total_steps": 2457, "loss": 0.668832004070282, "lr": 1.6835545837174132e-05, "epoch": 1.0451770451770452, "percentage": 34.84, "elapsed_time": "0:52:34", "remaining_time": "1:38:20"} | |
| {"current_steps": 858, "total_steps": 2457, "loss": 1.2478870153427124, "lr": 1.681605111916373e-05, "epoch": 1.0476190476190477, "percentage": 34.92, "elapsed_time": "0:52:41", "remaining_time": "1:38:11"} | |
| {"current_steps": 860, "total_steps": 2457, "loss": 0.8985828161239624, "lr": 1.679650943227657e-05, "epoch": 1.05006105006105, "percentage": 35.0, "elapsed_time": "0:52:47", "remaining_time": "1:38:02"} | |
| {"current_steps": 862, "total_steps": 2457, "loss": 1.0257023572921753, "lr": 1.6776920934326103e-05, "epoch": 1.0525030525030525, "percentage": 35.08, "elapsed_time": "0:52:53", "remaining_time": "1:37:51"} | |
| {"current_steps": 864, "total_steps": 2457, "loss": 1.0212005376815796, "lr": 1.675728578350381e-05, "epoch": 1.054945054945055, "percentage": 35.16, "elapsed_time": "0:52:59", "remaining_time": "1:37:42"} | |
| {"current_steps": 866, "total_steps": 2457, "loss": 1.4508510828018188, "lr": 1.673760413837793e-05, "epoch": 1.0573870573870574, "percentage": 35.25, "elapsed_time": "0:53:06", "remaining_time": "1:37:33"} | |
| {"current_steps": 868, "total_steps": 2457, "loss": 0.5031489729881287, "lr": 1.6717876157892175e-05, "epoch": 1.0598290598290598, "percentage": 35.33, "elapsed_time": "0:53:14", "remaining_time": "1:37:27"} | |
| {"current_steps": 870, "total_steps": 2457, "loss": 0.9893677234649658, "lr": 1.6698102001364456e-05, "epoch": 1.0622710622710623, "percentage": 35.41, "elapsed_time": "0:53:20", "remaining_time": "1:37:17"} | |
| {"current_steps": 872, "total_steps": 2457, "loss": 0.897520124912262, "lr": 1.6678281828485576e-05, "epoch": 1.0647130647130647, "percentage": 35.49, "elapsed_time": "0:53:26", "remaining_time": "1:37:08"} | |
| {"current_steps": 874, "total_steps": 2457, "loss": 0.7381224036216736, "lr": 1.6658415799317966e-05, "epoch": 1.0671550671550671, "percentage": 35.57, "elapsed_time": "0:53:32", "remaining_time": "1:36:59"} | |
| {"current_steps": 876, "total_steps": 2457, "loss": 0.9826089143753052, "lr": 1.6638504074294375e-05, "epoch": 1.0695970695970696, "percentage": 35.65, "elapsed_time": "0:53:39", "remaining_time": "1:36:49"} | |
| {"current_steps": 878, "total_steps": 2457, "loss": 1.0204219818115234, "lr": 1.6618546814216586e-05, "epoch": 1.072039072039072, "percentage": 35.73, "elapsed_time": "0:53:45", "remaining_time": "1:36:40"} | |
| {"current_steps": 880, "total_steps": 2457, "loss": 0.6614128947257996, "lr": 1.65985441802541e-05, "epoch": 1.0744810744810744, "percentage": 35.82, "elapsed_time": "0:53:52", "remaining_time": "1:36:32"} | |
| {"current_steps": 882, "total_steps": 2457, "loss": 0.9977365732192993, "lr": 1.6578496333942848e-05, "epoch": 1.0769230769230769, "percentage": 35.9, "elapsed_time": "0:53:58", "remaining_time": "1:36:23"} | |
| {"current_steps": 884, "total_steps": 2457, "loss": 0.6593250036239624, "lr": 1.655840343718389e-05, "epoch": 1.0793650793650793, "percentage": 35.98, "elapsed_time": "0:54:05", "remaining_time": "1:36:15"} | |
| {"current_steps": 886, "total_steps": 2457, "loss": 0.7343877553939819, "lr": 1.6538265652242103e-05, "epoch": 1.0818070818070817, "percentage": 36.06, "elapsed_time": "0:54:12", "remaining_time": "1:36:06"} | |
| {"current_steps": 888, "total_steps": 2457, "loss": 1.0775821208953857, "lr": 1.6518083141744862e-05, "epoch": 1.0842490842490842, "percentage": 36.14, "elapsed_time": "0:54:18", "remaining_time": "1:35:57"} | |
| {"current_steps": 890, "total_steps": 2457, "loss": 0.7265040874481201, "lr": 1.649785606868073e-05, "epoch": 1.0866910866910866, "percentage": 36.22, "elapsed_time": "0:54:24", "remaining_time": "1:35:48"} | |
| {"current_steps": 892, "total_steps": 2457, "loss": 0.94173663854599, "lr": 1.647758459639816e-05, "epoch": 1.089133089133089, "percentage": 36.3, "elapsed_time": "0:54:32", "remaining_time": "1:35:40"} | |
| {"current_steps": 894, "total_steps": 2457, "loss": 1.1309514045715332, "lr": 1.6457268888604143e-05, "epoch": 1.0915750915750915, "percentage": 36.39, "elapsed_time": "0:54:38", "remaining_time": "1:35:31"} | |
| {"current_steps": 896, "total_steps": 2457, "loss": 1.1048157215118408, "lr": 1.643690910936292e-05, "epoch": 1.0940170940170941, "percentage": 36.47, "elapsed_time": "0:54:45", "remaining_time": "1:35:24"} | |
| {"current_steps": 898, "total_steps": 2457, "loss": 0.8980664014816284, "lr": 1.6416505423094636e-05, "epoch": 1.0964590964590966, "percentage": 36.55, "elapsed_time": "0:54:52", "remaining_time": "1:35:15"} | |
| {"current_steps": 900, "total_steps": 2457, "loss": 0.6644148826599121, "lr": 1.639605799457401e-05, "epoch": 1.098901098901099, "percentage": 36.63, "elapsed_time": "0:54:57", "remaining_time": "1:35:04"} | |
| {"current_steps": 902, "total_steps": 2457, "loss": 0.6176282167434692, "lr": 1.6375566988929025e-05, "epoch": 1.1013431013431014, "percentage": 36.71, "elapsed_time": "0:55:03", "remaining_time": "1:34:55"} | |
| {"current_steps": 904, "total_steps": 2457, "loss": 0.5790269374847412, "lr": 1.6355032571639574e-05, "epoch": 1.1037851037851039, "percentage": 36.79, "elapsed_time": "0:55:09", "remaining_time": "1:34:45"} | |
| {"current_steps": 906, "total_steps": 2457, "loss": 0.8540843725204468, "lr": 1.6334454908536123e-05, "epoch": 1.1062271062271063, "percentage": 36.87, "elapsed_time": "0:55:15", "remaining_time": "1:34:35"} | |
| {"current_steps": 908, "total_steps": 2457, "loss": 1.0307986736297607, "lr": 1.631383416579839e-05, "epoch": 1.1086691086691087, "percentage": 36.96, "elapsed_time": "0:55:22", "remaining_time": "1:34:27"} | |
| {"current_steps": 910, "total_steps": 2457, "loss": 0.7846847176551819, "lr": 1.6293170509954e-05, "epoch": 1.1111111111111112, "percentage": 37.04, "elapsed_time": "0:55:28", "remaining_time": "1:34:17"} | |
| {"current_steps": 912, "total_steps": 2457, "loss": 1.0868881940841675, "lr": 1.6272464107877112e-05, "epoch": 1.1135531135531136, "percentage": 37.12, "elapsed_time": "0:55:34", "remaining_time": "1:34:08"} | |
| {"current_steps": 914, "total_steps": 2457, "loss": 0.6077226400375366, "lr": 1.6251715126787114e-05, "epoch": 1.115995115995116, "percentage": 37.2, "elapsed_time": "0:55:40", "remaining_time": "1:33:59"} | |
| {"current_steps": 916, "total_steps": 2457, "loss": 0.7134993076324463, "lr": 1.623092373424723e-05, "epoch": 1.1184371184371185, "percentage": 37.28, "elapsed_time": "0:55:48", "remaining_time": "1:33:52"} | |
| {"current_steps": 918, "total_steps": 2457, "loss": 1.1230908632278442, "lr": 1.6210090098163206e-05, "epoch": 1.120879120879121, "percentage": 37.36, "elapsed_time": "0:55:55", "remaining_time": "1:33:45"} | |
| {"current_steps": 920, "total_steps": 2457, "loss": 0.9432562589645386, "lr": 1.618921438678192e-05, "epoch": 1.1233211233211233, "percentage": 37.44, "elapsed_time": "0:56:03", "remaining_time": "1:33:38"} | |
| {"current_steps": 922, "total_steps": 2457, "loss": 0.8601541519165039, "lr": 1.616829676869005e-05, "epoch": 1.1257631257631258, "percentage": 37.53, "elapsed_time": "0:56:10", "remaining_time": "1:33:31"} | |
| {"current_steps": 924, "total_steps": 2457, "loss": 0.7565584778785706, "lr": 1.61473374128127e-05, "epoch": 1.1282051282051282, "percentage": 37.61, "elapsed_time": "0:56:18", "remaining_time": "1:33:25"} | |
| {"current_steps": 926, "total_steps": 2457, "loss": 0.6475503444671631, "lr": 1.612633648841203e-05, "epoch": 1.1306471306471306, "percentage": 37.69, "elapsed_time": "0:56:29", "remaining_time": "1:33:24"} | |
| {"current_steps": 928, "total_steps": 2457, "loss": 0.5194863677024841, "lr": 1.61052941650859e-05, "epoch": 1.133089133089133, "percentage": 37.77, "elapsed_time": "0:56:37", "remaining_time": "1:33:17"} | |
| {"current_steps": 930, "total_steps": 2457, "loss": 0.8809158205986023, "lr": 1.608421061276651e-05, "epoch": 1.1355311355311355, "percentage": 37.85, "elapsed_time": "0:56:45", "remaining_time": "1:33:11"} | |
| {"current_steps": 932, "total_steps": 2457, "loss": 1.0729451179504395, "lr": 1.6063086001718986e-05, "epoch": 1.137973137973138, "percentage": 37.93, "elapsed_time": "0:56:53", "remaining_time": "1:33:05"} | |
| {"current_steps": 934, "total_steps": 2457, "loss": 1.008049726486206, "lr": 1.6041920502540058e-05, "epoch": 1.1404151404151404, "percentage": 38.01, "elapsed_time": "0:57:02", "remaining_time": "1:33:00"} | |
| {"current_steps": 936, "total_steps": 2457, "loss": 0.8578592538833618, "lr": 1.6020714286156646e-05, "epoch": 1.1428571428571428, "percentage": 38.1, "elapsed_time": "0:57:08", "remaining_time": "1:32:51"} | |
| {"current_steps": 938, "total_steps": 2457, "loss": 0.9546090960502625, "lr": 1.59994675238245e-05, "epoch": 1.1452991452991452, "percentage": 38.18, "elapsed_time": "0:57:16", "remaining_time": "1:32:44"} | |
| {"current_steps": 940, "total_steps": 2457, "loss": 1.0442495346069336, "lr": 1.5978180387126797e-05, "epoch": 1.1477411477411477, "percentage": 38.26, "elapsed_time": "0:57:23", "remaining_time": "1:32:36"} | |
| {"current_steps": 942, "total_steps": 2457, "loss": 0.8928858637809753, "lr": 1.5956853047972776e-05, "epoch": 1.15018315018315, "percentage": 38.34, "elapsed_time": "0:57:29", "remaining_time": "1:32:27"} | |
| {"current_steps": 944, "total_steps": 2457, "loss": 0.8579668998718262, "lr": 1.5935485678596328e-05, "epoch": 1.1526251526251525, "percentage": 38.42, "elapsed_time": "0:57:36", "remaining_time": "1:32:20"} | |
| {"current_steps": 946, "total_steps": 2457, "loss": 0.683056652545929, "lr": 1.5914078451554637e-05, "epoch": 1.155067155067155, "percentage": 38.5, "elapsed_time": "0:57:42", "remaining_time": "1:32:10"} | |
| {"current_steps": 948, "total_steps": 2457, "loss": 0.6238126754760742, "lr": 1.5892631539726754e-05, "epoch": 1.1575091575091574, "percentage": 38.58, "elapsed_time": "0:57:47", "remaining_time": "1:31:59"} | |
| {"current_steps": 950, "total_steps": 2457, "loss": 0.9421287178993225, "lr": 1.5871145116312207e-05, "epoch": 1.1599511599511598, "percentage": 38.67, "elapsed_time": "0:57:53", "remaining_time": "1:31:49"} | |
| {"current_steps": 952, "total_steps": 2457, "loss": 0.9722180366516113, "lr": 1.5849619354829627e-05, "epoch": 1.1623931623931625, "percentage": 38.75, "elapsed_time": "0:58:00", "remaining_time": "1:31:42"} | |
| {"current_steps": 954, "total_steps": 2457, "loss": 0.9436995983123779, "lr": 1.5828054429115317e-05, "epoch": 1.164835164835165, "percentage": 38.83, "elapsed_time": "0:58:06", "remaining_time": "1:31:33"} | |
| {"current_steps": 956, "total_steps": 2457, "loss": 0.8100671768188477, "lr": 1.580645051332186e-05, "epoch": 1.1672771672771673, "percentage": 38.91, "elapsed_time": "0:58:13", "remaining_time": "1:31:24"} | |
| {"current_steps": 958, "total_steps": 2457, "loss": 0.7545087337493896, "lr": 1.5784807781916714e-05, "epoch": 1.1697191697191698, "percentage": 38.99, "elapsed_time": "0:58:18", "remaining_time": "1:31:14"} | |
| {"current_steps": 960, "total_steps": 2457, "loss": 1.0842094421386719, "lr": 1.5763126409680803e-05, "epoch": 1.1721611721611722, "percentage": 39.07, "elapsed_time": "0:58:24", "remaining_time": "1:31:05"} | |
| {"current_steps": 962, "total_steps": 2457, "loss": 0.7638933062553406, "lr": 1.5741406571707108e-05, "epoch": 1.1746031746031746, "percentage": 39.15, "elapsed_time": "0:58:30", "remaining_time": "1:30:55"} | |
| {"current_steps": 964, "total_steps": 2457, "loss": 0.6498727798461914, "lr": 1.571964844339924e-05, "epoch": 1.177045177045177, "percentage": 39.23, "elapsed_time": "0:58:37", "remaining_time": "1:30:47"} | |
| {"current_steps": 966, "total_steps": 2457, "loss": 0.983795702457428, "lr": 1.569785220047003e-05, "epoch": 1.1794871794871795, "percentage": 39.32, "elapsed_time": "0:58:43", "remaining_time": "1:30:38"} | |
| {"current_steps": 968, "total_steps": 2457, "loss": 1.1204752922058105, "lr": 1.5676018018940134e-05, "epoch": 1.181929181929182, "percentage": 39.4, "elapsed_time": "0:58:49", "remaining_time": "1:30:29"} | |
| {"current_steps": 970, "total_steps": 2457, "loss": 0.7088498473167419, "lr": 1.5654146075136565e-05, "epoch": 1.1843711843711844, "percentage": 39.48, "elapsed_time": "0:58:55", "remaining_time": "1:30:20"} | |
| {"current_steps": 972, "total_steps": 2457, "loss": 0.9644913077354431, "lr": 1.5632236545691308e-05, "epoch": 1.1868131868131868, "percentage": 39.56, "elapsed_time": "0:59:02", "remaining_time": "1:30:12"} | |
| {"current_steps": 974, "total_steps": 2457, "loss": 0.7552489638328552, "lr": 1.561028960753988e-05, "epoch": 1.1892551892551892, "percentage": 39.64, "elapsed_time": "0:59:08", "remaining_time": "1:30:03"} | |
| {"current_steps": 976, "total_steps": 2457, "loss": 0.6645691990852356, "lr": 1.5588305437919884e-05, "epoch": 1.1916971916971917, "percentage": 39.72, "elapsed_time": "0:59:15", "remaining_time": "1:29:54"} | |
| {"current_steps": 978, "total_steps": 2457, "loss": 0.8974350094795227, "lr": 1.556628421436962e-05, "epoch": 1.1941391941391941, "percentage": 39.8, "elapsed_time": "0:59:20", "remaining_time": "1:29:44"} | |
| {"current_steps": 980, "total_steps": 2457, "loss": 1.0676953792572021, "lr": 1.554422611472661e-05, "epoch": 1.1965811965811965, "percentage": 39.89, "elapsed_time": "0:59:27", "remaining_time": "1:29:36"} | |
| {"current_steps": 982, "total_steps": 2457, "loss": 1.0465797185897827, "lr": 1.552213131712617e-05, "epoch": 1.199023199023199, "percentage": 39.97, "elapsed_time": "0:59:34", "remaining_time": "1:29:29"} | |
| {"current_steps": 984, "total_steps": 2457, "loss": 1.1170203685760498, "lr": 1.55e-05, "epoch": 1.2014652014652014, "percentage": 40.05, "elapsed_time": "0:59:42", "remaining_time": "1:29:22"} | |
| {"current_steps": 986, "total_steps": 2457, "loss": 0.7278258800506592, "lr": 1.5477832342074713e-05, "epoch": 1.2039072039072038, "percentage": 40.13, "elapsed_time": "0:59:48", "remaining_time": "1:29:13"} | |
| {"current_steps": 988, "total_steps": 2457, "loss": 0.7073162794113159, "lr": 1.545562852237039e-05, "epoch": 1.2063492063492063, "percentage": 40.21, "elapsed_time": "0:59:54", "remaining_time": "1:29:04"} | |
| {"current_steps": 990, "total_steps": 2457, "loss": 0.891094982624054, "lr": 1.5433388720199156e-05, "epoch": 1.2087912087912087, "percentage": 40.29, "elapsed_time": "1:00:00", "remaining_time": "1:28:55"} | |
| {"current_steps": 992, "total_steps": 2457, "loss": 0.9304923415184021, "lr": 1.5411113115163722e-05, "epoch": 1.2112332112332111, "percentage": 40.37, "elapsed_time": "1:00:06", "remaining_time": "1:28:46"} | |
| {"current_steps": 994, "total_steps": 2457, "loss": 0.9996479749679565, "lr": 1.538880188715593e-05, "epoch": 1.2136752136752136, "percentage": 40.46, "elapsed_time": "1:00:13", "remaining_time": "1:28:39"} | |
| {"current_steps": 996, "total_steps": 2457, "loss": 0.8368605971336365, "lr": 1.5366455216355298e-05, "epoch": 1.2161172161172162, "percentage": 40.54, "elapsed_time": "1:00:20", "remaining_time": "1:28:31"} | |
| {"current_steps": 998, "total_steps": 2457, "loss": 0.9793355464935303, "lr": 1.534407328322758e-05, "epoch": 1.2185592185592187, "percentage": 40.62, "elapsed_time": "1:00:27", "remaining_time": "1:28:23"} | |
| {"current_steps": 1000, "total_steps": 2457, "loss": 0.6125832796096802, "lr": 1.5321656268523294e-05, "epoch": 1.221001221001221, "percentage": 40.7, "elapsed_time": "1:00:32", "remaining_time": "1:28:13"} | |
| {"current_steps": 1002, "total_steps": 2457, "loss": 0.7384300827980042, "lr": 1.5299204353276268e-05, "epoch": 1.2234432234432235, "percentage": 40.78, "elapsed_time": "1:00:37", "remaining_time": "1:28:02"} | |
| {"current_steps": 1004, "total_steps": 2457, "loss": 0.9433239698410034, "lr": 1.5276717718802183e-05, "epoch": 1.225885225885226, "percentage": 40.86, "elapsed_time": "1:00:43", "remaining_time": "1:27:53"} | |
| {"current_steps": 1006, "total_steps": 2457, "loss": 0.9707098603248596, "lr": 1.5254196546697088e-05, "epoch": 1.2283272283272284, "percentage": 40.94, "elapsed_time": "1:00:49", "remaining_time": "1:27:43"} | |
| {"current_steps": 1008, "total_steps": 2457, "loss": 0.5824246406555176, "lr": 1.523164101883597e-05, "epoch": 1.2307692307692308, "percentage": 41.03, "elapsed_time": "1:00:55", "remaining_time": "1:27:34"} | |
| {"current_steps": 1010, "total_steps": 2457, "loss": 1.0274351835250854, "lr": 1.5209051317371242e-05, "epoch": 1.2332112332112333, "percentage": 41.11, "elapsed_time": "1:01:02", "remaining_time": "1:27:26"} | |
| {"current_steps": 1012, "total_steps": 2457, "loss": 0.6757472157478333, "lr": 1.5186427624731313e-05, "epoch": 1.2356532356532357, "percentage": 41.19, "elapsed_time": "1:01:08", "remaining_time": "1:27:17"} | |
| {"current_steps": 1014, "total_steps": 2457, "loss": 1.041149616241455, "lr": 1.5163770123619083e-05, "epoch": 1.2380952380952381, "percentage": 41.27, "elapsed_time": "1:01:14", "remaining_time": "1:27:09"} | |
| {"current_steps": 1016, "total_steps": 2457, "loss": 0.886056125164032, "lr": 1.5141078997010486e-05, "epoch": 1.2405372405372406, "percentage": 41.35, "elapsed_time": "1:01:20", "remaining_time": "1:27:00"} | |
| {"current_steps": 1018, "total_steps": 2457, "loss": 0.9722467660903931, "lr": 1.5118354428153008e-05, "epoch": 1.242979242979243, "percentage": 41.43, "elapsed_time": "1:01:27", "remaining_time": "1:26:52"} | |
| {"current_steps": 1020, "total_steps": 2457, "loss": 0.6366119980812073, "lr": 1.5095596600564197e-05, "epoch": 1.2454212454212454, "percentage": 41.51, "elapsed_time": "1:01:33", "remaining_time": "1:26:43"} | |
| {"current_steps": 1022, "total_steps": 2457, "loss": 0.7901923656463623, "lr": 1.5072805698030197e-05, "epoch": 1.2478632478632479, "percentage": 41.6, "elapsed_time": "1:01:39", "remaining_time": "1:26:34"} | |
| {"current_steps": 1024, "total_steps": 2457, "loss": 0.9346777200698853, "lr": 1.504998190460426e-05, "epoch": 1.2503052503052503, "percentage": 41.68, "elapsed_time": "1:01:45", "remaining_time": "1:26:25"} | |
| {"current_steps": 1026, "total_steps": 2457, "loss": 0.8927645087242126, "lr": 1.5027125404605246e-05, "epoch": 1.2527472527472527, "percentage": 41.76, "elapsed_time": "1:01:52", "remaining_time": "1:26:17"} | |
| {"current_steps": 1028, "total_steps": 2457, "loss": 0.8685034513473511, "lr": 1.500423638261615e-05, "epoch": 1.2551892551892552, "percentage": 41.84, "elapsed_time": "1:01:58", "remaining_time": "1:26:09"} | |
| {"current_steps": 1030, "total_steps": 2457, "loss": 0.8063104152679443, "lr": 1.4981315023482605e-05, "epoch": 1.2576312576312576, "percentage": 41.92, "elapsed_time": "1:02:05", "remaining_time": "1:26:01"} | |
| {"current_steps": 1032, "total_steps": 2457, "loss": 1.0881439447402954, "lr": 1.4958361512311394e-05, "epoch": 1.26007326007326, "percentage": 42.0, "elapsed_time": "1:02:12", "remaining_time": "1:25:53"} | |
| {"current_steps": 1034, "total_steps": 2457, "loss": 1.1380131244659424, "lr": 1.4935376034468944e-05, "epoch": 1.2625152625152625, "percentage": 42.08, "elapsed_time": "1:02:18", "remaining_time": "1:25:44"} | |
| {"current_steps": 1036, "total_steps": 2457, "loss": 0.6871868968009949, "lr": 1.4912358775579841e-05, "epoch": 1.264957264957265, "percentage": 42.17, "elapsed_time": "1:02:23", "remaining_time": "1:25:35"} | |
| {"current_steps": 1038, "total_steps": 2457, "loss": 0.6862649321556091, "lr": 1.4889309921525325e-05, "epoch": 1.2673992673992673, "percentage": 42.25, "elapsed_time": "1:02:28", "remaining_time": "1:25:25"} | |
| {"current_steps": 1040, "total_steps": 2457, "loss": 0.7429234385490417, "lr": 1.4866229658441793e-05, "epoch": 1.2698412698412698, "percentage": 42.33, "elapsed_time": "1:02:34", "remaining_time": "1:25:15"} | |
| {"current_steps": 1042, "total_steps": 2457, "loss": 0.9307520389556885, "lr": 1.4843118172719289e-05, "epoch": 1.2722832722832722, "percentage": 42.41, "elapsed_time": "1:02:40", "remaining_time": "1:25:06"} | |
| {"current_steps": 1044, "total_steps": 2457, "loss": 0.7104328274726868, "lr": 1.4819975650999998e-05, "epoch": 1.2747252747252746, "percentage": 42.49, "elapsed_time": "1:02:48", "remaining_time": "1:25:00"} | |
| {"current_steps": 1046, "total_steps": 2457, "loss": 1.0070260763168335, "lr": 1.4796802280176762e-05, "epoch": 1.277167277167277, "percentage": 42.57, "elapsed_time": "1:02:54", "remaining_time": "1:24:52"} | |
| {"current_steps": 1048, "total_steps": 2457, "loss": 0.690989077091217, "lr": 1.4773598247391527e-05, "epoch": 1.2796092796092795, "percentage": 42.65, "elapsed_time": "1:03:00", "remaining_time": "1:24:43"} | |
| {"current_steps": 1050, "total_steps": 2457, "loss": 0.42399048805236816, "lr": 1.4750363740033881e-05, "epoch": 1.282051282051282, "percentage": 42.74, "elapsed_time": "1:03:05", "remaining_time": "1:24:33"} | |
| {"current_steps": 1052, "total_steps": 2457, "loss": 1.0426183938980103, "lr": 1.4727098945739497e-05, "epoch": 1.2844932844932844, "percentage": 42.82, "elapsed_time": "1:03:12", "remaining_time": "1:24:24"} | |
| {"current_steps": 1054, "total_steps": 2457, "loss": 0.8385255336761475, "lr": 1.470380405238865e-05, "epoch": 1.2869352869352868, "percentage": 42.9, "elapsed_time": "1:03:19", "remaining_time": "1:24:17"} | |
| {"current_steps": 1056, "total_steps": 2457, "loss": 0.6596496105194092, "lr": 1.4680479248104678e-05, "epoch": 1.2893772893772895, "percentage": 42.98, "elapsed_time": "1:03:26", "remaining_time": "1:24:09"} | |
| {"current_steps": 1058, "total_steps": 2457, "loss": 1.232382893562317, "lr": 1.4657124721252476e-05, "epoch": 1.291819291819292, "percentage": 43.06, "elapsed_time": "1:03:33", "remaining_time": "1:24:01"} | |
| {"current_steps": 1060, "total_steps": 2457, "loss": 1.0262730121612549, "lr": 1.4633740660436974e-05, "epoch": 1.2942612942612943, "percentage": 43.14, "elapsed_time": "1:03:39", "remaining_time": "1:23:54"} | |
| {"current_steps": 1062, "total_steps": 2457, "loss": 0.6136125326156616, "lr": 1.4610327254501607e-05, "epoch": 1.2967032967032968, "percentage": 43.22, "elapsed_time": "1:03:44", "remaining_time": "1:23:44"} | |
| {"current_steps": 1064, "total_steps": 2457, "loss": 0.8876266479492188, "lr": 1.4586884692526791e-05, "epoch": 1.2991452991452992, "percentage": 43.3, "elapsed_time": "1:03:51", "remaining_time": "1:23:35"} | |
| {"current_steps": 1066, "total_steps": 2457, "loss": 0.7026379108428955, "lr": 1.4563413163828397e-05, "epoch": 1.3015873015873016, "percentage": 43.39, "elapsed_time": "1:03:58", "remaining_time": "1:23:28"} | |
| {"current_steps": 1068, "total_steps": 2457, "loss": 0.9727767705917358, "lr": 1.4539912857956234e-05, "epoch": 1.304029304029304, "percentage": 43.47, "elapsed_time": "1:04:04", "remaining_time": "1:23:20"} | |
| {"current_steps": 1070, "total_steps": 2457, "loss": 0.7625731825828552, "lr": 1.4516383964692495e-05, "epoch": 1.3064713064713065, "percentage": 43.55, "elapsed_time": "1:04:12", "remaining_time": "1:23:14"} | |
| {"current_steps": 1072, "total_steps": 2457, "loss": 0.9061781167984009, "lr": 1.4492826674050248e-05, "epoch": 1.308913308913309, "percentage": 43.63, "elapsed_time": "1:04:19", "remaining_time": "1:23:06"} | |
| {"current_steps": 1074, "total_steps": 2457, "loss": 0.7514428496360779, "lr": 1.4469241176271884e-05, "epoch": 1.3113553113553114, "percentage": 43.71, "elapsed_time": "1:04:25", "remaining_time": "1:22:57"} | |
| {"current_steps": 1076, "total_steps": 2457, "loss": 0.6796785593032837, "lr": 1.4445627661827589e-05, "epoch": 1.3137973137973138, "percentage": 43.79, "elapsed_time": "1:04:32", "remaining_time": "1:22:49"} | |
| {"current_steps": 1078, "total_steps": 2457, "loss": 0.9605479836463928, "lr": 1.4421986321413801e-05, "epoch": 1.3162393162393162, "percentage": 43.87, "elapsed_time": "1:04:38", "remaining_time": "1:22:41"} | |
| {"current_steps": 1080, "total_steps": 2457, "loss": 0.8200567364692688, "lr": 1.439831734595168e-05, "epoch": 1.3186813186813187, "percentage": 43.96, "elapsed_time": "1:04:44", "remaining_time": "1:22:32"} | |
| {"current_steps": 1082, "total_steps": 2457, "loss": 0.881037175655365, "lr": 1.4374620926585556e-05, "epoch": 1.321123321123321, "percentage": 44.04, "elapsed_time": "1:04:50", "remaining_time": "1:22:23"} | |
| {"current_steps": 1084, "total_steps": 2457, "loss": 0.8864683508872986, "lr": 1.4350897254681386e-05, "epoch": 1.3235653235653235, "percentage": 44.12, "elapsed_time": "1:04:56", "remaining_time": "1:22:15"} | |
| {"current_steps": 1086, "total_steps": 2457, "loss": 1.0031923055648804, "lr": 1.4327146521825213e-05, "epoch": 1.326007326007326, "percentage": 44.2, "elapsed_time": "1:05:03", "remaining_time": "1:22:07"} | |
| {"current_steps": 1088, "total_steps": 2457, "loss": 1.0991631746292114, "lr": 1.4303368919821619e-05, "epoch": 1.3284493284493284, "percentage": 44.28, "elapsed_time": "1:05:11", "remaining_time": "1:22:01"} | |
| {"current_steps": 1090, "total_steps": 2457, "loss": 0.6553327441215515, "lr": 1.4279564640692172e-05, "epoch": 1.3308913308913308, "percentage": 44.36, "elapsed_time": "1:05:17", "remaining_time": "1:21:52"} | |
| {"current_steps": 1092, "total_steps": 2457, "loss": 0.7461038827896118, "lr": 1.4255733876673874e-05, "epoch": 1.3333333333333333, "percentage": 44.44, "elapsed_time": "1:05:23", "remaining_time": "1:21:44"} | |
| {"current_steps": 1094, "total_steps": 2457, "loss": 0.9785415530204773, "lr": 1.4231876820217623e-05, "epoch": 1.3357753357753357, "percentage": 44.53, "elapsed_time": "1:05:29", "remaining_time": "1:21:35"} | |
| {"current_steps": 1096, "total_steps": 2457, "loss": 0.47891128063201904, "lr": 1.4207993663986636e-05, "epoch": 1.3382173382173383, "percentage": 44.61, "elapsed_time": "1:05:34", "remaining_time": "1:21:25"} | |
| {"current_steps": 1098, "total_steps": 2457, "loss": 1.1681262254714966, "lr": 1.4184084600854906e-05, "epoch": 1.3406593406593408, "percentage": 44.69, "elapsed_time": "1:05:40", "remaining_time": "1:21:16"} | |
| {"current_steps": 1100, "total_steps": 2457, "loss": 1.0751440525054932, "lr": 1.4160149823905654e-05, "epoch": 1.3431013431013432, "percentage": 44.77, "elapsed_time": "1:05:47", "remaining_time": "1:21:09"} | |
| {"current_steps": 1102, "total_steps": 2457, "loss": 1.000352144241333, "lr": 1.4136189526429749e-05, "epoch": 1.3455433455433456, "percentage": 44.85, "elapsed_time": "1:05:53", "remaining_time": "1:21:01"} | |
| {"current_steps": 1104, "total_steps": 2457, "loss": 0.8417548537254333, "lr": 1.4112203901924153e-05, "epoch": 1.347985347985348, "percentage": 44.93, "elapsed_time": "1:06:01", "remaining_time": "1:20:54"} | |
| {"current_steps": 1106, "total_steps": 2457, "loss": 0.9740299582481384, "lr": 1.4088193144090376e-05, "epoch": 1.3504273504273505, "percentage": 45.01, "elapsed_time": "1:06:07", "remaining_time": "1:20:46"} | |
| {"current_steps": 1108, "total_steps": 2457, "loss": 0.7925201058387756, "lr": 1.406415744683289e-05, "epoch": 1.352869352869353, "percentage": 45.1, "elapsed_time": "1:06:14", "remaining_time": "1:20:38"} | |
| {"current_steps": 1110, "total_steps": 2457, "loss": 1.042458415031433, "lr": 1.4040097004257567e-05, "epoch": 1.3553113553113554, "percentage": 45.18, "elapsed_time": "1:06:20", "remaining_time": "1:20:30"} | |
| {"current_steps": 1112, "total_steps": 2457, "loss": 0.9074981808662415, "lr": 1.4016012010670125e-05, "epoch": 1.3577533577533578, "percentage": 45.26, "elapsed_time": "1:06:27", "remaining_time": "1:20:22"} | |
| {"current_steps": 1114, "total_steps": 2457, "loss": 0.8596875667572021, "lr": 1.3991902660574544e-05, "epoch": 1.3601953601953602, "percentage": 45.34, "elapsed_time": "1:06:33", "remaining_time": "1:20:14"} | |
| {"current_steps": 1116, "total_steps": 2457, "loss": 0.5096735954284668, "lr": 1.39677691486715e-05, "epoch": 1.3626373626373627, "percentage": 45.42, "elapsed_time": "1:06:39", "remaining_time": "1:20:06"} | |
| {"current_steps": 1118, "total_steps": 2457, "loss": 0.8825461268424988, "lr": 1.3943611669856797e-05, "epoch": 1.3650793650793651, "percentage": 45.5, "elapsed_time": "1:06:46", "remaining_time": "1:19:58"} | |
| {"current_steps": 1120, "total_steps": 2457, "loss": 0.9512450695037842, "lr": 1.3919430419219787e-05, "epoch": 1.3675213675213675, "percentage": 45.58, "elapsed_time": "1:06:52", "remaining_time": "1:19:49"} | |
| {"current_steps": 1122, "total_steps": 2457, "loss": 0.9308354258537292, "lr": 1.389522559204179e-05, "epoch": 1.36996336996337, "percentage": 45.67, "elapsed_time": "1:06:58", "remaining_time": "1:19:41"} | |
| {"current_steps": 1124, "total_steps": 2457, "loss": 0.8262976408004761, "lr": 1.387099738379454e-05, "epoch": 1.3724053724053724, "percentage": 45.75, "elapsed_time": "1:07:05", "remaining_time": "1:19:33"} | |
| {"current_steps": 1126, "total_steps": 2457, "loss": 1.28501558303833, "lr": 1.3846745990138581e-05, "epoch": 1.3748473748473748, "percentage": 45.83, "elapsed_time": "1:07:11", "remaining_time": "1:19:25"} | |
| {"current_steps": 1128, "total_steps": 2457, "loss": 0.9468799829483032, "lr": 1.382247160692169e-05, "epoch": 1.3772893772893773, "percentage": 45.91, "elapsed_time": "1:07:17", "remaining_time": "1:19:16"} | |
| {"current_steps": 1130, "total_steps": 2457, "loss": 0.6640329360961914, "lr": 1.3798174430177314e-05, "epoch": 1.3797313797313797, "percentage": 45.99, "elapsed_time": "1:07:22", "remaining_time": "1:19:07"} | |
| {"current_steps": 1132, "total_steps": 2457, "loss": 0.7266710996627808, "lr": 1.3773854656122962e-05, "epoch": 1.3821733821733821, "percentage": 46.07, "elapsed_time": "1:07:29", "remaining_time": "1:19:00"} | |
| {"current_steps": 1134, "total_steps": 2457, "loss": 0.5124362707138062, "lr": 1.3749512481158649e-05, "epoch": 1.3846153846153846, "percentage": 46.15, "elapsed_time": "1:07:36", "remaining_time": "1:18:52"} | |
| {"current_steps": 1136, "total_steps": 2457, "loss": 0.6932591199874878, "lr": 1.3725148101865275e-05, "epoch": 1.387057387057387, "percentage": 46.24, "elapsed_time": "1:07:42", "remaining_time": "1:18:43"} | |
| {"current_steps": 1138, "total_steps": 2457, "loss": 1.0207314491271973, "lr": 1.3700761715003068e-05, "epoch": 1.3894993894993894, "percentage": 46.32, "elapsed_time": "1:07:49", "remaining_time": "1:18:36"} | |
| {"current_steps": 1140, "total_steps": 2457, "loss": 0.8703376650810242, "lr": 1.3676353517509981e-05, "epoch": 1.3919413919413919, "percentage": 46.4, "elapsed_time": "1:07:57", "remaining_time": "1:18:30"} | |
| {"current_steps": 1142, "total_steps": 2457, "loss": 0.9365097284317017, "lr": 1.3651923706500105e-05, "epoch": 1.3943833943833943, "percentage": 46.48, "elapsed_time": "1:08:04", "remaining_time": "1:18:23"} | |
| {"current_steps": 1144, "total_steps": 2457, "loss": 0.7051898837089539, "lr": 1.362747247926207e-05, "epoch": 1.3968253968253967, "percentage": 46.56, "elapsed_time": "1:08:12", "remaining_time": "1:18:16"} | |
| {"current_steps": 1146, "total_steps": 2457, "loss": 1.0435025691986084, "lr": 1.3603000033257465e-05, "epoch": 1.3992673992673992, "percentage": 46.64, "elapsed_time": "1:08:21", "remaining_time": "1:18:11"} | |
| {"current_steps": 1148, "total_steps": 2457, "loss": 0.8728469610214233, "lr": 1.3578506566119236e-05, "epoch": 1.4017094017094016, "percentage": 46.72, "elapsed_time": "1:08:32", "remaining_time": "1:18:08"} | |
| {"current_steps": 1150, "total_steps": 2457, "loss": 0.7566535472869873, "lr": 1.355399227565008e-05, "epoch": 1.404151404151404, "percentage": 46.81, "elapsed_time": "1:08:40", "remaining_time": "1:18:02"} | |
| {"current_steps": 1152, "total_steps": 2457, "loss": 0.7982299327850342, "lr": 1.352945735982087e-05, "epoch": 1.4065934065934065, "percentage": 46.89, "elapsed_time": "1:08:48", "remaining_time": "1:17:56"} | |
| {"current_steps": 1154, "total_steps": 2457, "loss": 0.7825957536697388, "lr": 1.3504902016769039e-05, "epoch": 1.409035409035409, "percentage": 46.97, "elapsed_time": "1:08:56", "remaining_time": "1:17:50"} | |
| {"current_steps": 1156, "total_steps": 2457, "loss": 0.6891085505485535, "lr": 1.348032644479698e-05, "epoch": 1.4114774114774113, "percentage": 47.05, "elapsed_time": "1:09:06", "remaining_time": "1:17:47"} | |
| {"current_steps": 1158, "total_steps": 2457, "loss": 0.8980281352996826, "lr": 1.3455730842370462e-05, "epoch": 1.4139194139194138, "percentage": 47.13, "elapsed_time": "1:09:17", "remaining_time": "1:17:43"} | |
| {"current_steps": 1160, "total_steps": 2457, "loss": 0.8913061618804932, "lr": 1.3431115408117002e-05, "epoch": 1.4163614163614164, "percentage": 47.21, "elapsed_time": "1:09:25", "remaining_time": "1:17:36"} | |
| {"current_steps": 1162, "total_steps": 2457, "loss": 0.7366968393325806, "lr": 1.3406480340824272e-05, "epoch": 1.4188034188034189, "percentage": 47.29, "elapsed_time": "1:09:31", "remaining_time": "1:17:29"} | |
| {"current_steps": 1164, "total_steps": 2457, "loss": 0.6932869553565979, "lr": 1.3381825839438514e-05, "epoch": 1.4212454212454213, "percentage": 47.37, "elapsed_time": "1:09:39", "remaining_time": "1:17:22"} | |
| {"current_steps": 1166, "total_steps": 2457, "loss": 1.1828283071517944, "lr": 1.3357152103062892e-05, "epoch": 1.4236874236874237, "percentage": 47.46, "elapsed_time": "1:09:47", "remaining_time": "1:17:16"} | |
| {"current_steps": 1168, "total_steps": 2457, "loss": 0.966327428817749, "lr": 1.3332459330955921e-05, "epoch": 1.4261294261294262, "percentage": 47.54, "elapsed_time": "1:09:55", "remaining_time": "1:17:10"} | |
| {"current_steps": 1170, "total_steps": 2457, "loss": 0.8709004521369934, "lr": 1.3307747722529838e-05, "epoch": 1.4285714285714286, "percentage": 47.62, "elapsed_time": "1:10:02", "remaining_time": "1:17:03"} | |
| {"current_steps": 1172, "total_steps": 2457, "loss": 0.9068043231964111, "lr": 1.3283017477348993e-05, "epoch": 1.431013431013431, "percentage": 47.7, "elapsed_time": "1:10:09", "remaining_time": "1:16:55"} | |
| {"current_steps": 1174, "total_steps": 2457, "loss": 0.9378133416175842, "lr": 1.3258268795128258e-05, "epoch": 1.4334554334554335, "percentage": 47.78, "elapsed_time": "1:10:18", "remaining_time": "1:16:49"} | |
| {"current_steps": 1176, "total_steps": 2457, "loss": 1.0176819562911987, "lr": 1.3233501875731376e-05, "epoch": 1.435897435897436, "percentage": 47.86, "elapsed_time": "1:10:24", "remaining_time": "1:16:41"} | |
| {"current_steps": 1178, "total_steps": 2457, "loss": 0.7393254041671753, "lr": 1.320871691916938e-05, "epoch": 1.4383394383394383, "percentage": 47.94, "elapsed_time": "1:10:30", "remaining_time": "1:16:33"} | |
| {"current_steps": 1180, "total_steps": 2457, "loss": 0.8406731486320496, "lr": 1.3183914125598966e-05, "epoch": 1.4407814407814408, "percentage": 48.03, "elapsed_time": "1:10:36", "remaining_time": "1:16:24"} | |
| {"current_steps": 1182, "total_steps": 2457, "loss": 0.756401002407074, "lr": 1.3159093695320881e-05, "epoch": 1.4432234432234432, "percentage": 48.11, "elapsed_time": "1:10:42", "remaining_time": "1:16:16"} | |
| {"current_steps": 1184, "total_steps": 2457, "loss": 1.055999755859375, "lr": 1.313425582877829e-05, "epoch": 1.4456654456654456, "percentage": 48.19, "elapsed_time": "1:10:51", "remaining_time": "1:16:10"} | |
| {"current_steps": 1186, "total_steps": 2457, "loss": 0.8509088754653931, "lr": 1.3109400726555179e-05, "epoch": 1.448107448107448, "percentage": 48.27, "elapsed_time": "1:10:57", "remaining_time": "1:16:03"} | |
| {"current_steps": 1188, "total_steps": 2457, "loss": 0.7348777651786804, "lr": 1.3084528589374718e-05, "epoch": 1.4505494505494505, "percentage": 48.35, "elapsed_time": "1:11:03", "remaining_time": "1:15:54"} | |
| {"current_steps": 1190, "total_steps": 2457, "loss": 0.9267134666442871, "lr": 1.305963961809765e-05, "epoch": 1.452991452991453, "percentage": 48.43, "elapsed_time": "1:11:10", "remaining_time": "1:15:46"} | |
| {"current_steps": 1192, "total_steps": 2457, "loss": 0.8056920170783997, "lr": 1.3034734013720669e-05, "epoch": 1.4554334554334554, "percentage": 48.51, "elapsed_time": "1:11:15", "remaining_time": "1:15:37"} | |
| {"current_steps": 1194, "total_steps": 2457, "loss": 0.6724956631660461, "lr": 1.3009811977374784e-05, "epoch": 1.4578754578754578, "percentage": 48.6, "elapsed_time": "1:11:21", "remaining_time": "1:15:28"} | |
| {"current_steps": 1196, "total_steps": 2457, "loss": 0.6628673076629639, "lr": 1.2984873710323711e-05, "epoch": 1.4603174603174602, "percentage": 48.68, "elapsed_time": "1:11:30", "remaining_time": "1:15:23"} | |
| {"current_steps": 1198, "total_steps": 2457, "loss": 0.8408687710762024, "lr": 1.2959919413962242e-05, "epoch": 1.462759462759463, "percentage": 48.76, "elapsed_time": "1:11:41", "remaining_time": "1:15:20"} | |
| {"current_steps": 1200, "total_steps": 2457, "loss": 1.1985151767730713, "lr": 1.2934949289814611e-05, "epoch": 1.4652014652014653, "percentage": 48.84, "elapsed_time": "1:11:49", "remaining_time": "1:15:14"} | |
| {"current_steps": 1202, "total_steps": 2457, "loss": 0.9667496681213379, "lr": 1.290996353953288e-05, "epoch": 1.4676434676434678, "percentage": 48.92, "elapsed_time": "1:11:57", "remaining_time": "1:15:07"} | |
| {"current_steps": 1204, "total_steps": 2457, "loss": 0.9893684983253479, "lr": 1.2884962364895304e-05, "epoch": 1.4700854700854702, "percentage": 49.0, "elapsed_time": "1:12:06", "remaining_time": "1:15:02"} | |
| {"current_steps": 1206, "total_steps": 2457, "loss": 0.8230042457580566, "lr": 1.2859945967804687e-05, "epoch": 1.4725274725274726, "percentage": 49.08, "elapsed_time": "1:12:13", "remaining_time": "1:14:54"} | |
| {"current_steps": 1208, "total_steps": 2457, "loss": 0.7464233040809631, "lr": 1.2834914550286789e-05, "epoch": 1.474969474969475, "percentage": 49.17, "elapsed_time": "1:12:20", "remaining_time": "1:14:47"} | |
| {"current_steps": 1210, "total_steps": 2457, "loss": 0.8318718671798706, "lr": 1.2809868314488647e-05, "epoch": 1.4774114774114775, "percentage": 49.25, "elapsed_time": "1:12:27", "remaining_time": "1:14:40"} | |
| {"current_steps": 1212, "total_steps": 2457, "loss": 0.8906052708625793, "lr": 1.2784807462676983e-05, "epoch": 1.47985347985348, "percentage": 49.33, "elapsed_time": "1:12:32", "remaining_time": "1:14:31"} | |
| {"current_steps": 1214, "total_steps": 2457, "loss": 0.9788769483566284, "lr": 1.2759732197236548e-05, "epoch": 1.4822954822954824, "percentage": 49.41, "elapsed_time": "1:12:39", "remaining_time": "1:14:23"} | |
| {"current_steps": 1216, "total_steps": 2457, "loss": 0.9402112364768982, "lr": 1.2734642720668494e-05, "epoch": 1.4847374847374848, "percentage": 49.49, "elapsed_time": "1:12:46", "remaining_time": "1:14:15"} | |
| {"current_steps": 1218, "total_steps": 2457, "loss": 0.27936387062072754, "lr": 1.2709539235588739e-05, "epoch": 1.4871794871794872, "percentage": 49.57, "elapsed_time": "1:12:51", "remaining_time": "1:14:06"} | |
| {"current_steps": 1220, "total_steps": 2457, "loss": 0.7066472768783569, "lr": 1.2684421944726323e-05, "epoch": 1.4896214896214897, "percentage": 49.65, "elapsed_time": "1:12:55", "remaining_time": "1:13:56"} | |
| {"current_steps": 1222, "total_steps": 2457, "loss": 0.8000496029853821, "lr": 1.2659291050921798e-05, "epoch": 1.492063492063492, "percentage": 49.74, "elapsed_time": "1:13:00", "remaining_time": "1:13:47"} | |
| {"current_steps": 1224, "total_steps": 2457, "loss": 0.733214259147644, "lr": 1.263414675712554e-05, "epoch": 1.4945054945054945, "percentage": 49.82, "elapsed_time": "1:13:07", "remaining_time": "1:13:39"} | |
| {"current_steps": 1226, "total_steps": 2457, "loss": 0.8229939341545105, "lr": 1.2608989266396165e-05, "epoch": 1.496947496947497, "percentage": 49.9, "elapsed_time": "1:13:13", "remaining_time": "1:13:30"} | |
| {"current_steps": 1228, "total_steps": 2457, "loss": 0.4456430971622467, "lr": 1.2583818781898855e-05, "epoch": 1.4993894993894994, "percentage": 49.98, "elapsed_time": "1:13:19", "remaining_time": "1:13:23"} | |
| {"current_steps": 1230, "total_steps": 2457, "loss": 0.6831130981445312, "lr": 1.2558635506903717e-05, "epoch": 1.5018315018315018, "percentage": 50.06, "elapsed_time": "1:13:25", "remaining_time": "1:13:14"} | |
| {"current_steps": 1232, "total_steps": 2457, "loss": 0.6764166951179504, "lr": 1.253343964478417e-05, "epoch": 1.5042735042735043, "percentage": 50.14, "elapsed_time": "1:13:30", "remaining_time": "1:13:05"} | |
| {"current_steps": 1234, "total_steps": 2457, "loss": 0.9079239368438721, "lr": 1.250823139901527e-05, "epoch": 1.5067155067155067, "percentage": 50.22, "elapsed_time": "1:13:36", "remaining_time": "1:12:57"} | |
| {"current_steps": 1236, "total_steps": 2457, "loss": 0.9452921748161316, "lr": 1.2483010973172077e-05, "epoch": 1.5091575091575091, "percentage": 50.31, "elapsed_time": "1:13:42", "remaining_time": "1:12:48"} | |
| {"current_steps": 1238, "total_steps": 2457, "loss": 0.8234338760375977, "lr": 1.2457778570928026e-05, "epoch": 1.5115995115995116, "percentage": 50.39, "elapsed_time": "1:13:48", "remaining_time": "1:12:40"} | |
| {"current_steps": 1240, "total_steps": 2457, "loss": 0.8415461778640747, "lr": 1.2432534396053261e-05, "epoch": 1.514041514041514, "percentage": 50.47, "elapsed_time": "1:13:55", "remaining_time": "1:12:33"} | |
| {"current_steps": 1242, "total_steps": 2457, "loss": 1.0288302898406982, "lr": 1.2407278652413001e-05, "epoch": 1.5164835164835164, "percentage": 50.55, "elapsed_time": "1:14:02", "remaining_time": "1:12:25"} | |
| {"current_steps": 1244, "total_steps": 2457, "loss": 0.7554802298545837, "lr": 1.2382011543965896e-05, "epoch": 1.5189255189255189, "percentage": 50.63, "elapsed_time": "1:14:08", "remaining_time": "1:12:17"} | |
| {"current_steps": 1246, "total_steps": 2457, "loss": 0.7608579397201538, "lr": 1.2356733274762367e-05, "epoch": 1.5213675213675213, "percentage": 50.71, "elapsed_time": "1:14:15", "remaining_time": "1:12:10"} | |
| {"current_steps": 1248, "total_steps": 2457, "loss": 0.8119852542877197, "lr": 1.2331444048942969e-05, "epoch": 1.5238095238095237, "percentage": 50.79, "elapsed_time": "1:14:21", "remaining_time": "1:12:01"} | |
| {"current_steps": 1250, "total_steps": 2457, "loss": 1.1432095766067505, "lr": 1.2306144070736747e-05, "epoch": 1.5262515262515262, "percentage": 50.88, "elapsed_time": "1:14:27", "remaining_time": "1:11:53"} | |
| {"current_steps": 1252, "total_steps": 2457, "loss": 0.7118352055549622, "lr": 1.228083354445957e-05, "epoch": 1.5286935286935286, "percentage": 50.96, "elapsed_time": "1:14:34", "remaining_time": "1:11:46"} | |
| {"current_steps": 1254, "total_steps": 2457, "loss": 0.9391320943832397, "lr": 1.2255512674512491e-05, "epoch": 1.531135531135531, "percentage": 51.04, "elapsed_time": "1:14:40", "remaining_time": "1:11:38"} | |
| {"current_steps": 1256, "total_steps": 2457, "loss": 1.0426268577575684, "lr": 1.2230181665380101e-05, "epoch": 1.5335775335775335, "percentage": 51.12, "elapsed_time": "1:14:47", "remaining_time": "1:11:31"} | |
| {"current_steps": 1258, "total_steps": 2457, "loss": 0.35382741689682007, "lr": 1.220484072162887e-05, "epoch": 1.536019536019536, "percentage": 51.2, "elapsed_time": "1:14:54", "remaining_time": "1:11:23"} | |
| {"current_steps": 1260, "total_steps": 2457, "loss": 0.6097034215927124, "lr": 1.2179490047905495e-05, "epoch": 1.5384615384615383, "percentage": 51.28, "elapsed_time": "1:14:59", "remaining_time": "1:11:14"} | |
| {"current_steps": 1262, "total_steps": 2457, "loss": 0.6083784103393555, "lr": 1.2154129848935258e-05, "epoch": 1.5409035409035408, "percentage": 51.36, "elapsed_time": "1:15:05", "remaining_time": "1:11:06"} | |
| {"current_steps": 1264, "total_steps": 2457, "loss": 0.7916078567504883, "lr": 1.2128760329520355e-05, "epoch": 1.5433455433455432, "percentage": 51.44, "elapsed_time": "1:15:13", "remaining_time": "1:10:59"} | |
| {"current_steps": 1266, "total_steps": 2457, "loss": 0.8106079697608948, "lr": 1.210338169453825e-05, "epoch": 1.5457875457875456, "percentage": 51.53, "elapsed_time": "1:15:19", "remaining_time": "1:10:51"} | |
| {"current_steps": 1268, "total_steps": 2457, "loss": 0.8362663984298706, "lr": 1.2077994148940033e-05, "epoch": 1.5482295482295483, "percentage": 51.61, "elapsed_time": "1:15:25", "remaining_time": "1:10:43"} | |
| {"current_steps": 1270, "total_steps": 2457, "loss": 0.4818616807460785, "lr": 1.2052597897748746e-05, "epoch": 1.5506715506715507, "percentage": 51.69, "elapsed_time": "1:15:32", "remaining_time": "1:10:35"} | |
| {"current_steps": 1272, "total_steps": 2457, "loss": 1.0731854438781738, "lr": 1.202719314605773e-05, "epoch": 1.5531135531135531, "percentage": 51.77, "elapsed_time": "1:15:38", "remaining_time": "1:10:28"} | |
| {"current_steps": 1274, "total_steps": 2457, "loss": 0.943490207195282, "lr": 1.2001780099028988e-05, "epoch": 1.5555555555555556, "percentage": 51.85, "elapsed_time": "1:15:45", "remaining_time": "1:10:20"} | |
| {"current_steps": 1276, "total_steps": 2457, "loss": 1.3021904230117798, "lr": 1.1976358961891504e-05, "epoch": 1.557997557997558, "percentage": 51.93, "elapsed_time": "1:15:51", "remaining_time": "1:10:13"} | |
| {"current_steps": 1278, "total_steps": 2457, "loss": 0.7510530948638916, "lr": 1.1950929939939596e-05, "epoch": 1.5604395604395604, "percentage": 52.01, "elapsed_time": "1:15:57", "remaining_time": "1:10:04"} | |
| {"current_steps": 1280, "total_steps": 2457, "loss": 0.9113296270370483, "lr": 1.192549323853126e-05, "epoch": 1.5628815628815629, "percentage": 52.1, "elapsed_time": "1:16:03", "remaining_time": "1:09:56"} | |
| {"current_steps": 1282, "total_steps": 2457, "loss": 0.6182503700256348, "lr": 1.1900049063086508e-05, "epoch": 1.5653235653235653, "percentage": 52.18, "elapsed_time": "1:16:09", "remaining_time": "1:09:48"} | |
| {"current_steps": 1284, "total_steps": 2457, "loss": 0.9308310151100159, "lr": 1.1874597619085712e-05, "epoch": 1.5677655677655677, "percentage": 52.26, "elapsed_time": "1:16:15", "remaining_time": "1:09:40"} | |
| {"current_steps": 1286, "total_steps": 2457, "loss": 0.9331011772155762, "lr": 1.1849139112067937e-05, "epoch": 1.5702075702075702, "percentage": 52.34, "elapsed_time": "1:16:23", "remaining_time": "1:09:33"} | |
| {"current_steps": 1288, "total_steps": 2457, "loss": 0.490848183631897, "lr": 1.18236737476293e-05, "epoch": 1.5726495726495726, "percentage": 52.42, "elapsed_time": "1:16:30", "remaining_time": "1:09:26"} | |
| {"current_steps": 1290, "total_steps": 2457, "loss": 0.7262513637542725, "lr": 1.1798201731421286e-05, "epoch": 1.575091575091575, "percentage": 52.5, "elapsed_time": "1:16:36", "remaining_time": "1:09:18"} | |
| {"current_steps": 1292, "total_steps": 2457, "loss": 0.43270692229270935, "lr": 1.1772723269149096e-05, "epoch": 1.5775335775335775, "percentage": 52.58, "elapsed_time": "1:16:42", "remaining_time": "1:09:09"} | |
| {"current_steps": 1294, "total_steps": 2457, "loss": 0.6380181908607483, "lr": 1.1747238566569993e-05, "epoch": 1.5799755799755801, "percentage": 52.67, "elapsed_time": "1:16:47", "remaining_time": "1:09:01"} | |
| {"current_steps": 1296, "total_steps": 2457, "loss": 0.9579664468765259, "lr": 1.1721747829491639e-05, "epoch": 1.5824175824175826, "percentage": 52.75, "elapsed_time": "1:16:53", "remaining_time": "1:08:53"} | |
| {"current_steps": 1298, "total_steps": 2457, "loss": 1.1132162809371948, "lr": 1.169625126377042e-05, "epoch": 1.584859584859585, "percentage": 52.83, "elapsed_time": "1:17:00", "remaining_time": "1:08:45"} | |
| {"current_steps": 1300, "total_steps": 2457, "loss": 0.9595221877098083, "lr": 1.1670749075309798e-05, "epoch": 1.5873015873015874, "percentage": 52.91, "elapsed_time": "1:17:07", "remaining_time": "1:08:38"} | |
| {"current_steps": 1302, "total_steps": 2457, "loss": 1.0293970108032227, "lr": 1.164524147005864e-05, "epoch": 1.5897435897435899, "percentage": 52.99, "elapsed_time": "1:17:13", "remaining_time": "1:08:30"} | |
| {"current_steps": 1304, "total_steps": 2457, "loss": 0.9469819664955139, "lr": 1.1619728654009561e-05, "epoch": 1.5921855921855923, "percentage": 53.07, "elapsed_time": "1:17:20", "remaining_time": "1:08:23"} | |
| {"current_steps": 1306, "total_steps": 2457, "loss": 0.6112901568412781, "lr": 1.1594210833197252e-05, "epoch": 1.5946275946275947, "percentage": 53.15, "elapsed_time": "1:17:26", "remaining_time": "1:08:15"} | |
| {"current_steps": 1308, "total_steps": 2457, "loss": 0.9325740337371826, "lr": 1.156868821369683e-05, "epoch": 1.5970695970695972, "percentage": 53.24, "elapsed_time": "1:17:33", "remaining_time": "1:08:07"} | |
| {"current_steps": 1310, "total_steps": 2457, "loss": 0.821311891078949, "lr": 1.1543161001622154e-05, "epoch": 1.5995115995115996, "percentage": 53.32, "elapsed_time": "1:17:38", "remaining_time": "1:07:58"} | |
| {"current_steps": 1312, "total_steps": 2457, "loss": 0.8008186221122742, "lr": 1.1517629403124175e-05, "epoch": 1.601953601953602, "percentage": 53.4, "elapsed_time": "1:17:45", "remaining_time": "1:07:51"} | |
| {"current_steps": 1314, "total_steps": 2457, "loss": 0.9607588648796082, "lr": 1.1492093624389274e-05, "epoch": 1.6043956043956045, "percentage": 53.48, "elapsed_time": "1:17:52", "remaining_time": "1:07:44"} | |
| {"current_steps": 1316, "total_steps": 2457, "loss": 1.0678871870040894, "lr": 1.1466553871637585e-05, "epoch": 1.606837606837607, "percentage": 53.56, "elapsed_time": "1:17:59", "remaining_time": "1:07:37"} | |
| {"current_steps": 1318, "total_steps": 2457, "loss": 0.927726686000824, "lr": 1.1441010351121332e-05, "epoch": 1.6092796092796093, "percentage": 53.64, "elapsed_time": "1:18:06", "remaining_time": "1:07:29"} | |
| {"current_steps": 1320, "total_steps": 2457, "loss": 1.1496163606643677, "lr": 1.1415463269123172e-05, "epoch": 1.6117216117216118, "percentage": 53.72, "elapsed_time": "1:18:12", "remaining_time": "1:07:21"} | |
| {"current_steps": 1322, "total_steps": 2457, "loss": 0.849646270275116, "lr": 1.1389912831954524e-05, "epoch": 1.6141636141636142, "percentage": 53.81, "elapsed_time": "1:18:18", "remaining_time": "1:07:14"} | |
| {"current_steps": 1324, "total_steps": 2457, "loss": 1.0158569812774658, "lr": 1.1364359245953897e-05, "epoch": 1.6166056166056166, "percentage": 53.89, "elapsed_time": "1:18:24", "remaining_time": "1:07:06"} | |
| {"current_steps": 1326, "total_steps": 2457, "loss": 0.6589023470878601, "lr": 1.1338802717485234e-05, "epoch": 1.619047619047619, "percentage": 53.97, "elapsed_time": "1:18:30", "remaining_time": "1:06:57"} | |
| {"current_steps": 1328, "total_steps": 2457, "loss": 0.9295322895050049, "lr": 1.1313243452936235e-05, "epoch": 1.6214896214896215, "percentage": 54.05, "elapsed_time": "1:18:37", "remaining_time": "1:06:50"} | |
| {"current_steps": 1330, "total_steps": 2457, "loss": 1.0116742849349976, "lr": 1.1287681658716706e-05, "epoch": 1.623931623931624, "percentage": 54.13, "elapsed_time": "1:18:44", "remaining_time": "1:06:43"} | |
| {"current_steps": 1332, "total_steps": 2457, "loss": 0.8862733244895935, "lr": 1.1262117541256872e-05, "epoch": 1.6263736263736264, "percentage": 54.21, "elapsed_time": "1:18:50", "remaining_time": "1:06:35"} | |
| {"current_steps": 1334, "total_steps": 2457, "loss": 0.9096848368644714, "lr": 1.1236551307005722e-05, "epoch": 1.6288156288156288, "percentage": 54.29, "elapsed_time": "1:18:57", "remaining_time": "1:06:28"} | |
| {"current_steps": 1336, "total_steps": 2457, "loss": 0.5657076835632324, "lr": 1.1210983162429347e-05, "epoch": 1.6312576312576312, "percentage": 54.38, "elapsed_time": "1:19:02", "remaining_time": "1:06:19"} | |
| {"current_steps": 1338, "total_steps": 2457, "loss": 0.9815369248390198, "lr": 1.1185413314009254e-05, "epoch": 1.6336996336996337, "percentage": 54.46, "elapsed_time": "1:19:09", "remaining_time": "1:06:11"} | |
| {"current_steps": 1340, "total_steps": 2457, "loss": 0.5724242925643921, "lr": 1.1159841968240714e-05, "epoch": 1.636141636141636, "percentage": 54.54, "elapsed_time": "1:19:15", "remaining_time": "1:06:03"} | |
| {"current_steps": 1342, "total_steps": 2457, "loss": 0.4281773269176483, "lr": 1.1134269331631096e-05, "epoch": 1.6385836385836385, "percentage": 54.62, "elapsed_time": "1:19:20", "remaining_time": "1:05:55"} | |
| {"current_steps": 1344, "total_steps": 2457, "loss": 1.0027917623519897, "lr": 1.1108695610698187e-05, "epoch": 1.641025641025641, "percentage": 54.7, "elapsed_time": "1:19:26", "remaining_time": "1:05:47"} | |
| {"current_steps": 1346, "total_steps": 2457, "loss": 0.9550279378890991, "lr": 1.1083121011968531e-05, "epoch": 1.6434676434676434, "percentage": 54.78, "elapsed_time": "1:19:34", "remaining_time": "1:05:40"} | |
| {"current_steps": 1348, "total_steps": 2457, "loss": 0.6426241993904114, "lr": 1.1057545741975768e-05, "epoch": 1.6459096459096458, "percentage": 54.86, "elapsed_time": "1:19:41", "remaining_time": "1:05:33"} | |
| {"current_steps": 1350, "total_steps": 2457, "loss": 0.8278497457504272, "lr": 1.1031970007258947e-05, "epoch": 1.6483516483516483, "percentage": 54.95, "elapsed_time": "1:19:46", "remaining_time": "1:05:24"} | |
| {"current_steps": 1352, "total_steps": 2457, "loss": 0.9407053589820862, "lr": 1.1006394014360882e-05, "epoch": 1.6507936507936507, "percentage": 55.03, "elapsed_time": "1:19:52", "remaining_time": "1:05:16"} | |
| {"current_steps": 1354, "total_steps": 2457, "loss": 0.9099552035331726, "lr": 1.0980817969826458e-05, "epoch": 1.6532356532356531, "percentage": 55.11, "elapsed_time": "1:19:58", "remaining_time": "1:05:08"} | |
| {"current_steps": 1356, "total_steps": 2457, "loss": 0.9383828639984131, "lr": 1.0955242080200994e-05, "epoch": 1.6556776556776556, "percentage": 55.19, "elapsed_time": "1:20:04", "remaining_time": "1:05:01"} | |
| {"current_steps": 1358, "total_steps": 2457, "loss": 0.52699214220047, "lr": 1.0929666552028545e-05, "epoch": 1.658119658119658, "percentage": 55.27, "elapsed_time": "1:20:11", "remaining_time": "1:04:53"} | |
| {"current_steps": 1360, "total_steps": 2457, "loss": 0.6198506355285645, "lr": 1.0904091591850255e-05, "epoch": 1.6605616605616604, "percentage": 55.35, "elapsed_time": "1:20:17", "remaining_time": "1:04:45"} | |
| {"current_steps": 1362, "total_steps": 2457, "loss": 0.9911934733390808, "lr": 1.0878517406202674e-05, "epoch": 1.6630036630036629, "percentage": 55.43, "elapsed_time": "1:20:24", "remaining_time": "1:04:38"} | |
| {"current_steps": 1364, "total_steps": 2457, "loss": 1.0504215955734253, "lr": 1.0852944201616097e-05, "epoch": 1.6654456654456653, "percentage": 55.51, "elapsed_time": "1:20:32", "remaining_time": "1:04:32"} | |
| {"current_steps": 1366, "total_steps": 2457, "loss": 1.0229471921920776, "lr": 1.082737218461291e-05, "epoch": 1.6678876678876677, "percentage": 55.6, "elapsed_time": "1:20:38", "remaining_time": "1:04:24"} | |
| {"current_steps": 1368, "total_steps": 2457, "loss": 1.049717903137207, "lr": 1.080180156170589e-05, "epoch": 1.6703296703296702, "percentage": 55.68, "elapsed_time": "1:20:45", "remaining_time": "1:04:17"} | |
| {"current_steps": 1370, "total_steps": 2457, "loss": 1.006693720817566, "lr": 1.0776232539396567e-05, "epoch": 1.6727716727716728, "percentage": 55.76, "elapsed_time": "1:20:51", "remaining_time": "1:04:09"} | |
| {"current_steps": 1372, "total_steps": 2457, "loss": 0.615381121635437, "lr": 1.0750665324173542e-05, "epoch": 1.6752136752136753, "percentage": 55.84, "elapsed_time": "1:20:57", "remaining_time": "1:04:01"} | |
| {"current_steps": 1374, "total_steps": 2457, "loss": 0.36105355620384216, "lr": 1.0725100122510819e-05, "epoch": 1.6776556776556777, "percentage": 55.92, "elapsed_time": "1:21:01", "remaining_time": "1:03:51"} | |
| {"current_steps": 1376, "total_steps": 2457, "loss": 1.1695616245269775, "lr": 1.0699537140866146e-05, "epoch": 1.6800976800976801, "percentage": 56.0, "elapsed_time": "1:21:07", "remaining_time": "1:03:44"} | |
| {"current_steps": 1378, "total_steps": 2457, "loss": 0.9196591377258301, "lr": 1.0673976585679341e-05, "epoch": 1.6825396825396826, "percentage": 56.08, "elapsed_time": "1:21:14", "remaining_time": "1:03:37"} | |
| {"current_steps": 1380, "total_steps": 2457, "loss": 0.7695765495300293, "lr": 1.0648418663370628e-05, "epoch": 1.684981684981685, "percentage": 56.17, "elapsed_time": "1:21:20", "remaining_time": "1:03:28"} | |
| {"current_steps": 1382, "total_steps": 2457, "loss": 1.0195831060409546, "lr": 1.0622863580338967e-05, "epoch": 1.6874236874236874, "percentage": 56.25, "elapsed_time": "1:21:27", "remaining_time": "1:03:21"} | |
| {"current_steps": 1384, "total_steps": 2457, "loss": 0.8976457715034485, "lr": 1.0597311542960385e-05, "epoch": 1.6898656898656899, "percentage": 56.33, "elapsed_time": "1:21:33", "remaining_time": "1:03:14"} | |
| {"current_steps": 1386, "total_steps": 2457, "loss": 0.9752371907234192, "lr": 1.0571762757586321e-05, "epoch": 1.6923076923076923, "percentage": 56.41, "elapsed_time": "1:21:39", "remaining_time": "1:03:06"} | |
| {"current_steps": 1388, "total_steps": 2457, "loss": 0.9225857257843018, "lr": 1.0546217430541947e-05, "epoch": 1.6947496947496947, "percentage": 56.49, "elapsed_time": "1:21:46", "remaining_time": "1:02:58"} | |
| {"current_steps": 1390, "total_steps": 2457, "loss": 0.47266364097595215, "lr": 1.0520675768124507e-05, "epoch": 1.6971916971916972, "percentage": 56.57, "elapsed_time": "1:21:51", "remaining_time": "1:02:49"} | |
| {"current_steps": 1392, "total_steps": 2457, "loss": 0.8273367881774902, "lr": 1.0495137976601648e-05, "epoch": 1.6996336996336996, "percentage": 56.65, "elapsed_time": "1:21:58", "remaining_time": "1:02:42"} | |
| {"current_steps": 1394, "total_steps": 2457, "loss": 0.7290286421775818, "lr": 1.0469604262209765e-05, "epoch": 1.702075702075702, "percentage": 56.74, "elapsed_time": "1:22:05", "remaining_time": "1:02:35"} | |
| {"current_steps": 1396, "total_steps": 2457, "loss": 0.9373266100883484, "lr": 1.0444074831152317e-05, "epoch": 1.7045177045177047, "percentage": 56.82, "elapsed_time": "1:22:11", "remaining_time": "1:02:28"} | |
| {"current_steps": 1398, "total_steps": 2457, "loss": 0.8240612149238586, "lr": 1.0418549889598175e-05, "epoch": 1.7069597069597071, "percentage": 56.9, "elapsed_time": "1:22:17", "remaining_time": "1:02:19"} | |
| {"current_steps": 1400, "total_steps": 2457, "loss": 0.44202497601509094, "lr": 1.0393029643679962e-05, "epoch": 1.7094017094017095, "percentage": 56.98, "elapsed_time": "1:22:23", "remaining_time": "1:02:12"} | |
| {"current_steps": 1402, "total_steps": 2457, "loss": 0.9583691954612732, "lr": 1.0367514299492366e-05, "epoch": 1.711843711843712, "percentage": 57.06, "elapsed_time": "1:22:29", "remaining_time": "1:02:04"} | |
| {"current_steps": 1404, "total_steps": 2457, "loss": 1.0398838520050049, "lr": 1.0342004063090503e-05, "epoch": 1.7142857142857144, "percentage": 57.14, "elapsed_time": "1:22:35", "remaining_time": "1:01:56"} | |
| {"current_steps": 1406, "total_steps": 2457, "loss": 0.4760570824146271, "lr": 1.0316499140488232e-05, "epoch": 1.7167277167277168, "percentage": 57.22, "elapsed_time": "1:22:42", "remaining_time": "1:01:49"} | |
| {"current_steps": 1408, "total_steps": 2457, "loss": 0.907942533493042, "lr": 1.0290999737656497e-05, "epoch": 1.7191697191697193, "percentage": 57.31, "elapsed_time": "1:22:48", "remaining_time": "1:01:41"} | |
| {"current_steps": 1410, "total_steps": 2457, "loss": 0.6862547397613525, "lr": 1.026550606052168e-05, "epoch": 1.7216117216117217, "percentage": 57.39, "elapsed_time": "1:22:55", "remaining_time": "1:01:34"} | |
| {"current_steps": 1412, "total_steps": 2457, "loss": 0.8768781423568726, "lr": 1.0240018314963909e-05, "epoch": 1.7240537240537241, "percentage": 57.47, "elapsed_time": "1:23:01", "remaining_time": "1:01:27"} | |
| {"current_steps": 1414, "total_steps": 2457, "loss": 0.986327588558197, "lr": 1.0214536706815418e-05, "epoch": 1.7264957264957266, "percentage": 57.55, "elapsed_time": "1:23:07", "remaining_time": "1:01:19"} | |
| {"current_steps": 1416, "total_steps": 2457, "loss": 0.8355549573898315, "lr": 1.0189061441858873e-05, "epoch": 1.728937728937729, "percentage": 57.63, "elapsed_time": "1:23:15", "remaining_time": "1:01:12"} | |
| {"current_steps": 1418, "total_steps": 2457, "loss": 0.8929445743560791, "lr": 1.0163592725825712e-05, "epoch": 1.7313797313797314, "percentage": 57.71, "elapsed_time": "1:23:21", "remaining_time": "1:01:04"} | |
| {"current_steps": 1420, "total_steps": 2457, "loss": 0.7870601415634155, "lr": 1.0138130764394496e-05, "epoch": 1.7338217338217339, "percentage": 57.79, "elapsed_time": "1:23:26", "remaining_time": "1:00:56"} | |
| {"current_steps": 1422, "total_steps": 2457, "loss": 0.7534129023551941, "lr": 1.0112675763189224e-05, "epoch": 1.7362637362637363, "percentage": 57.88, "elapsed_time": "1:23:32", "remaining_time": "1:00:48"} | |
| {"current_steps": 1424, "total_steps": 2457, "loss": 0.8370426893234253, "lr": 1.0087227927777696e-05, "epoch": 1.7387057387057387, "percentage": 57.96, "elapsed_time": "1:23:41", "remaining_time": "1:00:42"} | |
| {"current_steps": 1426, "total_steps": 2457, "loss": 0.6909109354019165, "lr": 1.006178746366984e-05, "epoch": 1.7411477411477412, "percentage": 58.04, "elapsed_time": "1:23:48", "remaining_time": "1:00:35"} | |
| {"current_steps": 1428, "total_steps": 2457, "loss": 1.014011263847351, "lr": 1.0036354576316052e-05, "epoch": 1.7435897435897436, "percentage": 58.12, "elapsed_time": "1:23:55", "remaining_time": "1:00:28"} | |
| {"current_steps": 1430, "total_steps": 2457, "loss": 1.2392351627349854, "lr": 1.0010929471105548e-05, "epoch": 1.746031746031746, "percentage": 58.2, "elapsed_time": "1:24:01", "remaining_time": "1:00:20"} | |
| {"current_steps": 1432, "total_steps": 2457, "loss": 0.6340602040290833, "lr": 9.98551235336469e-06, "epoch": 1.7484737484737485, "percentage": 58.28, "elapsed_time": "1:24:07", "remaining_time": "1:00:12"} | |
| {"current_steps": 1434, "total_steps": 2457, "loss": 0.7525686621665955, "lr": 9.960103428355337e-06, "epoch": 1.750915750915751, "percentage": 58.36, "elapsed_time": "1:24:12", "remaining_time": "1:00:04"} | |
| {"current_steps": 1436, "total_steps": 2457, "loss": 0.6044411063194275, "lr": 9.934702901273187e-06, "epoch": 1.7533577533577533, "percentage": 58.45, "elapsed_time": "1:24:18", "remaining_time": "0:59:56"} | |
| {"current_steps": 1438, "total_steps": 2457, "loss": 0.4377739727497101, "lr": 9.90931097724612e-06, "epoch": 1.7557997557997558, "percentage": 58.53, "elapsed_time": "1:24:24", "remaining_time": "0:59:49"} | |
| {"current_steps": 1440, "total_steps": 2457, "loss": 0.909875214099884, "lr": 9.883927861332538e-06, "epoch": 1.7582417582417582, "percentage": 58.61, "elapsed_time": "1:24:32", "remaining_time": "0:59:42"} | |
| {"current_steps": 1442, "total_steps": 2457, "loss": 0.7949923872947693, "lr": 9.85855375851971e-06, "epoch": 1.7606837606837606, "percentage": 58.69, "elapsed_time": "1:24:38", "remaining_time": "0:59:34"} | |
| {"current_steps": 1444, "total_steps": 2457, "loss": 0.6595785021781921, "lr": 9.833188873722122e-06, "epoch": 1.763125763125763, "percentage": 58.77, "elapsed_time": "1:24:44", "remaining_time": "0:59:26"} | |
| {"current_steps": 1446, "total_steps": 2457, "loss": 1.0280483961105347, "lr": 9.80783341177981e-06, "epoch": 1.7655677655677655, "percentage": 58.85, "elapsed_time": "1:24:50", "remaining_time": "0:59:19"} | |
| {"current_steps": 1448, "total_steps": 2457, "loss": 1.0123943090438843, "lr": 9.782487577456724e-06, "epoch": 1.768009768009768, "percentage": 58.93, "elapsed_time": "1:24:57", "remaining_time": "0:59:11"} | |
| {"current_steps": 1450, "total_steps": 2457, "loss": 0.8486643433570862, "lr": 9.75715157543905e-06, "epoch": 1.7704517704517704, "percentage": 59.02, "elapsed_time": "1:25:03", "remaining_time": "0:59:04"} | |
| {"current_steps": 1452, "total_steps": 2457, "loss": 0.3455406129360199, "lr": 9.731825610333587e-06, "epoch": 1.7728937728937728, "percentage": 59.1, "elapsed_time": "1:25:09", "remaining_time": "0:58:56"} | |
| {"current_steps": 1454, "total_steps": 2457, "loss": 0.8303570747375488, "lr": 9.706509886666067e-06, "epoch": 1.7753357753357752, "percentage": 59.18, "elapsed_time": "1:25:15", "remaining_time": "0:58:48"} | |
| {"current_steps": 1456, "total_steps": 2457, "loss": 0.5113586187362671, "lr": 9.681204608879518e-06, "epoch": 1.7777777777777777, "percentage": 59.26, "elapsed_time": "1:25:22", "remaining_time": "0:58:41"} | |
| {"current_steps": 1458, "total_steps": 2457, "loss": 0.8892757892608643, "lr": 9.655909981332614e-06, "epoch": 1.7802197802197801, "percentage": 59.34, "elapsed_time": "1:25:28", "remaining_time": "0:58:34"} | |
| {"current_steps": 1460, "total_steps": 2457, "loss": 0.8083629608154297, "lr": 9.63062620829801e-06, "epoch": 1.7826617826617825, "percentage": 59.42, "elapsed_time": "1:25:34", "remaining_time": "0:58:26"} | |
| {"current_steps": 1462, "total_steps": 2457, "loss": 0.9189132452011108, "lr": 9.605353493960717e-06, "epoch": 1.785103785103785, "percentage": 59.5, "elapsed_time": "1:25:39", "remaining_time": "0:58:17"} | |
| {"current_steps": 1464, "total_steps": 2457, "loss": 0.6249831318855286, "lr": 9.580092042416427e-06, "epoch": 1.7875457875457874, "percentage": 59.58, "elapsed_time": "1:25:45", "remaining_time": "0:58:10"} | |
| {"current_steps": 1466, "total_steps": 2457, "loss": 0.6827890872955322, "lr": 9.554842057669886e-06, "epoch": 1.7899877899877898, "percentage": 59.67, "elapsed_time": "1:25:51", "remaining_time": "0:58:02"} | |
| {"current_steps": 1468, "total_steps": 2457, "loss": 0.7608170509338379, "lr": 9.529603743633229e-06, "epoch": 1.7924297924297923, "percentage": 59.75, "elapsed_time": "1:25:58", "remaining_time": "0:57:55"} | |
| {"current_steps": 1470, "total_steps": 2457, "loss": 0.9152241945266724, "lr": 9.504377304124346e-06, "epoch": 1.7948717948717947, "percentage": 59.83, "elapsed_time": "1:26:04", "remaining_time": "0:57:47"} | |
| {"current_steps": 1472, "total_steps": 2457, "loss": 0.8515353202819824, "lr": 9.47916294286523e-06, "epoch": 1.7973137973137974, "percentage": 59.91, "elapsed_time": "1:26:11", "remaining_time": "0:57:40"} | |
| {"current_steps": 1474, "total_steps": 2457, "loss": 0.5703706741333008, "lr": 9.453960863480333e-06, "epoch": 1.7997557997557998, "percentage": 59.99, "elapsed_time": "1:26:19", "remaining_time": "0:57:34"} | |
| {"current_steps": 1476, "total_steps": 2457, "loss": 0.7551999092102051, "lr": 9.428771269494926e-06, "epoch": 1.8021978021978022, "percentage": 60.07, "elapsed_time": "1:26:25", "remaining_time": "0:57:26"} | |
| {"current_steps": 1478, "total_steps": 2457, "loss": 0.6955189108848572, "lr": 9.403594364333444e-06, "epoch": 1.8046398046398047, "percentage": 60.15, "elapsed_time": "1:26:31", "remaining_time": "0:57:18"} | |
| {"current_steps": 1480, "total_steps": 2457, "loss": 0.42793938517570496, "lr": 9.378430351317854e-06, "epoch": 1.807081807081807, "percentage": 60.24, "elapsed_time": "1:26:37", "remaining_time": "0:57:10"} | |
| {"current_steps": 1482, "total_steps": 2457, "loss": 0.6840672492980957, "lr": 9.353279433666014e-06, "epoch": 1.8095238095238095, "percentage": 60.32, "elapsed_time": "1:26:43", "remaining_time": "0:57:03"} | |
| {"current_steps": 1484, "total_steps": 2457, "loss": 0.893316924571991, "lr": 9.328141814490021e-06, "epoch": 1.811965811965812, "percentage": 60.4, "elapsed_time": "1:26:49", "remaining_time": "0:56:55"} | |
| {"current_steps": 1486, "total_steps": 2457, "loss": 0.872158944606781, "lr": 9.303017696794578e-06, "epoch": 1.8144078144078144, "percentage": 60.48, "elapsed_time": "1:26:57", "remaining_time": "0:56:49"} | |
| {"current_steps": 1488, "total_steps": 2457, "loss": 0.6238676905632019, "lr": 9.277907283475358e-06, "epoch": 1.8168498168498168, "percentage": 60.56, "elapsed_time": "1:27:03", "remaining_time": "0:56:41"} | |
| {"current_steps": 1490, "total_steps": 2457, "loss": 0.6716984510421753, "lr": 9.252810777317351e-06, "epoch": 1.8192918192918193, "percentage": 60.64, "elapsed_time": "1:27:09", "remaining_time": "0:56:34"} | |
| {"current_steps": 1492, "total_steps": 2457, "loss": 0.8512567281723022, "lr": 9.227728380993253e-06, "epoch": 1.8217338217338217, "percentage": 60.72, "elapsed_time": "1:27:16", "remaining_time": "0:56:26"} | |
| {"current_steps": 1494, "total_steps": 2457, "loss": 0.5891348123550415, "lr": 9.202660297061798e-06, "epoch": 1.8241758241758241, "percentage": 60.81, "elapsed_time": "1:27:21", "remaining_time": "0:56:18"} | |
| {"current_steps": 1496, "total_steps": 2457, "loss": 0.8717406392097473, "lr": 9.177606727966142e-06, "epoch": 1.8266178266178266, "percentage": 60.89, "elapsed_time": "1:27:27", "remaining_time": "0:56:10"} | |
| {"current_steps": 1498, "total_steps": 2457, "loss": 1.3341138362884521, "lr": 9.15256787603222e-06, "epoch": 1.8290598290598292, "percentage": 60.97, "elapsed_time": "1:27:33", "remaining_time": "0:56:03"} | |
| {"current_steps": 1500, "total_steps": 2457, "loss": 1.2278974056243896, "lr": 9.127543943467128e-06, "epoch": 1.8315018315018317, "percentage": 61.05, "elapsed_time": "1:27:40", "remaining_time": "0:55:56"} | |
| {"current_steps": 1502, "total_steps": 2457, "loss": 0.6873140335083008, "lr": 9.102535132357457e-06, "epoch": 1.833943833943834, "percentage": 61.13, "elapsed_time": "1:27:47", "remaining_time": "0:55:49"} | |
| {"current_steps": 1504, "total_steps": 2457, "loss": 0.7067763209342957, "lr": 9.077541644667697e-06, "epoch": 1.8363858363858365, "percentage": 61.21, "elapsed_time": "1:27:53", "remaining_time": "0:55:41"} | |
| {"current_steps": 1506, "total_steps": 2457, "loss": 0.6803405284881592, "lr": 9.052563682238587e-06, "epoch": 1.838827838827839, "percentage": 61.29, "elapsed_time": "1:27:59", "remaining_time": "0:55:34"} | |
| {"current_steps": 1508, "total_steps": 2457, "loss": 0.6593731641769409, "lr": 9.02760144678548e-06, "epoch": 1.8412698412698414, "percentage": 61.38, "elapsed_time": "1:28:05", "remaining_time": "0:55:26"} | |
| {"current_steps": 1510, "total_steps": 2457, "loss": 0.8603323101997375, "lr": 9.00265513989673e-06, "epoch": 1.8437118437118438, "percentage": 61.46, "elapsed_time": "1:28:11", "remaining_time": "0:55:18"} | |
| {"current_steps": 1512, "total_steps": 2457, "loss": 0.8412877917289734, "lr": 8.977724963032056e-06, "epoch": 1.8461538461538463, "percentage": 61.54, "elapsed_time": "1:28:18", "remaining_time": "0:55:11"} | |
| {"current_steps": 1514, "total_steps": 2457, "loss": 1.0396430492401123, "lr": 8.952811117520914e-06, "epoch": 1.8485958485958487, "percentage": 61.62, "elapsed_time": "1:28:25", "remaining_time": "0:55:04"} | |
| {"current_steps": 1516, "total_steps": 2457, "loss": 0.6088389754295349, "lr": 8.927913804560864e-06, "epoch": 1.8510378510378511, "percentage": 61.7, "elapsed_time": "1:28:31", "remaining_time": "0:54:57"} | |
| {"current_steps": 1518, "total_steps": 2457, "loss": 1.1635559797286987, "lr": 8.903033225215975e-06, "epoch": 1.8534798534798536, "percentage": 61.78, "elapsed_time": "1:28:38", "remaining_time": "0:54:50"} | |
| {"current_steps": 1520, "total_steps": 2457, "loss": 0.631327748298645, "lr": 8.878169580415154e-06, "epoch": 1.855921855921856, "percentage": 61.86, "elapsed_time": "1:28:44", "remaining_time": "0:54:42"} | |
| {"current_steps": 1522, "total_steps": 2457, "loss": 0.902554452419281, "lr": 8.85332307095057e-06, "epoch": 1.8583638583638584, "percentage": 61.95, "elapsed_time": "1:28:51", "remaining_time": "0:54:34"} | |
| {"current_steps": 1524, "total_steps": 2457, "loss": 0.8101663589477539, "lr": 8.828493897475998e-06, "epoch": 1.8608058608058609, "percentage": 62.03, "elapsed_time": "1:28:58", "remaining_time": "0:54:28"} | |
| {"current_steps": 1526, "total_steps": 2457, "loss": 0.7383776903152466, "lr": 8.803682260505216e-06, "epoch": 1.8632478632478633, "percentage": 62.11, "elapsed_time": "1:29:03", "remaining_time": "0:54:20"} | |
| {"current_steps": 1528, "total_steps": 2457, "loss": 0.7297862768173218, "lr": 8.778888360410385e-06, "epoch": 1.8656898656898657, "percentage": 62.19, "elapsed_time": "1:29:09", "remaining_time": "0:54:12"} | |
| {"current_steps": 1530, "total_steps": 2457, "loss": 0.8971010446548462, "lr": 8.754112397420426e-06, "epoch": 1.8681318681318682, "percentage": 62.27, "elapsed_time": "1:29:15", "remaining_time": "0:54:04"} | |
| {"current_steps": 1532, "total_steps": 2457, "loss": 0.7592481374740601, "lr": 8.729354571619404e-06, "epoch": 1.8705738705738706, "percentage": 62.35, "elapsed_time": "1:29:22", "remaining_time": "0:53:58"} | |
| {"current_steps": 1534, "total_steps": 2457, "loss": 0.8079948425292969, "lr": 8.704615082944914e-06, "epoch": 1.873015873015873, "percentage": 62.43, "elapsed_time": "1:29:29", "remaining_time": "0:53:50"} | |
| {"current_steps": 1536, "total_steps": 2457, "loss": 1.000016450881958, "lr": 8.679894131186462e-06, "epoch": 1.8754578754578755, "percentage": 62.52, "elapsed_time": "1:29:36", "remaining_time": "0:53:43"} | |
| {"current_steps": 1538, "total_steps": 2457, "loss": 0.8313310742378235, "lr": 8.655191915983859e-06, "epoch": 1.877899877899878, "percentage": 62.6, "elapsed_time": "1:29:42", "remaining_time": "0:53:36"} | |
| {"current_steps": 1540, "total_steps": 2457, "loss": 0.9431169033050537, "lr": 8.630508636825602e-06, "epoch": 1.8803418803418803, "percentage": 62.68, "elapsed_time": "1:29:48", "remaining_time": "0:53:28"} | |
| {"current_steps": 1542, "total_steps": 2457, "loss": 0.9815627336502075, "lr": 8.605844493047269e-06, "epoch": 1.8827838827838828, "percentage": 62.76, "elapsed_time": "1:29:54", "remaining_time": "0:53:21"} | |
| {"current_steps": 1544, "total_steps": 2457, "loss": 0.7461444735527039, "lr": 8.581199683829899e-06, "epoch": 1.8852258852258852, "percentage": 62.84, "elapsed_time": "1:30:01", "remaining_time": "0:53:13"} | |
| {"current_steps": 1546, "total_steps": 2457, "loss": 0.9441168904304504, "lr": 8.556574408198399e-06, "epoch": 1.8876678876678876, "percentage": 62.92, "elapsed_time": "1:30:08", "remaining_time": "0:53:07"} | |
| {"current_steps": 1548, "total_steps": 2457, "loss": 0.8527262210845947, "lr": 8.531968865019919e-06, "epoch": 1.89010989010989, "percentage": 63.0, "elapsed_time": "1:30:15", "remaining_time": "0:53:00"} | |
| {"current_steps": 1550, "total_steps": 2457, "loss": 0.47991418838500977, "lr": 8.507383253002264e-06, "epoch": 1.8925518925518925, "percentage": 63.09, "elapsed_time": "1:30:21", "remaining_time": "0:52:52"} | |
| {"current_steps": 1552, "total_steps": 2457, "loss": 0.8953297138214111, "lr": 8.482817770692276e-06, "epoch": 1.894993894993895, "percentage": 63.17, "elapsed_time": "1:30:27", "remaining_time": "0:52:45"} | |
| {"current_steps": 1554, "total_steps": 2457, "loss": 0.598823070526123, "lr": 8.458272616474226e-06, "epoch": 1.8974358974358974, "percentage": 63.25, "elapsed_time": "1:30:33", "remaining_time": "0:52:37"} | |
| {"current_steps": 1556, "total_steps": 2457, "loss": 1.0903539657592773, "lr": 8.43374798856824e-06, "epoch": 1.8998778998778998, "percentage": 63.33, "elapsed_time": "1:30:40", "remaining_time": "0:52:30"} | |
| {"current_steps": 1558, "total_steps": 2457, "loss": 0.6560428738594055, "lr": 8.40924408502866e-06, "epoch": 1.9023199023199022, "percentage": 63.41, "elapsed_time": "1:30:46", "remaining_time": "0:52:22"} | |
| {"current_steps": 1560, "total_steps": 2457, "loss": 0.553628146648407, "lr": 8.384761103742476e-06, "epoch": 1.9047619047619047, "percentage": 63.49, "elapsed_time": "1:30:51", "remaining_time": "0:52:14"} | |
| {"current_steps": 1562, "total_steps": 2457, "loss": 0.8809893727302551, "lr": 8.360299242427713e-06, "epoch": 1.907203907203907, "percentage": 63.57, "elapsed_time": "1:30:59", "remaining_time": "0:52:08"} | |
| {"current_steps": 1564, "total_steps": 2457, "loss": 0.7752953171730042, "lr": 8.335858698631829e-06, "epoch": 1.9096459096459095, "percentage": 63.65, "elapsed_time": "1:31:06", "remaining_time": "0:52:01"} | |
| {"current_steps": 1566, "total_steps": 2457, "loss": 0.937446653842926, "lr": 8.311439669730139e-06, "epoch": 1.912087912087912, "percentage": 63.74, "elapsed_time": "1:31:12", "remaining_time": "0:51:53"} | |
| {"current_steps": 1568, "total_steps": 2457, "loss": 0.9597198963165283, "lr": 8.287042352924206e-06, "epoch": 1.9145299145299144, "percentage": 63.82, "elapsed_time": "1:31:18", "remaining_time": "0:51:45"} | |
| {"current_steps": 1570, "total_steps": 2457, "loss": 0.6756553053855896, "lr": 8.26266694524024e-06, "epoch": 1.9169719169719168, "percentage": 63.9, "elapsed_time": "1:31:24", "remaining_time": "0:51:38"} | |
| {"current_steps": 1572, "total_steps": 2457, "loss": 0.8379277586936951, "lr": 8.238313643527533e-06, "epoch": 1.9194139194139193, "percentage": 63.98, "elapsed_time": "1:31:30", "remaining_time": "0:51:30"} | |
| {"current_steps": 1574, "total_steps": 2457, "loss": 0.7130874991416931, "lr": 8.213982644456856e-06, "epoch": 1.9218559218559217, "percentage": 64.06, "elapsed_time": "1:31:36", "remaining_time": "0:51:23"} | |
| {"current_steps": 1576, "total_steps": 2457, "loss": 0.7871428728103638, "lr": 8.189674144518864e-06, "epoch": 1.9242979242979243, "percentage": 64.14, "elapsed_time": "1:31:43", "remaining_time": "0:51:16"} | |
| {"current_steps": 1578, "total_steps": 2457, "loss": 0.7644234895706177, "lr": 8.165388340022507e-06, "epoch": 1.9267399267399268, "percentage": 64.22, "elapsed_time": "1:31:49", "remaining_time": "0:51:08"} | |
| {"current_steps": 1580, "total_steps": 2457, "loss": 0.9481227397918701, "lr": 8.14112542709347e-06, "epoch": 1.9291819291819292, "percentage": 64.31, "elapsed_time": "1:31:56", "remaining_time": "0:51:02"} | |
| {"current_steps": 1582, "total_steps": 2457, "loss": 0.2258923351764679, "lr": 8.116885601672557e-06, "epoch": 1.9316239316239316, "percentage": 64.39, "elapsed_time": "1:32:00", "remaining_time": "0:50:53"} | |
| {"current_steps": 1584, "total_steps": 2457, "loss": 0.5065496563911438, "lr": 8.09266905951413e-06, "epoch": 1.934065934065934, "percentage": 64.47, "elapsed_time": "1:32:06", "remaining_time": "0:50:45"} | |
| {"current_steps": 1586, "total_steps": 2457, "loss": 0.5920478701591492, "lr": 8.068475996184527e-06, "epoch": 1.9365079365079365, "percentage": 64.55, "elapsed_time": "1:32:13", "remaining_time": "0:50:39"} | |
| {"current_steps": 1588, "total_steps": 2457, "loss": 0.9720399379730225, "lr": 8.044306607060466e-06, "epoch": 1.938949938949939, "percentage": 64.63, "elapsed_time": "1:32:19", "remaining_time": "0:50:31"} | |
| {"current_steps": 1590, "total_steps": 2457, "loss": 1.0517313480377197, "lr": 8.02016108732748e-06, "epoch": 1.9413919413919414, "percentage": 64.71, "elapsed_time": "1:32:26", "remaining_time": "0:50:24"} | |
| {"current_steps": 1592, "total_steps": 2457, "loss": 1.0347234010696411, "lr": 7.996039631978352e-06, "epoch": 1.9438339438339438, "percentage": 64.79, "elapsed_time": "1:32:33", "remaining_time": "0:50:17"} | |
| {"current_steps": 1594, "total_steps": 2457, "loss": 0.6489905118942261, "lr": 7.97194243581151e-06, "epoch": 1.9462759462759462, "percentage": 64.88, "elapsed_time": "1:32:39", "remaining_time": "0:50:09"} | |
| {"current_steps": 1596, "total_steps": 2457, "loss": 0.568684458732605, "lr": 7.947869693429486e-06, "epoch": 1.9487179487179487, "percentage": 64.96, "elapsed_time": "1:32:44", "remaining_time": "0:50:01"} | |
| {"current_steps": 1598, "total_steps": 2457, "loss": 0.6664155125617981, "lr": 7.923821599237322e-06, "epoch": 1.9511599511599511, "percentage": 65.04, "elapsed_time": "1:32:51", "remaining_time": "0:49:54"} | |
| {"current_steps": 1600, "total_steps": 2457, "loss": 0.7015742063522339, "lr": 7.899798347441005e-06, "epoch": 1.9536019536019538, "percentage": 65.12, "elapsed_time": "1:32:56", "remaining_time": "0:49:47"} | |
| {"current_steps": 1602, "total_steps": 2457, "loss": 0.9169449210166931, "lr": 7.87580013204591e-06, "epoch": 1.9560439560439562, "percentage": 65.2, "elapsed_time": "1:33:02", "remaining_time": "0:49:39"} | |
| {"current_steps": 1604, "total_steps": 2457, "loss": 0.8345751762390137, "lr": 7.85182714685522e-06, "epoch": 1.9584859584859586, "percentage": 65.28, "elapsed_time": "1:33:08", "remaining_time": "0:49:32"} | |
| {"current_steps": 1606, "total_steps": 2457, "loss": 1.1974244117736816, "lr": 7.827879585468363e-06, "epoch": 1.960927960927961, "percentage": 65.36, "elapsed_time": "1:33:15", "remaining_time": "0:49:25"} | |
| {"current_steps": 1608, "total_steps": 2457, "loss": 1.1730899810791016, "lr": 7.803957641279457e-06, "epoch": 1.9633699633699635, "percentage": 65.45, "elapsed_time": "1:33:22", "remaining_time": "0:49:17"} | |
| {"current_steps": 1610, "total_steps": 2457, "loss": 0.9335651397705078, "lr": 7.780061507475738e-06, "epoch": 1.965811965811966, "percentage": 65.53, "elapsed_time": "1:33:29", "remaining_time": "0:49:11"} | |
| {"current_steps": 1612, "total_steps": 2457, "loss": 0.8546837568283081, "lr": 7.756191377036004e-06, "epoch": 1.9682539682539684, "percentage": 65.61, "elapsed_time": "1:33:35", "remaining_time": "0:49:03"} | |
| {"current_steps": 1614, "total_steps": 2457, "loss": 1.0305918455123901, "lr": 7.732347442729062e-06, "epoch": 1.9706959706959708, "percentage": 65.69, "elapsed_time": "1:33:41", "remaining_time": "0:48:56"} | |
| {"current_steps": 1616, "total_steps": 2457, "loss": 0.8775286674499512, "lr": 7.708529897112158e-06, "epoch": 1.9731379731379732, "percentage": 65.77, "elapsed_time": "1:33:47", "remaining_time": "0:48:48"} | |
| {"current_steps": 1618, "total_steps": 2457, "loss": 0.8464508056640625, "lr": 7.684738932529441e-06, "epoch": 1.9755799755799757, "percentage": 65.85, "elapsed_time": "1:33:53", "remaining_time": "0:48:41"} | |
| {"current_steps": 1620, "total_steps": 2457, "loss": 1.035678505897522, "lr": 7.660974741110387e-06, "epoch": 1.978021978021978, "percentage": 65.93, "elapsed_time": "1:33:59", "remaining_time": "0:48:33"} | |
| {"current_steps": 1622, "total_steps": 2457, "loss": 0.6054593324661255, "lr": 7.637237514768265e-06, "epoch": 1.9804639804639805, "percentage": 66.02, "elapsed_time": "1:34:05", "remaining_time": "0:48:26"} | |
| {"current_steps": 1624, "total_steps": 2457, "loss": 0.45836907625198364, "lr": 7.613527445198576e-06, "epoch": 1.982905982905983, "percentage": 66.1, "elapsed_time": "1:34:10", "remaining_time": "0:48:18"} | |
| {"current_steps": 1626, "total_steps": 2457, "loss": 0.7117047905921936, "lr": 7.5898447238775264e-06, "epoch": 1.9853479853479854, "percentage": 66.18, "elapsed_time": "1:34:17", "remaining_time": "0:48:11"} | |
| {"current_steps": 1628, "total_steps": 2457, "loss": 1.0821315050125122, "lr": 7.566189542060445e-06, "epoch": 1.9877899877899878, "percentage": 66.26, "elapsed_time": "1:34:23", "remaining_time": "0:48:04"} | |
| {"current_steps": 1630, "total_steps": 2457, "loss": 1.1502904891967773, "lr": 7.5425620907802655e-06, "epoch": 1.9902319902319903, "percentage": 66.34, "elapsed_time": "1:34:30", "remaining_time": "0:47:56"} | |
| {"current_steps": 1632, "total_steps": 2457, "loss": 0.8673257231712341, "lr": 7.518962560845986e-06, "epoch": 1.9926739926739927, "percentage": 66.42, "elapsed_time": "1:34:36", "remaining_time": "0:47:49"} | |
| {"current_steps": 1634, "total_steps": 2457, "loss": 0.75059574842453, "lr": 7.4953911428411085e-06, "epoch": 1.9951159951159951, "percentage": 66.5, "elapsed_time": "1:34:43", "remaining_time": "0:47:42"} | |
| {"current_steps": 1636, "total_steps": 2457, "loss": 1.0258231163024902, "lr": 7.4718480271221125e-06, "epoch": 1.9975579975579976, "percentage": 66.59, "elapsed_time": "1:34:49", "remaining_time": "0:47:35"} | |
| {"current_steps": 1638, "total_steps": 2457, "loss": 0.9197133779525757, "lr": 7.448333403816926e-06, "epoch": 2.0, "percentage": 66.67, "elapsed_time": "1:34:55", "remaining_time": "0:47:27"} | |
| {"current_steps": 1640, "total_steps": 2457, "loss": 0.6060487627983093, "lr": 7.424847462823361e-06, "epoch": 2.0024420024420024, "percentage": 66.75, "elapsed_time": "1:35:03", "remaining_time": "0:47:21"} | |
| {"current_steps": 1642, "total_steps": 2457, "loss": 0.47724178433418274, "lr": 7.401390393807615e-06, "epoch": 2.004884004884005, "percentage": 66.83, "elapsed_time": "1:35:10", "remaining_time": "0:47:14"} | |
| {"current_steps": 1644, "total_steps": 2457, "loss": 0.5051848292350769, "lr": 7.37796238620272e-06, "epoch": 2.0073260073260073, "percentage": 66.91, "elapsed_time": "1:35:17", "remaining_time": "0:47:07"} | |
| {"current_steps": 1646, "total_steps": 2457, "loss": 0.438951700925827, "lr": 7.3545636292070055e-06, "epoch": 2.0097680097680097, "percentage": 66.99, "elapsed_time": "1:35:23", "remaining_time": "0:47:00"} | |
| {"current_steps": 1648, "total_steps": 2457, "loss": 0.528706431388855, "lr": 7.331194311782597e-06, "epoch": 2.012210012210012, "percentage": 67.07, "elapsed_time": "1:35:29", "remaining_time": "0:46:52"} | |
| {"current_steps": 1650, "total_steps": 2457, "loss": 0.3387841284275055, "lr": 7.307854622653863e-06, "epoch": 2.0146520146520146, "percentage": 67.16, "elapsed_time": "1:35:35", "remaining_time": "0:46:45"} | |
| {"current_steps": 1652, "total_steps": 2457, "loss": 0.6135000586509705, "lr": 7.284544750305902e-06, "epoch": 2.017094017094017, "percentage": 67.24, "elapsed_time": "1:35:43", "remaining_time": "0:46:38"} | |
| {"current_steps": 1654, "total_steps": 2457, "loss": 0.4525635838508606, "lr": 7.261264882983024e-06, "epoch": 2.0195360195360195, "percentage": 67.32, "elapsed_time": "1:35:49", "remaining_time": "0:46:31"} | |
| {"current_steps": 1656, "total_steps": 2457, "loss": 0.4565449655056, "lr": 7.238015208687226e-06, "epoch": 2.021978021978022, "percentage": 67.4, "elapsed_time": "1:35:55", "remaining_time": "0:46:23"} | |
| {"current_steps": 1658, "total_steps": 2457, "loss": 0.4369199872016907, "lr": 7.214795915176671e-06, "epoch": 2.0244200244200243, "percentage": 67.48, "elapsed_time": "1:36:01", "remaining_time": "0:46:16"} | |
| {"current_steps": 1660, "total_steps": 2457, "loss": 0.6220426559448242, "lr": 7.191607189964181e-06, "epoch": 2.0268620268620268, "percentage": 67.56, "elapsed_time": "1:36:08", "remaining_time": "0:46:09"} | |
| {"current_steps": 1662, "total_steps": 2457, "loss": 0.557952880859375, "lr": 7.16844922031571e-06, "epoch": 2.029304029304029, "percentage": 67.64, "elapsed_time": "1:36:14", "remaining_time": "0:46:01"} | |
| {"current_steps": 1664, "total_steps": 2457, "loss": 0.2245861142873764, "lr": 7.145322193248838e-06, "epoch": 2.0317460317460316, "percentage": 67.72, "elapsed_time": "1:36:19", "remaining_time": "0:45:54"} | |
| {"current_steps": 1666, "total_steps": 2457, "loss": 0.40176424384117126, "lr": 7.122226295531267e-06, "epoch": 2.034188034188034, "percentage": 67.81, "elapsed_time": "1:36:27", "remaining_time": "0:45:47"} | |
| {"current_steps": 1668, "total_steps": 2457, "loss": 0.4665899872779846, "lr": 7.099161713679308e-06, "epoch": 2.0366300366300365, "percentage": 67.89, "elapsed_time": "1:36:33", "remaining_time": "0:45:40"} | |
| {"current_steps": 1670, "total_steps": 2457, "loss": 0.6036043763160706, "lr": 7.07612863395636e-06, "epoch": 2.039072039072039, "percentage": 67.97, "elapsed_time": "1:36:38", "remaining_time": "0:45:32"} | |
| {"current_steps": 1672, "total_steps": 2457, "loss": 0.5682324171066284, "lr": 7.053127242371434e-06, "epoch": 2.0415140415140414, "percentage": 68.05, "elapsed_time": "1:36:45", "remaining_time": "0:45:25"} | |
| {"current_steps": 1674, "total_steps": 2457, "loss": 0.5213257074356079, "lr": 7.030157724677631e-06, "epoch": 2.043956043956044, "percentage": 68.13, "elapsed_time": "1:36:51", "remaining_time": "0:45:18"} | |
| {"current_steps": 1676, "total_steps": 2457, "loss": 0.3227638006210327, "lr": 7.0072202663706405e-06, "epoch": 2.0463980463980462, "percentage": 68.21, "elapsed_time": "1:36:57", "remaining_time": "0:45:10"} | |
| {"current_steps": 1678, "total_steps": 2457, "loss": 0.5378082990646362, "lr": 6.984315052687258e-06, "epoch": 2.0488400488400487, "percentage": 68.29, "elapsed_time": "1:37:04", "remaining_time": "0:45:03"} | |
| {"current_steps": 1680, "total_steps": 2457, "loss": 0.49545711278915405, "lr": 6.96144226860388e-06, "epoch": 2.051282051282051, "percentage": 68.38, "elapsed_time": "1:37:11", "remaining_time": "0:44:56"} | |
| {"current_steps": 1682, "total_steps": 2457, "loss": 0.3199822008609772, "lr": 6.938602098835e-06, "epoch": 2.0537240537240535, "percentage": 68.46, "elapsed_time": "1:37:16", "remaining_time": "0:44:49"} | |
| {"current_steps": 1684, "total_steps": 2457, "loss": 0.3839988112449646, "lr": 6.915794727831743e-06, "epoch": 2.056166056166056, "percentage": 68.54, "elapsed_time": "1:37:22", "remaining_time": "0:44:41"} | |
| {"current_steps": 1686, "total_steps": 2457, "loss": 0.3781861662864685, "lr": 6.893020339780341e-06, "epoch": 2.0586080586080584, "percentage": 68.62, "elapsed_time": "1:37:28", "remaining_time": "0:44:34"} | |
| {"current_steps": 1688, "total_steps": 2457, "loss": 0.6202837824821472, "lr": 6.870279118600679e-06, "epoch": 2.061050061050061, "percentage": 68.7, "elapsed_time": "1:37:33", "remaining_time": "0:44:26"} | |
| {"current_steps": 1690, "total_steps": 2457, "loss": 0.46027785539627075, "lr": 6.847571247944791e-06, "epoch": 2.0634920634920633, "percentage": 68.78, "elapsed_time": "1:37:39", "remaining_time": "0:44:19"} | |
| {"current_steps": 1692, "total_steps": 2457, "loss": 0.31774628162384033, "lr": 6.8248969111953825e-06, "epoch": 2.065934065934066, "percentage": 68.86, "elapsed_time": "1:37:46", "remaining_time": "0:44:12"} | |
| {"current_steps": 1694, "total_steps": 2457, "loss": 0.47486642003059387, "lr": 6.80225629146434e-06, "epoch": 2.0683760683760686, "percentage": 68.95, "elapsed_time": "1:37:53", "remaining_time": "0:44:05"} | |
| {"current_steps": 1696, "total_steps": 2457, "loss": 0.4364372789859772, "lr": 6.7796495715912694e-06, "epoch": 2.070818070818071, "percentage": 69.03, "elapsed_time": "1:37:59", "remaining_time": "0:43:58"} | |
| {"current_steps": 1698, "total_steps": 2457, "loss": 0.4288478493690491, "lr": 6.757076934142013e-06, "epoch": 2.0732600732600734, "percentage": 69.11, "elapsed_time": "1:38:06", "remaining_time": "0:43:51"} | |
| {"current_steps": 1700, "total_steps": 2457, "loss": 0.4020456075668335, "lr": 6.734538561407158e-06, "epoch": 2.075702075702076, "percentage": 69.19, "elapsed_time": "1:38:12", "remaining_time": "0:43:44"} | |
| {"current_steps": 1702, "total_steps": 2457, "loss": 0.26895561814308167, "lr": 6.712034635400593e-06, "epoch": 2.0781440781440783, "percentage": 69.27, "elapsed_time": "1:38:18", "remaining_time": "0:43:36"} | |
| {"current_steps": 1704, "total_steps": 2457, "loss": 0.2938929796218872, "lr": 6.689565337858019e-06, "epoch": 2.0805860805860807, "percentage": 69.35, "elapsed_time": "1:38:25", "remaining_time": "0:43:29"} | |
| {"current_steps": 1706, "total_steps": 2457, "loss": 0.19200079143047333, "lr": 6.6671308502354844e-06, "epoch": 2.083028083028083, "percentage": 69.43, "elapsed_time": "1:38:32", "remaining_time": "0:43:22"} | |
| {"current_steps": 1708, "total_steps": 2457, "loss": 0.5591083765029907, "lr": 6.644731353707927e-06, "epoch": 2.0854700854700856, "percentage": 69.52, "elapsed_time": "1:38:37", "remaining_time": "0:43:15"} | |
| {"current_steps": 1710, "total_steps": 2457, "loss": 0.2770901918411255, "lr": 6.622367029167702e-06, "epoch": 2.087912087912088, "percentage": 69.6, "elapsed_time": "1:38:43", "remaining_time": "0:43:07"} | |
| {"current_steps": 1712, "total_steps": 2457, "loss": 0.394546240568161, "lr": 6.600038057223126e-06, "epoch": 2.0903540903540905, "percentage": 69.68, "elapsed_time": "1:38:49", "remaining_time": "0:43:00"} | |
| {"current_steps": 1714, "total_steps": 2457, "loss": 0.4641517996788025, "lr": 6.577744618197017e-06, "epoch": 2.092796092796093, "percentage": 69.76, "elapsed_time": "1:38:55", "remaining_time": "0:42:53"} | |
| {"current_steps": 1716, "total_steps": 2457, "loss": 0.32657861709594727, "lr": 6.555486892125243e-06, "epoch": 2.0952380952380953, "percentage": 69.84, "elapsed_time": "1:39:02", "remaining_time": "0:42:45"} | |
| {"current_steps": 1718, "total_steps": 2457, "loss": 0.6660332083702087, "lr": 6.533265058755256e-06, "epoch": 2.0976800976800978, "percentage": 69.92, "elapsed_time": "1:39:08", "remaining_time": "0:42:38"} | |
| {"current_steps": 1720, "total_steps": 2457, "loss": 0.48777180910110474, "lr": 6.5110792975446515e-06, "epoch": 2.1001221001221, "percentage": 70.0, "elapsed_time": "1:39:14", "remaining_time": "0:42:31"} | |
| {"current_steps": 1722, "total_steps": 2457, "loss": 0.6992468237876892, "lr": 6.488929787659721e-06, "epoch": 2.1025641025641026, "percentage": 70.09, "elapsed_time": "1:39:22", "remaining_time": "0:42:25"} | |
| {"current_steps": 1724, "total_steps": 2457, "loss": 0.3529256284236908, "lr": 6.466816707973991e-06, "epoch": 2.105006105006105, "percentage": 70.17, "elapsed_time": "1:39:29", "remaining_time": "0:42:18"} | |
| {"current_steps": 1726, "total_steps": 2457, "loss": 0.45478177070617676, "lr": 6.444740237066791e-06, "epoch": 2.1074481074481075, "percentage": 70.25, "elapsed_time": "1:39:35", "remaining_time": "0:42:10"} | |
| {"current_steps": 1728, "total_steps": 2457, "loss": 0.3780288100242615, "lr": 6.422700553221817e-06, "epoch": 2.10989010989011, "percentage": 70.33, "elapsed_time": "1:39:40", "remaining_time": "0:42:03"} | |
| {"current_steps": 1730, "total_steps": 2457, "loss": 0.42669016122817993, "lr": 6.400697834425662e-06, "epoch": 2.1123321123321124, "percentage": 70.41, "elapsed_time": "1:39:46", "remaining_time": "0:41:55"} | |
| {"current_steps": 1732, "total_steps": 2457, "loss": 0.34392303228378296, "lr": 6.378732258366421e-06, "epoch": 2.114774114774115, "percentage": 70.49, "elapsed_time": "1:39:51", "remaining_time": "0:41:47"} | |
| {"current_steps": 1734, "total_steps": 2457, "loss": 0.1719311773777008, "lr": 6.356804002432225e-06, "epoch": 2.1172161172161172, "percentage": 70.57, "elapsed_time": "1:39:57", "remaining_time": "0:41:40"} | |
| {"current_steps": 1736, "total_steps": 2457, "loss": 0.5892414450645447, "lr": 6.334913243709809e-06, "epoch": 2.1196581196581197, "percentage": 70.66, "elapsed_time": "1:40:04", "remaining_time": "0:41:33"} | |
| {"current_steps": 1738, "total_steps": 2457, "loss": 0.3725854456424713, "lr": 6.313060158983104e-06, "epoch": 2.122100122100122, "percentage": 70.74, "elapsed_time": "1:40:10", "remaining_time": "0:41:26"} | |
| {"current_steps": 1740, "total_steps": 2457, "loss": 0.4878256618976593, "lr": 6.291244924731794e-06, "epoch": 2.1245421245421245, "percentage": 70.82, "elapsed_time": "1:40:17", "remaining_time": "0:41:19"} | |
| {"current_steps": 1742, "total_steps": 2457, "loss": 0.43116888403892517, "lr": 6.26946771712988e-06, "epoch": 2.126984126984127, "percentage": 70.9, "elapsed_time": "1:40:23", "remaining_time": "0:41:12"} | |
| {"current_steps": 1744, "total_steps": 2457, "loss": 0.37520939111709595, "lr": 6.247728712044283e-06, "epoch": 2.1294261294261294, "percentage": 70.98, "elapsed_time": "1:40:29", "remaining_time": "0:41:05"} | |
| {"current_steps": 1746, "total_steps": 2457, "loss": 0.5751076936721802, "lr": 6.226028085033413e-06, "epoch": 2.131868131868132, "percentage": 71.06, "elapsed_time": "1:40:36", "remaining_time": "0:40:58"} | |
| {"current_steps": 1748, "total_steps": 2457, "loss": 0.20154741406440735, "lr": 6.2043660113457325e-06, "epoch": 2.1343101343101343, "percentage": 71.14, "elapsed_time": "1:40:42", "remaining_time": "0:40:50"} | |
| {"current_steps": 1750, "total_steps": 2457, "loss": 0.6898431777954102, "lr": 6.182742665918373e-06, "epoch": 2.1367521367521367, "percentage": 71.23, "elapsed_time": "1:40:47", "remaining_time": "0:40:43"} | |
| {"current_steps": 1752, "total_steps": 2457, "loss": 0.3924607038497925, "lr": 6.161158223375705e-06, "epoch": 2.139194139194139, "percentage": 71.31, "elapsed_time": "1:40:53", "remaining_time": "0:40:36"} | |
| {"current_steps": 1754, "total_steps": 2457, "loss": 0.43264567852020264, "lr": 6.13961285802792e-06, "epoch": 2.1416361416361416, "percentage": 71.39, "elapsed_time": "1:41:01", "remaining_time": "0:40:29"} | |
| {"current_steps": 1756, "total_steps": 2457, "loss": 0.5022901296615601, "lr": 6.118106743869641e-06, "epoch": 2.144078144078144, "percentage": 71.47, "elapsed_time": "1:41:07", "remaining_time": "0:40:22"} | |
| {"current_steps": 1758, "total_steps": 2457, "loss": 0.21431341767311096, "lr": 6.096640054578511e-06, "epoch": 2.1465201465201464, "percentage": 71.55, "elapsed_time": "1:41:14", "remaining_time": "0:40:15"} | |
| {"current_steps": 1760, "total_steps": 2457, "loss": 0.4715498685836792, "lr": 6.075212963513776e-06, "epoch": 2.148962148962149, "percentage": 71.63, "elapsed_time": "1:41:20", "remaining_time": "0:40:08"} | |
| {"current_steps": 1762, "total_steps": 2457, "loss": 0.4320064187049866, "lr": 6.053825643714912e-06, "epoch": 2.1514041514041513, "percentage": 71.71, "elapsed_time": "1:41:25", "remaining_time": "0:40:00"} | |
| {"current_steps": 1764, "total_steps": 2457, "loss": 0.3226162791252136, "lr": 6.032478267900206e-06, "epoch": 2.1538461538461537, "percentage": 71.79, "elapsed_time": "1:41:31", "remaining_time": "0:39:53"} | |
| {"current_steps": 1766, "total_steps": 2457, "loss": 0.2729605436325073, "lr": 6.011171008465363e-06, "epoch": 2.156288156288156, "percentage": 71.88, "elapsed_time": "1:41:37", "remaining_time": "0:39:45"} | |
| {"current_steps": 1768, "total_steps": 2457, "loss": 0.3462582230567932, "lr": 5.989904037482128e-06, "epoch": 2.1587301587301586, "percentage": 71.96, "elapsed_time": "1:41:44", "remaining_time": "0:39:38"} | |
| {"current_steps": 1770, "total_steps": 2457, "loss": 0.38312727212905884, "lr": 5.968677526696882e-06, "epoch": 2.161172161172161, "percentage": 72.04, "elapsed_time": "1:41:51", "remaining_time": "0:39:32"} | |
| {"current_steps": 1772, "total_steps": 2457, "loss": 0.353424072265625, "lr": 5.947491647529267e-06, "epoch": 2.1636141636141635, "percentage": 72.12, "elapsed_time": "1:41:57", "remaining_time": "0:39:24"} | |
| {"current_steps": 1774, "total_steps": 2457, "loss": 0.5065031051635742, "lr": 5.9263465710707814e-06, "epoch": 2.166056166056166, "percentage": 72.2, "elapsed_time": "1:42:03", "remaining_time": "0:39:17"} | |
| {"current_steps": 1776, "total_steps": 2457, "loss": 0.5348921418190002, "lr": 5.905242468083423e-06, "epoch": 2.1684981684981683, "percentage": 72.28, "elapsed_time": "1:42:10", "remaining_time": "0:39:10"} | |
| {"current_steps": 1778, "total_steps": 2457, "loss": 0.27236610651016235, "lr": 5.884179508998299e-06, "epoch": 2.1709401709401708, "percentage": 72.36, "elapsed_time": "1:42:16", "remaining_time": "0:39:03"} | |
| {"current_steps": 1780, "total_steps": 2457, "loss": 0.43548962473869324, "lr": 5.863157863914239e-06, "epoch": 2.173382173382173, "percentage": 72.45, "elapsed_time": "1:42:22", "remaining_time": "0:38:56"} | |
| {"current_steps": 1782, "total_steps": 2457, "loss": 0.5892971754074097, "lr": 5.8421777025964446e-06, "epoch": 2.1758241758241756, "percentage": 72.53, "elapsed_time": "1:42:29", "remaining_time": "0:38:49"} | |
| {"current_steps": 1784, "total_steps": 2457, "loss": 0.4943884313106537, "lr": 5.8212391944750965e-06, "epoch": 2.178266178266178, "percentage": 72.61, "elapsed_time": "1:42:36", "remaining_time": "0:38:42"} | |
| {"current_steps": 1786, "total_steps": 2457, "loss": 0.5425156354904175, "lr": 5.8003425086440015e-06, "epoch": 2.1807081807081805, "percentage": 72.69, "elapsed_time": "1:42:43", "remaining_time": "0:38:35"} | |
| {"current_steps": 1788, "total_steps": 2457, "loss": 0.3213900625705719, "lr": 5.779487813859218e-06, "epoch": 2.183150183150183, "percentage": 72.77, "elapsed_time": "1:42:49", "remaining_time": "0:38:28"} | |
| {"current_steps": 1790, "total_steps": 2457, "loss": 0.46233004331588745, "lr": 5.758675278537692e-06, "epoch": 2.185592185592186, "percentage": 72.85, "elapsed_time": "1:42:55", "remaining_time": "0:38:21"} | |
| {"current_steps": 1792, "total_steps": 2457, "loss": 0.480983167886734, "lr": 5.737905070755907e-06, "epoch": 2.1880341880341883, "percentage": 72.93, "elapsed_time": "1:43:03", "remaining_time": "0:38:14"} | |
| {"current_steps": 1794, "total_steps": 2457, "loss": 0.2742152810096741, "lr": 5.717177358248522e-06, "epoch": 2.1904761904761907, "percentage": 73.02, "elapsed_time": "1:43:08", "remaining_time": "0:38:07"} | |
| {"current_steps": 1796, "total_steps": 2457, "loss": 0.3769078254699707, "lr": 5.696492308407002e-06, "epoch": 2.192918192918193, "percentage": 73.1, "elapsed_time": "1:43:14", "remaining_time": "0:37:59"} | |
| {"current_steps": 1798, "total_steps": 2457, "loss": 0.40196555852890015, "lr": 5.675850088278298e-06, "epoch": 2.1953601953601956, "percentage": 73.18, "elapsed_time": "1:43:20", "remaining_time": "0:37:52"} | |
| {"current_steps": 1800, "total_steps": 2457, "loss": 0.3571450412273407, "lr": 5.655250864563469e-06, "epoch": 2.197802197802198, "percentage": 73.26, "elapsed_time": "1:43:27", "remaining_time": "0:37:45"} | |
| {"current_steps": 1802, "total_steps": 2457, "loss": 0.4585352838039398, "lr": 5.63469480361635e-06, "epoch": 2.2002442002442004, "percentage": 73.34, "elapsed_time": "1:43:34", "remaining_time": "0:37:38"} | |
| {"current_steps": 1804, "total_steps": 2457, "loss": 0.4414786100387573, "lr": 5.614182071442201e-06, "epoch": 2.202686202686203, "percentage": 73.42, "elapsed_time": "1:43:41", "remaining_time": "0:37:31"} | |
| {"current_steps": 1806, "total_steps": 2457, "loss": 0.5657206177711487, "lr": 5.59371283369637e-06, "epoch": 2.2051282051282053, "percentage": 73.5, "elapsed_time": "1:43:47", "remaining_time": "0:37:24"} | |
| {"current_steps": 1808, "total_steps": 2457, "loss": 0.5330032706260681, "lr": 5.573287255682967e-06, "epoch": 2.2075702075702077, "percentage": 73.59, "elapsed_time": "1:43:53", "remaining_time": "0:37:17"} | |
| {"current_steps": 1810, "total_steps": 2457, "loss": 0.2634370028972626, "lr": 5.552905502353502e-06, "epoch": 2.21001221001221, "percentage": 73.67, "elapsed_time": "1:43:59", "remaining_time": "0:37:10"} | |
| {"current_steps": 1812, "total_steps": 2457, "loss": 0.4326469302177429, "lr": 5.532567738305576e-06, "epoch": 2.2124542124542126, "percentage": 73.75, "elapsed_time": "1:44:05", "remaining_time": "0:37:03"} | |
| {"current_steps": 1814, "total_steps": 2457, "loss": 0.1571735441684723, "lr": 5.512274127781552e-06, "epoch": 2.214896214896215, "percentage": 73.83, "elapsed_time": "1:44:11", "remaining_time": "0:36:55"} | |
| {"current_steps": 1816, "total_steps": 2457, "loss": 0.5355442762374878, "lr": 5.492024834667205e-06, "epoch": 2.2173382173382175, "percentage": 73.91, "elapsed_time": "1:44:19", "remaining_time": "0:36:49"} | |
| {"current_steps": 1818, "total_steps": 2457, "loss": 0.38218754529953003, "lr": 5.471820022490422e-06, "epoch": 2.21978021978022, "percentage": 73.99, "elapsed_time": "1:44:25", "remaining_time": "0:36:42"} | |
| {"current_steps": 1820, "total_steps": 2457, "loss": 0.49747079610824585, "lr": 5.451659854419882e-06, "epoch": 2.2222222222222223, "percentage": 74.07, "elapsed_time": "1:44:31", "remaining_time": "0:36:35"} | |
| {"current_steps": 1822, "total_steps": 2457, "loss": 0.2641042172908783, "lr": 5.431544493263714e-06, "epoch": 2.2246642246642248, "percentage": 74.16, "elapsed_time": "1:44:36", "remaining_time": "0:36:27"} | |
| {"current_steps": 1824, "total_steps": 2457, "loss": 0.39929312467575073, "lr": 5.411474101468208e-06, "epoch": 2.227106227106227, "percentage": 74.24, "elapsed_time": "1:44:43", "remaining_time": "0:36:20"} | |
| {"current_steps": 1826, "total_steps": 2457, "loss": 0.2978437840938568, "lr": 5.3914488411165e-06, "epoch": 2.2295482295482296, "percentage": 74.32, "elapsed_time": "1:44:49", "remaining_time": "0:36:13"} | |
| {"current_steps": 1828, "total_steps": 2457, "loss": 0.3673563599586487, "lr": 5.3714688739272396e-06, "epoch": 2.231990231990232, "percentage": 74.4, "elapsed_time": "1:44:56", "remaining_time": "0:36:06"} | |
| {"current_steps": 1830, "total_steps": 2457, "loss": 0.29434409737586975, "lr": 5.351534361253312e-06, "epoch": 2.2344322344322345, "percentage": 74.48, "elapsed_time": "1:45:03", "remaining_time": "0:35:59"} | |
| {"current_steps": 1832, "total_steps": 2457, "loss": 0.46827900409698486, "lr": 5.331645464080526e-06, "epoch": 2.236874236874237, "percentage": 74.56, "elapsed_time": "1:45:09", "remaining_time": "0:35:52"} | |
| {"current_steps": 1834, "total_steps": 2457, "loss": 0.5047073364257812, "lr": 5.311802343026302e-06, "epoch": 2.2393162393162394, "percentage": 74.64, "elapsed_time": "1:45:15", "remaining_time": "0:35:45"} | |
| {"current_steps": 1836, "total_steps": 2457, "loss": 0.40334218740463257, "lr": 5.292005158338394e-06, "epoch": 2.241758241758242, "percentage": 74.73, "elapsed_time": "1:45:21", "remaining_time": "0:35:38"} | |
| {"current_steps": 1838, "total_steps": 2457, "loss": 0.5924956798553467, "lr": 5.272254069893579e-06, "epoch": 2.244200244200244, "percentage": 74.81, "elapsed_time": "1:45:28", "remaining_time": "0:35:31"} | |
| {"current_steps": 1840, "total_steps": 2457, "loss": 0.31219542026519775, "lr": 5.2525492371963785e-06, "epoch": 2.2466422466422467, "percentage": 74.89, "elapsed_time": "1:45:34", "remaining_time": "0:35:24"} | |
| {"current_steps": 1842, "total_steps": 2457, "loss": 0.46928393840789795, "lr": 5.232890819377765e-06, "epoch": 2.249084249084249, "percentage": 74.97, "elapsed_time": "1:45:40", "remaining_time": "0:35:17"} | |
| {"current_steps": 1844, "total_steps": 2457, "loss": 0.4485982060432434, "lr": 5.213278975193874e-06, "epoch": 2.2515262515262515, "percentage": 75.05, "elapsed_time": "1:45:48", "remaining_time": "0:35:10"} | |
| {"current_steps": 1846, "total_steps": 2457, "loss": 0.3948480784893036, "lr": 5.193713863024722e-06, "epoch": 2.253968253968254, "percentage": 75.13, "elapsed_time": "1:45:54", "remaining_time": "0:35:03"} | |
| {"current_steps": 1848, "total_steps": 2457, "loss": 0.3254821300506592, "lr": 5.174195640872937e-06, "epoch": 2.2564102564102564, "percentage": 75.21, "elapsed_time": "1:46:00", "remaining_time": "0:34:55"} | |
| {"current_steps": 1850, "total_steps": 2457, "loss": 0.43265148997306824, "lr": 5.154724466362473e-06, "epoch": 2.258852258852259, "percentage": 75.3, "elapsed_time": "1:46:06", "remaining_time": "0:34:48"} | |
| {"current_steps": 1852, "total_steps": 2457, "loss": 0.5352158546447754, "lr": 5.135300496737335e-06, "epoch": 2.2612942612942613, "percentage": 75.38, "elapsed_time": "1:46:11", "remaining_time": "0:34:41"} | |
| {"current_steps": 1854, "total_steps": 2457, "loss": 0.6833795309066772, "lr": 5.115923888860321e-06, "epoch": 2.2637362637362637, "percentage": 75.46, "elapsed_time": "1:46:18", "remaining_time": "0:34:34"} | |
| {"current_steps": 1856, "total_steps": 2457, "loss": 0.6043341755867004, "lr": 5.096594799211748e-06, "epoch": 2.266178266178266, "percentage": 75.54, "elapsed_time": "1:46:25", "remaining_time": "0:34:27"} | |
| {"current_steps": 1858, "total_steps": 2457, "loss": 0.6158211827278137, "lr": 5.0773133838881806e-06, "epoch": 2.2686202686202686, "percentage": 75.62, "elapsed_time": "1:46:32", "remaining_time": "0:34:20"} | |
| {"current_steps": 1860, "total_steps": 2457, "loss": 0.7204128503799438, "lr": 5.058079798601184e-06, "epoch": 2.271062271062271, "percentage": 75.7, "elapsed_time": "1:46:38", "remaining_time": "0:34:13"} | |
| {"current_steps": 1862, "total_steps": 2457, "loss": 0.32139068841934204, "lr": 5.0388941986760675e-06, "epoch": 2.2735042735042734, "percentage": 75.78, "elapsed_time": "1:46:45", "remaining_time": "0:34:06"} | |
| {"current_steps": 1864, "total_steps": 2457, "loss": 0.29253455996513367, "lr": 5.019756739050606e-06, "epoch": 2.275946275946276, "percentage": 75.86, "elapsed_time": "1:46:51", "remaining_time": "0:33:59"} | |
| {"current_steps": 1866, "total_steps": 2457, "loss": 0.39995700120925903, "lr": 5.000667574273821e-06, "epoch": 2.2783882783882783, "percentage": 75.95, "elapsed_time": "1:46:57", "remaining_time": "0:33:52"} | |
| {"current_steps": 1868, "total_steps": 2457, "loss": 0.45448631048202515, "lr": 4.981626858504718e-06, "epoch": 2.2808302808302807, "percentage": 76.03, "elapsed_time": "1:47:04", "remaining_time": "0:33:45"} | |
| {"current_steps": 1870, "total_steps": 2457, "loss": 0.42726626992225647, "lr": 4.962634745511027e-06, "epoch": 2.283272283272283, "percentage": 76.11, "elapsed_time": "1:47:12", "remaining_time": "0:33:39"} | |
| {"current_steps": 1872, "total_steps": 2457, "loss": 0.4752141237258911, "lr": 4.943691388667989e-06, "epoch": 2.2857142857142856, "percentage": 76.19, "elapsed_time": "1:47:18", "remaining_time": "0:33:32"} | |
| {"current_steps": 1874, "total_steps": 2457, "loss": 0.13898348808288574, "lr": 4.924796940957099e-06, "epoch": 2.288156288156288, "percentage": 76.27, "elapsed_time": "1:47:23", "remaining_time": "0:33:24"} | |
| {"current_steps": 1876, "total_steps": 2457, "loss": 0.6339101791381836, "lr": 4.905951554964876e-06, "epoch": 2.2905982905982905, "percentage": 76.35, "elapsed_time": "1:47:29", "remaining_time": "0:33:17"} | |
| {"current_steps": 1878, "total_steps": 2457, "loss": 0.347889244556427, "lr": 4.887155382881625e-06, "epoch": 2.293040293040293, "percentage": 76.43, "elapsed_time": "1:47:35", "remaining_time": "0:33:10"} | |
| {"current_steps": 1880, "total_steps": 2457, "loss": 0.340035080909729, "lr": 4.868408576500216e-06, "epoch": 2.2954822954822953, "percentage": 76.52, "elapsed_time": "1:47:41", "remaining_time": "0:33:03"} | |
| {"current_steps": 1882, "total_steps": 2457, "loss": 0.5293861031532288, "lr": 4.849711287214856e-06, "epoch": 2.2979242979242978, "percentage": 76.6, "elapsed_time": "1:47:47", "remaining_time": "0:32:56"} | |
| {"current_steps": 1884, "total_steps": 2457, "loss": 0.31249868869781494, "lr": 4.8310636660198616e-06, "epoch": 2.3003663003663, "percentage": 76.68, "elapsed_time": "1:47:54", "remaining_time": "0:32:49"} | |
| {"current_steps": 1886, "total_steps": 2457, "loss": 0.5040943026542664, "lr": 4.812465863508448e-06, "epoch": 2.3028083028083026, "percentage": 76.76, "elapsed_time": "1:48:01", "remaining_time": "0:32:42"} | |
| {"current_steps": 1888, "total_steps": 2457, "loss": 0.42627787590026855, "lr": 4.7939180298715055e-06, "epoch": 2.305250305250305, "percentage": 76.84, "elapsed_time": "1:48:07", "remaining_time": "0:32:35"} | |
| {"current_steps": 1890, "total_steps": 2457, "loss": 0.44656771421432495, "lr": 4.775420314896384e-06, "epoch": 2.3076923076923075, "percentage": 76.92, "elapsed_time": "1:48:13", "remaining_time": "0:32:28"} | |
| {"current_steps": 1892, "total_steps": 2457, "loss": 0.5736830830574036, "lr": 4.756972867965698e-06, "epoch": 2.31013431013431, "percentage": 77.0, "elapsed_time": "1:48:20", "remaining_time": "0:32:21"} | |
| {"current_steps": 1894, "total_steps": 2457, "loss": 0.4964962601661682, "lr": 4.738575838056104e-06, "epoch": 2.3125763125763124, "percentage": 77.09, "elapsed_time": "1:48:27", "remaining_time": "0:32:14"} | |
| {"current_steps": 1896, "total_steps": 2457, "loss": 0.4222361445426941, "lr": 4.7202293737371066e-06, "epoch": 2.315018315018315, "percentage": 77.17, "elapsed_time": "1:48:33", "remaining_time": "0:32:07"} | |
| {"current_steps": 1898, "total_steps": 2457, "loss": 0.5211227536201477, "lr": 4.7019336231698576e-06, "epoch": 2.317460317460317, "percentage": 77.25, "elapsed_time": "1:48:39", "remaining_time": "0:32:00"} | |
| {"current_steps": 1900, "total_steps": 2457, "loss": 0.8980540633201599, "lr": 4.6836887341059525e-06, "epoch": 2.3199023199023197, "percentage": 77.33, "elapsed_time": "1:48:45", "remaining_time": "0:31:53"} | |
| {"current_steps": 1902, "total_steps": 2457, "loss": 0.4475945234298706, "lr": 4.6654948538862475e-06, "epoch": 2.3223443223443225, "percentage": 77.41, "elapsed_time": "1:48:53", "remaining_time": "0:31:46"} | |
| {"current_steps": 1904, "total_steps": 2457, "loss": 0.251365065574646, "lr": 4.647352129439665e-06, "epoch": 2.324786324786325, "percentage": 77.49, "elapsed_time": "1:49:00", "remaining_time": "0:31:39"} | |
| {"current_steps": 1906, "total_steps": 2457, "loss": 0.190834641456604, "lr": 4.629260707282009e-06, "epoch": 2.3272283272283274, "percentage": 77.57, "elapsed_time": "1:49:06", "remaining_time": "0:31:32"} | |
| {"current_steps": 1908, "total_steps": 2457, "loss": 0.2842097878456116, "lr": 4.6112207335147704e-06, "epoch": 2.32967032967033, "percentage": 77.66, "elapsed_time": "1:49:12", "remaining_time": "0:31:25"} | |
| {"current_steps": 1910, "total_steps": 2457, "loss": 0.23184801638126373, "lr": 4.593232353823968e-06, "epoch": 2.3321123321123323, "percentage": 77.74, "elapsed_time": "1:49:17", "remaining_time": "0:31:17"} | |
| {"current_steps": 1912, "total_steps": 2457, "loss": 0.40144017338752747, "lr": 4.575295713478956e-06, "epoch": 2.3345543345543347, "percentage": 77.82, "elapsed_time": "1:49:23", "remaining_time": "0:31:10"} | |
| {"current_steps": 1914, "total_steps": 2457, "loss": 0.5639522075653076, "lr": 4.557410957331249e-06, "epoch": 2.336996336996337, "percentage": 77.9, "elapsed_time": "1:49:29", "remaining_time": "0:31:03"} | |
| {"current_steps": 1916, "total_steps": 2457, "loss": 0.636457622051239, "lr": 4.539578229813372e-06, "epoch": 2.3394383394383396, "percentage": 77.98, "elapsed_time": "1:49:37", "remaining_time": "0:30:57"} | |
| {"current_steps": 1918, "total_steps": 2457, "loss": 0.26978304982185364, "lr": 4.521797674937672e-06, "epoch": 2.341880341880342, "percentage": 78.06, "elapsed_time": "1:49:44", "remaining_time": "0:30:50"} | |
| {"current_steps": 1920, "total_steps": 2457, "loss": 0.3309711515903473, "lr": 4.5040694362951625e-06, "epoch": 2.3443223443223444, "percentage": 78.14, "elapsed_time": "1:49:49", "remaining_time": "0:30:42"} | |
| {"current_steps": 1922, "total_steps": 2457, "loss": 0.3379634618759155, "lr": 4.486393657054369e-06, "epoch": 2.346764346764347, "percentage": 78.23, "elapsed_time": "1:49:55", "remaining_time": "0:30:35"} | |
| {"current_steps": 1924, "total_steps": 2457, "loss": 0.2894682288169861, "lr": 4.468770479960171e-06, "epoch": 2.3492063492063493, "percentage": 78.31, "elapsed_time": "1:50:01", "remaining_time": "0:30:28"} | |
| {"current_steps": 1926, "total_steps": 2457, "loss": 0.44025763869285583, "lr": 4.451200047332638e-06, "epoch": 2.3516483516483517, "percentage": 78.39, "elapsed_time": "1:50:08", "remaining_time": "0:30:21"} | |
| {"current_steps": 1928, "total_steps": 2457, "loss": 0.3474840223789215, "lr": 4.433682501065897e-06, "epoch": 2.354090354090354, "percentage": 78.47, "elapsed_time": "1:50:14", "remaining_time": "0:30:14"} | |
| {"current_steps": 1930, "total_steps": 2457, "loss": 0.3358984589576721, "lr": 4.416217982626981e-06, "epoch": 2.3565323565323566, "percentage": 78.55, "elapsed_time": "1:50:20", "remaining_time": "0:30:07"} | |
| {"current_steps": 1932, "total_steps": 2457, "loss": 0.3395053446292877, "lr": 4.398806633054675e-06, "epoch": 2.358974358974359, "percentage": 78.63, "elapsed_time": "1:50:27", "remaining_time": "0:30:00"} | |
| {"current_steps": 1934, "total_steps": 2457, "loss": 0.5439938902854919, "lr": 4.381448592958394e-06, "epoch": 2.3614163614163615, "percentage": 78.71, "elapsed_time": "1:50:33", "remaining_time": "0:29:53"} | |
| {"current_steps": 1936, "total_steps": 2457, "loss": 0.2674437463283539, "lr": 4.36414400251704e-06, "epoch": 2.363858363858364, "percentage": 78.8, "elapsed_time": "1:50:39", "remaining_time": "0:29:46"} | |
| {"current_steps": 1938, "total_steps": 2457, "loss": 0.4141199290752411, "lr": 4.346893001477861e-06, "epoch": 2.3663003663003663, "percentage": 78.88, "elapsed_time": "1:50:46", "remaining_time": "0:29:40"} | |
| {"current_steps": 1940, "total_steps": 2457, "loss": 0.5360310673713684, "lr": 4.329695729155342e-06, "epoch": 2.3687423687423688, "percentage": 78.96, "elapsed_time": "1:50:52", "remaining_time": "0:29:32"} | |
| {"current_steps": 1942, "total_steps": 2457, "loss": 0.25111788511276245, "lr": 4.3125523244300686e-06, "epoch": 2.371184371184371, "percentage": 79.04, "elapsed_time": "1:50:57", "remaining_time": "0:29:25"} | |
| {"current_steps": 1944, "total_steps": 2457, "loss": 0.3430798351764679, "lr": 4.295462925747594e-06, "epoch": 2.3736263736263736, "percentage": 79.12, "elapsed_time": "1:51:02", "remaining_time": "0:29:18"} | |
| {"current_steps": 1946, "total_steps": 2457, "loss": 0.08609216660261154, "lr": 4.278427671117344e-06, "epoch": 2.376068376068376, "percentage": 79.2, "elapsed_time": "1:51:08", "remaining_time": "0:29:11"} | |
| {"current_steps": 1948, "total_steps": 2457, "loss": 0.194163978099823, "lr": 4.261446698111496e-06, "epoch": 2.3785103785103785, "percentage": 79.28, "elapsed_time": "1:51:14", "remaining_time": "0:29:04"} | |
| {"current_steps": 1950, "total_steps": 2457, "loss": 0.20009776949882507, "lr": 4.24452014386385e-06, "epoch": 2.380952380952381, "percentage": 79.37, "elapsed_time": "1:51:21", "remaining_time": "0:28:57"} | |
| {"current_steps": 1952, "total_steps": 2457, "loss": 0.12069036066532135, "lr": 4.22764814506874e-06, "epoch": 2.3833943833943834, "percentage": 79.45, "elapsed_time": "1:51:27", "remaining_time": "0:28:50"} | |
| {"current_steps": 1954, "total_steps": 2457, "loss": 0.35760805010795593, "lr": 4.210830837979932e-06, "epoch": 2.385836385836386, "percentage": 79.53, "elapsed_time": "1:51:33", "remaining_time": "0:28:43"} | |
| {"current_steps": 1956, "total_steps": 2457, "loss": 0.48620444536209106, "lr": 4.194068358409503e-06, "epoch": 2.3882783882783882, "percentage": 79.61, "elapsed_time": "1:51:39", "remaining_time": "0:28:35"} | |
| {"current_steps": 1958, "total_steps": 2457, "loss": 0.20889446139335632, "lr": 4.17736084172677e-06, "epoch": 2.3907203907203907, "percentage": 79.69, "elapsed_time": "1:51:44", "remaining_time": "0:28:28"} | |
| {"current_steps": 1960, "total_steps": 2457, "loss": 0.5993058085441589, "lr": 4.160708422857178e-06, "epoch": 2.393162393162393, "percentage": 79.77, "elapsed_time": "1:51:51", "remaining_time": "0:28:21"} | |
| {"current_steps": 1962, "total_steps": 2457, "loss": 0.1960648149251938, "lr": 4.144111236281214e-06, "epoch": 2.3956043956043955, "percentage": 79.85, "elapsed_time": "1:51:58", "remaining_time": "0:28:15"} | |
| {"current_steps": 1964, "total_steps": 2457, "loss": 0.5698574185371399, "lr": 4.127569416033332e-06, "epoch": 2.398046398046398, "percentage": 79.93, "elapsed_time": "1:52:05", "remaining_time": "0:28:08"} | |
| {"current_steps": 1966, "total_steps": 2457, "loss": 0.18890273571014404, "lr": 4.111083095700858e-06, "epoch": 2.4004884004884004, "percentage": 80.02, "elapsed_time": "1:52:12", "remaining_time": "0:28:01"} | |
| {"current_steps": 1968, "total_steps": 2457, "loss": 0.3097396492958069, "lr": 4.094652408422913e-06, "epoch": 2.402930402930403, "percentage": 80.1, "elapsed_time": "1:52:18", "remaining_time": "0:27:54"} | |
| {"current_steps": 1970, "total_steps": 2457, "loss": 0.23327361047267914, "lr": 4.078277486889341e-06, "epoch": 2.4053724053724053, "percentage": 80.18, "elapsed_time": "1:52:24", "remaining_time": "0:27:47"} | |
| {"current_steps": 1972, "total_steps": 2457, "loss": 0.06529633700847626, "lr": 4.061958463339646e-06, "epoch": 2.4078144078144077, "percentage": 80.26, "elapsed_time": "1:52:30", "remaining_time": "0:27:40"} | |
| {"current_steps": 1974, "total_steps": 2457, "loss": 0.08752602338790894, "lr": 4.045695469561899e-06, "epoch": 2.41025641025641, "percentage": 80.34, "elapsed_time": "1:52:36", "remaining_time": "0:27:33"} | |
| {"current_steps": 1976, "total_steps": 2457, "loss": 0.3558381199836731, "lr": 4.029488636891702e-06, "epoch": 2.4126984126984126, "percentage": 80.42, "elapsed_time": "1:52:42", "remaining_time": "0:27:26"} | |
| {"current_steps": 1978, "total_steps": 2457, "loss": 0.3303931653499603, "lr": 4.013338096211109e-06, "epoch": 2.415140415140415, "percentage": 80.5, "elapsed_time": "1:52:49", "remaining_time": "0:27:19"} | |
| {"current_steps": 1980, "total_steps": 2457, "loss": 0.22131627798080444, "lr": 3.99724397794758e-06, "epoch": 2.4175824175824174, "percentage": 80.59, "elapsed_time": "1:52:55", "remaining_time": "0:27:12"} | |
| {"current_steps": 1982, "total_steps": 2457, "loss": 0.39478451013565063, "lr": 3.981206412072914e-06, "epoch": 2.42002442002442, "percentage": 80.67, "elapsed_time": "1:53:01", "remaining_time": "0:27:05"} | |
| {"current_steps": 1984, "total_steps": 2457, "loss": 0.3109724521636963, "lr": 3.965225528102217e-06, "epoch": 2.4224664224664223, "percentage": 80.75, "elapsed_time": "1:53:07", "remaining_time": "0:26:58"} | |
| {"current_steps": 1986, "total_steps": 2457, "loss": 0.5224888920783997, "lr": 3.949301455092845e-06, "epoch": 2.4249084249084247, "percentage": 80.83, "elapsed_time": "1:53:14", "remaining_time": "0:26:51"} | |
| {"current_steps": 1988, "total_steps": 2457, "loss": 0.4845066964626312, "lr": 3.933434321643356e-06, "epoch": 2.427350427350427, "percentage": 80.91, "elapsed_time": "1:53:20", "remaining_time": "0:26:44"} | |
| {"current_steps": 1990, "total_steps": 2457, "loss": 0.5302805304527283, "lr": 3.917624255892489e-06, "epoch": 2.42979242979243, "percentage": 80.99, "elapsed_time": "1:53:27", "remaining_time": "0:26:37"} | |
| {"current_steps": 1992, "total_steps": 2457, "loss": 0.42821258306503296, "lr": 3.901871385518117e-06, "epoch": 2.4322344322344325, "percentage": 81.07, "elapsed_time": "1:53:33", "remaining_time": "0:26:30"} | |
| {"current_steps": 1994, "total_steps": 2457, "loss": 0.4940814673900604, "lr": 3.886175837736214e-06, "epoch": 2.434676434676435, "percentage": 81.16, "elapsed_time": "1:53:40", "remaining_time": "0:26:23"} | |
| {"current_steps": 1996, "total_steps": 2457, "loss": 0.3047824501991272, "lr": 3.870537739299836e-06, "epoch": 2.4371184371184373, "percentage": 81.24, "elapsed_time": "1:53:47", "remaining_time": "0:26:16"} | |
| {"current_steps": 1998, "total_steps": 2457, "loss": 0.5371643900871277, "lr": 3.854957216498099e-06, "epoch": 2.4395604395604398, "percentage": 81.32, "elapsed_time": "1:53:53", "remaining_time": "0:26:09"} | |
| {"current_steps": 2000, "total_steps": 2457, "loss": 0.24889859557151794, "lr": 3.839434395155135e-06, "epoch": 2.442002442002442, "percentage": 81.4, "elapsed_time": "1:53:59", "remaining_time": "0:26:02"} | |
| {"current_steps": 2002, "total_steps": 2457, "loss": 0.45958831906318665, "lr": 3.8239694006291194e-06, "epoch": 2.4444444444444446, "percentage": 81.48, "elapsed_time": "1:54:06", "remaining_time": "0:25:55"} | |
| {"current_steps": 2004, "total_steps": 2457, "loss": 0.22220918536186218, "lr": 3.8085623578112136e-06, "epoch": 2.446886446886447, "percentage": 81.56, "elapsed_time": "1:54:11", "remaining_time": "0:25:48"} | |
| {"current_steps": 2006, "total_steps": 2457, "loss": 0.29667913913726807, "lr": 3.793213391124586e-06, "epoch": 2.4493284493284495, "percentage": 81.64, "elapsed_time": "1:54:17", "remaining_time": "0:25:41"} | |
| {"current_steps": 2008, "total_steps": 2457, "loss": 0.7430405616760254, "lr": 3.7779226245233937e-06, "epoch": 2.451770451770452, "percentage": 81.73, "elapsed_time": "1:54:23", "remaining_time": "0:25:34"} | |
| {"current_steps": 2010, "total_steps": 2457, "loss": 0.3536508083343506, "lr": 3.7626901814917927e-06, "epoch": 2.4542124542124544, "percentage": 81.81, "elapsed_time": "1:54:30", "remaining_time": "0:25:27"} | |
| {"current_steps": 2012, "total_steps": 2457, "loss": 0.2591190040111542, "lr": 3.747516185042922e-06, "epoch": 2.456654456654457, "percentage": 81.89, "elapsed_time": "1:54:36", "remaining_time": "0:25:20"} | |
| {"current_steps": 2014, "total_steps": 2457, "loss": 0.5008297562599182, "lr": 3.7324007577179283e-06, "epoch": 2.4590964590964592, "percentage": 81.97, "elapsed_time": "1:54:42", "remaining_time": "0:25:13"} | |
| {"current_steps": 2016, "total_steps": 2457, "loss": 0.4963090121746063, "lr": 3.7173440215849744e-06, "epoch": 2.4615384615384617, "percentage": 82.05, "elapsed_time": "1:54:49", "remaining_time": "0:25:07"} | |
| {"current_steps": 2018, "total_steps": 2457, "loss": 0.5157759189605713, "lr": 3.7023460982382355e-06, "epoch": 2.463980463980464, "percentage": 82.13, "elapsed_time": "1:54:55", "remaining_time": "0:24:59"} | |
| {"current_steps": 2020, "total_steps": 2457, "loss": 0.4686001241207123, "lr": 3.687407108796942e-06, "epoch": 2.4664224664224665, "percentage": 82.21, "elapsed_time": "1:55:02", "remaining_time": "0:24:53"} | |
| {"current_steps": 2022, "total_steps": 2457, "loss": 0.25978168845176697, "lr": 3.672527173904388e-06, "epoch": 2.468864468864469, "percentage": 82.3, "elapsed_time": "1:55:07", "remaining_time": "0:24:46"} | |
| {"current_steps": 2024, "total_steps": 2457, "loss": 0.3640308380126953, "lr": 3.6577064137269525e-06, "epoch": 2.4713064713064714, "percentage": 82.38, "elapsed_time": "1:55:14", "remaining_time": "0:24:39"} | |
| {"current_steps": 2026, "total_steps": 2457, "loss": 0.3720964193344116, "lr": 3.6429449479531416e-06, "epoch": 2.473748473748474, "percentage": 82.46, "elapsed_time": "1:55:22", "remaining_time": "0:24:32"} | |
| {"current_steps": 2028, "total_steps": 2457, "loss": 0.2083432972431183, "lr": 3.6282428957926154e-06, "epoch": 2.4761904761904763, "percentage": 82.54, "elapsed_time": "1:55:27", "remaining_time": "0:24:25"} | |
| {"current_steps": 2030, "total_steps": 2457, "loss": 0.5114956498146057, "lr": 3.613600375975221e-06, "epoch": 2.4786324786324787, "percentage": 82.62, "elapsed_time": "1:55:33", "remaining_time": "0:24:18"} | |
| {"current_steps": 2032, "total_steps": 2457, "loss": 0.47537893056869507, "lr": 3.599017506750042e-06, "epoch": 2.481074481074481, "percentage": 82.7, "elapsed_time": "1:55:40", "remaining_time": "0:24:11"} | |
| {"current_steps": 2034, "total_steps": 2457, "loss": 0.25453007221221924, "lr": 3.5844944058844393e-06, "epoch": 2.4835164835164836, "percentage": 82.78, "elapsed_time": "1:55:46", "remaining_time": "0:24:04"} | |
| {"current_steps": 2036, "total_steps": 2457, "loss": 0.5005137920379639, "lr": 3.570031190663098e-06, "epoch": 2.485958485958486, "percentage": 82.87, "elapsed_time": "1:55:52", "remaining_time": "0:23:57"} | |
| {"current_steps": 2038, "total_steps": 2457, "loss": 0.5193389058113098, "lr": 3.5556279778870862e-06, "epoch": 2.4884004884004884, "percentage": 82.95, "elapsed_time": "1:55:59", "remaining_time": "0:23:50"} | |
| {"current_steps": 2040, "total_steps": 2457, "loss": 0.5654491782188416, "lr": 3.5412848838729075e-06, "epoch": 2.490842490842491, "percentage": 83.03, "elapsed_time": "1:56:05", "remaining_time": "0:23:43"} | |
| {"current_steps": 2042, "total_steps": 2457, "loss": 0.5325220227241516, "lr": 3.5270020244515583e-06, "epoch": 2.4932844932844933, "percentage": 83.11, "elapsed_time": "1:56:12", "remaining_time": "0:23:36"} | |
| {"current_steps": 2044, "total_steps": 2457, "loss": 0.38437139987945557, "lr": 3.5127795149676014e-06, "epoch": 2.4957264957264957, "percentage": 83.19, "elapsed_time": "1:56:19", "remaining_time": "0:23:30"} | |
| {"current_steps": 2046, "total_steps": 2457, "loss": 0.2638123035430908, "lr": 3.49861747027823e-06, "epoch": 2.498168498168498, "percentage": 83.27, "elapsed_time": "1:56:24", "remaining_time": "0:23:23"} | |
| {"current_steps": 2048, "total_steps": 2457, "loss": 0.4149170219898224, "lr": 3.484516004752334e-06, "epoch": 2.5006105006105006, "percentage": 83.35, "elapsed_time": "1:56:30", "remaining_time": "0:23:16"} | |
| {"current_steps": 2050, "total_steps": 2457, "loss": 0.4781511425971985, "lr": 3.4704752322695877e-06, "epoch": 2.503052503052503, "percentage": 83.44, "elapsed_time": "1:56:36", "remaining_time": "0:23:09"} | |
| {"current_steps": 2052, "total_steps": 2457, "loss": 0.7653157711029053, "lr": 3.456495266219525e-06, "epoch": 2.5054945054945055, "percentage": 83.52, "elapsed_time": "1:56:42", "remaining_time": "0:23:02"} | |
| {"current_steps": 2054, "total_steps": 2457, "loss": 0.36611488461494446, "lr": 3.442576219500614e-06, "epoch": 2.507936507936508, "percentage": 83.6, "elapsed_time": "1:56:49", "remaining_time": "0:22:55"} | |
| {"current_steps": 2056, "total_steps": 2457, "loss": 0.531693696975708, "lr": 3.428718204519369e-06, "epoch": 2.5103785103785103, "percentage": 83.68, "elapsed_time": "1:56:57", "remaining_time": "0:22:48"} | |
| {"current_steps": 2058, "total_steps": 2457, "loss": 0.18801343441009521, "lr": 3.4149213331894193e-06, "epoch": 2.5128205128205128, "percentage": 83.76, "elapsed_time": "1:57:03", "remaining_time": "0:22:41"} | |
| {"current_steps": 2060, "total_steps": 2457, "loss": 0.16657070815563202, "lr": 3.4011857169306127e-06, "epoch": 2.515262515262515, "percentage": 83.84, "elapsed_time": "1:57:09", "remaining_time": "0:22:34"} | |
| {"current_steps": 2062, "total_steps": 2457, "loss": 0.2420540601015091, "lr": 3.3875114666681235e-06, "epoch": 2.5177045177045176, "percentage": 83.92, "elapsed_time": "1:57:14", "remaining_time": "0:22:27"} | |
| {"current_steps": 2064, "total_steps": 2457, "loss": 0.4269709587097168, "lr": 3.3738986928315474e-06, "epoch": 2.52014652014652, "percentage": 84.0, "elapsed_time": "1:57:20", "remaining_time": "0:22:20"} | |
| {"current_steps": 2066, "total_steps": 2457, "loss": 0.3732086420059204, "lr": 3.360347505354011e-06, "epoch": 2.5225885225885225, "percentage": 84.09, "elapsed_time": "1:57:27", "remaining_time": "0:22:13"} | |
| {"current_steps": 2068, "total_steps": 2457, "loss": 0.5551900863647461, "lr": 3.3468580136712903e-06, "epoch": 2.525030525030525, "percentage": 84.17, "elapsed_time": "1:57:33", "remaining_time": "0:22:06"} | |
| {"current_steps": 2070, "total_steps": 2457, "loss": 0.5004504919052124, "lr": 3.333430326720921e-06, "epoch": 2.5274725274725274, "percentage": 84.25, "elapsed_time": "1:57:39", "remaining_time": "0:21:59"} | |
| {"current_steps": 2072, "total_steps": 2457, "loss": 0.31844204664230347, "lr": 3.3200645529413165e-06, "epoch": 2.52991452991453, "percentage": 84.33, "elapsed_time": "1:57:46", "remaining_time": "0:21:53"} | |
| {"current_steps": 2074, "total_steps": 2457, "loss": 0.592690646648407, "lr": 3.3067608002709006e-06, "epoch": 2.5323565323565322, "percentage": 84.41, "elapsed_time": "1:57:53", "remaining_time": "0:21:46"} | |
| {"current_steps": 2076, "total_steps": 2457, "loss": 0.509267270565033, "lr": 3.2935191761472313e-06, "epoch": 2.5347985347985347, "percentage": 84.49, "elapsed_time": "1:57:59", "remaining_time": "0:21:39"} | |
| {"current_steps": 2078, "total_steps": 2457, "loss": 0.4890163540840149, "lr": 3.280339787506127e-06, "epoch": 2.537240537240537, "percentage": 84.57, "elapsed_time": "1:58:06", "remaining_time": "0:21:32"} | |
| {"current_steps": 2080, "total_steps": 2457, "loss": 0.35127052664756775, "lr": 3.2672227407808184e-06, "epoch": 2.5396825396825395, "percentage": 84.66, "elapsed_time": "1:58:13", "remaining_time": "0:21:25"} | |
| {"current_steps": 2082, "total_steps": 2457, "loss": 0.4693216383457184, "lr": 3.2541681419010716e-06, "epoch": 2.542124542124542, "percentage": 84.74, "elapsed_time": "1:58:19", "remaining_time": "0:21:18"} | |
| {"current_steps": 2084, "total_steps": 2457, "loss": 0.47572940587997437, "lr": 3.2411760962923434e-06, "epoch": 2.5445665445665444, "percentage": 84.82, "elapsed_time": "1:58:25", "remaining_time": "0:21:11"} | |
| {"current_steps": 2086, "total_steps": 2457, "loss": 0.45491641759872437, "lr": 3.228246708874926e-06, "epoch": 2.547008547008547, "percentage": 84.9, "elapsed_time": "1:58:32", "remaining_time": "0:21:05"} | |
| {"current_steps": 2088, "total_steps": 2457, "loss": 0.6177046298980713, "lr": 3.2153800840631043e-06, "epoch": 2.5494505494505493, "percentage": 84.98, "elapsed_time": "1:58:39", "remaining_time": "0:20:58"} | |
| {"current_steps": 2090, "total_steps": 2457, "loss": 0.45679447054862976, "lr": 3.202576325764307e-06, "epoch": 2.5518925518925517, "percentage": 85.06, "elapsed_time": "1:58:45", "remaining_time": "0:20:51"} | |
| {"current_steps": 2092, "total_steps": 2457, "loss": 0.3028113842010498, "lr": 3.1898355373782663e-06, "epoch": 2.554334554334554, "percentage": 85.14, "elapsed_time": "1:58:51", "remaining_time": "0:20:44"} | |
| {"current_steps": 2094, "total_steps": 2457, "loss": 0.2570323646068573, "lr": 3.177157821796191e-06, "epoch": 2.5567765567765566, "percentage": 85.23, "elapsed_time": "1:58:57", "remaining_time": "0:20:37"} | |
| {"current_steps": 2096, "total_steps": 2457, "loss": 0.3652976155281067, "lr": 3.1645432813999306e-06, "epoch": 2.559218559218559, "percentage": 85.31, "elapsed_time": "1:59:03", "remaining_time": "0:20:30"} | |
| {"current_steps": 2098, "total_steps": 2457, "loss": 0.08200995624065399, "lr": 3.1519920180611436e-06, "epoch": 2.5616605616605614, "percentage": 85.39, "elapsed_time": "1:59:08", "remaining_time": "0:20:23"} | |
| {"current_steps": 2100, "total_steps": 2457, "loss": 0.26613810658454895, "lr": 3.139504133140484e-06, "epoch": 2.564102564102564, "percentage": 85.47, "elapsed_time": "1:59:15", "remaining_time": "0:20:16"} | |
| {"current_steps": 2102, "total_steps": 2457, "loss": 0.39854198694229126, "lr": 3.127079727486781e-06, "epoch": 2.5665445665445663, "percentage": 85.55, "elapsed_time": "1:59:22", "remaining_time": "0:20:09"} | |
| {"current_steps": 2104, "total_steps": 2457, "loss": 0.35459813475608826, "lr": 3.114718901436215e-06, "epoch": 2.5689865689865687, "percentage": 85.63, "elapsed_time": "1:59:28", "remaining_time": "0:20:02"} | |
| {"current_steps": 2106, "total_steps": 2457, "loss": 0.3210771977901459, "lr": 3.1024217548115195e-06, "epoch": 2.571428571428571, "percentage": 85.71, "elapsed_time": "1:59:34", "remaining_time": "0:19:55"} | |
| {"current_steps": 2108, "total_steps": 2457, "loss": 0.24245740473270416, "lr": 3.090188386921171e-06, "epoch": 2.5738705738705736, "percentage": 85.8, "elapsed_time": "1:59:40", "remaining_time": "0:19:48"} | |
| {"current_steps": 2110, "total_steps": 2457, "loss": 0.21324002742767334, "lr": 3.078018896558582e-06, "epoch": 2.576312576312576, "percentage": 85.88, "elapsed_time": "1:59:46", "remaining_time": "0:19:41"} | |
| {"current_steps": 2112, "total_steps": 2457, "loss": 0.469443142414093, "lr": 3.0659133820013123e-06, "epoch": 2.578754578754579, "percentage": 85.96, "elapsed_time": "1:59:52", "remaining_time": "0:19:34"} | |
| {"current_steps": 2114, "total_steps": 2457, "loss": 0.16458410024642944, "lr": 3.0538719410102612e-06, "epoch": 2.5811965811965814, "percentage": 86.04, "elapsed_time": "1:59:58", "remaining_time": "0:19:27"} | |
| {"current_steps": 2116, "total_steps": 2457, "loss": 0.3730916976928711, "lr": 3.0418946708288984e-06, "epoch": 2.583638583638584, "percentage": 86.12, "elapsed_time": "2:00:05", "remaining_time": "0:19:21"} | |
| {"current_steps": 2118, "total_steps": 2457, "loss": 0.5398478507995605, "lr": 3.029981668182458e-06, "epoch": 2.586080586080586, "percentage": 86.2, "elapsed_time": "2:00:12", "remaining_time": "0:19:14"} | |
| {"current_steps": 2120, "total_steps": 2457, "loss": 0.25115227699279785, "lr": 3.0181330292771727e-06, "epoch": 2.5885225885225887, "percentage": 86.28, "elapsed_time": "2:00:18", "remaining_time": "0:19:07"} | |
| {"current_steps": 2122, "total_steps": 2457, "loss": 0.6454752087593079, "lr": 3.0063488497994864e-06, "epoch": 2.590964590964591, "percentage": 86.37, "elapsed_time": "2:00:24", "remaining_time": "0:19:00"} | |
| {"current_steps": 2124, "total_steps": 2457, "loss": 0.30809617042541504, "lr": 2.994629224915288e-06, "epoch": 2.5934065934065935, "percentage": 86.45, "elapsed_time": "2:00:30", "remaining_time": "0:18:53"} | |
| {"current_steps": 2126, "total_steps": 2457, "loss": 0.1984136551618576, "lr": 2.9829742492691436e-06, "epoch": 2.595848595848596, "percentage": 86.53, "elapsed_time": "2:00:36", "remaining_time": "0:18:46"} | |
| {"current_steps": 2128, "total_steps": 2457, "loss": 0.4299178123474121, "lr": 2.971384016983522e-06, "epoch": 2.5982905982905984, "percentage": 86.61, "elapsed_time": "2:00:42", "remaining_time": "0:18:39"} | |
| {"current_steps": 2130, "total_steps": 2457, "loss": 0.2969256043434143, "lr": 2.959858621658047e-06, "epoch": 2.600732600732601, "percentage": 86.69, "elapsed_time": "2:00:49", "remaining_time": "0:18:32"} | |
| {"current_steps": 2132, "total_steps": 2457, "loss": 0.2652299702167511, "lr": 2.94839815636874e-06, "epoch": 2.6031746031746033, "percentage": 86.77, "elapsed_time": "2:00:55", "remaining_time": "0:18:26"} | |
| {"current_steps": 2134, "total_steps": 2457, "loss": 0.34369128942489624, "lr": 2.9370027136672536e-06, "epoch": 2.6056166056166057, "percentage": 86.85, "elapsed_time": "2:01:02", "remaining_time": "0:18:19"} | |
| {"current_steps": 2136, "total_steps": 2457, "loss": 0.30307111144065857, "lr": 2.925672385580145e-06, "epoch": 2.608058608058608, "percentage": 86.94, "elapsed_time": "2:01:08", "remaining_time": "0:18:12"} | |
| {"current_steps": 2138, "total_steps": 2457, "loss": 0.2503519058227539, "lr": 2.9144072636081233e-06, "epoch": 2.6105006105006106, "percentage": 87.02, "elapsed_time": "2:01:13", "remaining_time": "0:18:05"} | |
| {"current_steps": 2140, "total_steps": 2457, "loss": 0.25583434104919434, "lr": 2.9032074387253017e-06, "epoch": 2.612942612942613, "percentage": 87.1, "elapsed_time": "2:01:19", "remaining_time": "0:17:58"} | |
| {"current_steps": 2142, "total_steps": 2457, "loss": 0.3618330955505371, "lr": 2.892073001378481e-06, "epoch": 2.6153846153846154, "percentage": 87.18, "elapsed_time": "2:01:26", "remaining_time": "0:17:51"} | |
| {"current_steps": 2144, "total_steps": 2457, "loss": 0.4887958765029907, "lr": 2.881004041486406e-06, "epoch": 2.617826617826618, "percentage": 87.26, "elapsed_time": "2:01:33", "remaining_time": "0:17:44"} | |
| {"current_steps": 2146, "total_steps": 2457, "loss": 0.46932682394981384, "lr": 2.8700006484390395e-06, "epoch": 2.6202686202686203, "percentage": 87.34, "elapsed_time": "2:01:40", "remaining_time": "0:17:38"} | |
| {"current_steps": 2148, "total_steps": 2457, "loss": 0.3209373652935028, "lr": 2.8590629110968503e-06, "epoch": 2.6227106227106227, "percentage": 87.42, "elapsed_time": "2:01:47", "remaining_time": "0:17:31"} | |
| {"current_steps": 2150, "total_steps": 2457, "loss": 0.468944787979126, "lr": 2.8481909177900874e-06, "epoch": 2.625152625152625, "percentage": 87.51, "elapsed_time": "2:01:53", "remaining_time": "0:17:24"} | |
| {"current_steps": 2152, "total_steps": 2457, "loss": 0.439802885055542, "lr": 2.837384756318063e-06, "epoch": 2.6275946275946276, "percentage": 87.59, "elapsed_time": "2:01:58", "remaining_time": "0:17:17"} | |
| {"current_steps": 2154, "total_steps": 2457, "loss": 0.48533153533935547, "lr": 2.826644513948456e-06, "epoch": 2.63003663003663, "percentage": 87.67, "elapsed_time": "2:02:04", "remaining_time": "0:17:10"} | |
| {"current_steps": 2156, "total_steps": 2457, "loss": 0.5256586670875549, "lr": 2.8159702774166e-06, "epoch": 2.6324786324786325, "percentage": 87.75, "elapsed_time": "2:02:11", "remaining_time": "0:17:03"} | |
| {"current_steps": 2158, "total_steps": 2457, "loss": 0.5299547910690308, "lr": 2.8053621329247767e-06, "epoch": 2.634920634920635, "percentage": 87.83, "elapsed_time": "2:02:17", "remaining_time": "0:16:56"} | |
| {"current_steps": 2160, "total_steps": 2457, "loss": 0.2885707914829254, "lr": 2.7948201661415307e-06, "epoch": 2.6373626373626373, "percentage": 87.91, "elapsed_time": "2:02:23", "remaining_time": "0:16:49"} | |
| {"current_steps": 2162, "total_steps": 2457, "loss": 0.34332627058029175, "lr": 2.7843444622009746e-06, "epoch": 2.6398046398046398, "percentage": 87.99, "elapsed_time": "2:02:30", "remaining_time": "0:16:42"} | |
| {"current_steps": 2164, "total_steps": 2457, "loss": 0.3300524652004242, "lr": 2.773935105702096e-06, "epoch": 2.642246642246642, "percentage": 88.07, "elapsed_time": "2:02:36", "remaining_time": "0:16:36"} | |
| {"current_steps": 2166, "total_steps": 2457, "loss": 0.4990626871585846, "lr": 2.763592180708081e-06, "epoch": 2.6446886446886446, "percentage": 88.16, "elapsed_time": "2:02:42", "remaining_time": "0:16:29"} | |
| {"current_steps": 2168, "total_steps": 2457, "loss": 0.42835402488708496, "lr": 2.7533157707456336e-06, "epoch": 2.647130647130647, "percentage": 88.24, "elapsed_time": "2:02:48", "remaining_time": "0:16:22"} | |
| {"current_steps": 2170, "total_steps": 2457, "loss": 0.504192590713501, "lr": 2.7431059588042945e-06, "epoch": 2.6495726495726495, "percentage": 88.32, "elapsed_time": "2:02:55", "remaining_time": "0:16:15"} | |
| {"current_steps": 2172, "total_steps": 2457, "loss": 0.5846405029296875, "lr": 2.7329628273357815e-06, "epoch": 2.652014652014652, "percentage": 88.4, "elapsed_time": "2:03:01", "remaining_time": "0:16:08"} | |
| {"current_steps": 2174, "total_steps": 2457, "loss": 0.4775027632713318, "lr": 2.72288645825332e-06, "epoch": 2.6544566544566544, "percentage": 88.48, "elapsed_time": "2:03:07", "remaining_time": "0:16:01"} | |
| {"current_steps": 2176, "total_steps": 2457, "loss": 0.2678804397583008, "lr": 2.7128769329309744e-06, "epoch": 2.656898656898657, "percentage": 88.56, "elapsed_time": "2:03:13", "remaining_time": "0:15:54"} | |
| {"current_steps": 2178, "total_steps": 2457, "loss": 0.4422096908092499, "lr": 2.702934332203002e-06, "epoch": 2.659340659340659, "percentage": 88.64, "elapsed_time": "2:03:20", "remaining_time": "0:15:48"} | |
| {"current_steps": 2180, "total_steps": 2457, "loss": 0.4233754575252533, "lr": 2.6930587363631932e-06, "epoch": 2.6617826617826617, "percentage": 88.73, "elapsed_time": "2:03:27", "remaining_time": "0:15:41"} | |
| {"current_steps": 2182, "total_steps": 2457, "loss": 0.40418240427970886, "lr": 2.6832502251642223e-06, "epoch": 2.664224664224664, "percentage": 88.81, "elapsed_time": "2:03:33", "remaining_time": "0:15:34"} | |
| {"current_steps": 2184, "total_steps": 2457, "loss": 0.2588379979133606, "lr": 2.6735088778170105e-06, "epoch": 2.6666666666666665, "percentage": 88.89, "elapsed_time": "2:03:39", "remaining_time": "0:15:27"} | |
| {"current_steps": 2186, "total_steps": 2457, "loss": 0.39823517203330994, "lr": 2.66383477299008e-06, "epoch": 2.669108669108669, "percentage": 88.97, "elapsed_time": "2:03:46", "remaining_time": "0:15:20"} | |
| {"current_steps": 2188, "total_steps": 2457, "loss": 0.3795110881328583, "lr": 2.6542279888089163e-06, "epoch": 2.6715506715506714, "percentage": 89.05, "elapsed_time": "2:03:51", "remaining_time": "0:15:13"} | |
| {"current_steps": 2190, "total_steps": 2457, "loss": 0.5400364995002747, "lr": 2.6446886028553476e-06, "epoch": 2.6739926739926743, "percentage": 89.13, "elapsed_time": "2:03:58", "remaining_time": "0:15:06"} | |
| {"current_steps": 2192, "total_steps": 2457, "loss": 0.5039065480232239, "lr": 2.6352166921669076e-06, "epoch": 2.6764346764346767, "percentage": 89.21, "elapsed_time": "2:04:06", "remaining_time": "0:15:00"} | |
| {"current_steps": 2194, "total_steps": 2457, "loss": 0.13939893245697021, "lr": 2.625812333236222e-06, "epoch": 2.678876678876679, "percentage": 89.3, "elapsed_time": "2:04:12", "remaining_time": "0:14:53"} | |
| {"current_steps": 2196, "total_steps": 2457, "loss": 0.33114153146743774, "lr": 2.61647560201038e-06, "epoch": 2.6813186813186816, "percentage": 89.38, "elapsed_time": "2:04:19", "remaining_time": "0:14:46"} | |
| {"current_steps": 2198, "total_steps": 2457, "loss": 0.521342396736145, "lr": 2.6072065738903335e-06, "epoch": 2.683760683760684, "percentage": 89.46, "elapsed_time": "2:04:25", "remaining_time": "0:14:39"} | |
| {"current_steps": 2200, "total_steps": 2457, "loss": 0.4681139588356018, "lr": 2.5980053237302816e-06, "epoch": 2.6862026862026864, "percentage": 89.54, "elapsed_time": "2:04:32", "remaining_time": "0:14:32"} | |
| {"current_steps": 2202, "total_steps": 2457, "loss": 0.28020548820495605, "lr": 2.588871925837062e-06, "epoch": 2.688644688644689, "percentage": 89.62, "elapsed_time": "2:04:39", "remaining_time": "0:14:26"} | |
| {"current_steps": 2204, "total_steps": 2457, "loss": 0.5311964750289917, "lr": 2.5798064539695604e-06, "epoch": 2.6910866910866913, "percentage": 89.7, "elapsed_time": "2:04:44", "remaining_time": "0:14:19"} | |
| {"current_steps": 2206, "total_steps": 2457, "loss": 0.12289441376924515, "lr": 2.5708089813381088e-06, "epoch": 2.6935286935286937, "percentage": 89.78, "elapsed_time": "2:04:50", "remaining_time": "0:14:12"} | |
| {"current_steps": 2208, "total_steps": 2457, "loss": 0.47109082341194153, "lr": 2.561879580603893e-06, "epoch": 2.695970695970696, "percentage": 89.87, "elapsed_time": "2:04:57", "remaining_time": "0:14:05"} | |
| {"current_steps": 2210, "total_steps": 2457, "loss": 0.3485221564769745, "lr": 2.5530183238783728e-06, "epoch": 2.6984126984126986, "percentage": 89.95, "elapsed_time": "2:05:03", "remaining_time": "0:13:58"} | |
| {"current_steps": 2212, "total_steps": 2457, "loss": 0.5045080184936523, "lr": 2.5442252827226925e-06, "epoch": 2.700854700854701, "percentage": 90.03, "elapsed_time": "2:05:09", "remaining_time": "0:13:51"} | |
| {"current_steps": 2214, "total_steps": 2457, "loss": 0.2372823804616928, "lr": 2.5355005281471046e-06, "epoch": 2.7032967032967035, "percentage": 90.11, "elapsed_time": "2:05:16", "remaining_time": "0:13:44"} | |
| {"current_steps": 2216, "total_steps": 2457, "loss": 0.2721218168735504, "lr": 2.526844130610399e-06, "epoch": 2.705738705738706, "percentage": 90.19, "elapsed_time": "2:05:21", "remaining_time": "0:13:38"} | |
| {"current_steps": 2218, "total_steps": 2457, "loss": 0.311516672372818, "lr": 2.5182561600193317e-06, "epoch": 2.7081807081807083, "percentage": 90.27, "elapsed_time": "2:05:27", "remaining_time": "0:13:31"} | |
| {"current_steps": 2220, "total_steps": 2457, "loss": 0.1073763519525528, "lr": 2.5097366857280636e-06, "epoch": 2.7106227106227108, "percentage": 90.35, "elapsed_time": "2:05:33", "remaining_time": "0:13:24"} | |
| {"current_steps": 2222, "total_steps": 2457, "loss": 0.358319491147995, "lr": 2.501285776537593e-06, "epoch": 2.713064713064713, "percentage": 90.44, "elapsed_time": "2:05:38", "remaining_time": "0:13:17"} | |
| {"current_steps": 2224, "total_steps": 2457, "loss": 0.21015426516532898, "lr": 2.4929035006952106e-06, "epoch": 2.7155067155067156, "percentage": 90.52, "elapsed_time": "2:05:45", "remaining_time": "0:13:10"} | |
| {"current_steps": 2226, "total_steps": 2457, "loss": 0.25736236572265625, "lr": 2.4845899258939362e-06, "epoch": 2.717948717948718, "percentage": 90.6, "elapsed_time": "2:05:52", "remaining_time": "0:13:03"} | |
| {"current_steps": 2228, "total_steps": 2457, "loss": 0.2484760284423828, "lr": 2.4763451192719816e-06, "epoch": 2.7203907203907205, "percentage": 90.68, "elapsed_time": "2:05:58", "remaining_time": "0:12:56"} | |
| {"current_steps": 2230, "total_steps": 2457, "loss": 0.4695739150047302, "lr": 2.4681691474122064e-06, "epoch": 2.722832722832723, "percentage": 90.76, "elapsed_time": "2:06:04", "remaining_time": "0:12:50"} | |
| {"current_steps": 2232, "total_steps": 2457, "loss": 0.2893969714641571, "lr": 2.4600620763415754e-06, "epoch": 2.7252747252747254, "percentage": 90.84, "elapsed_time": "2:06:10", "remaining_time": "0:12:43"} | |
| {"current_steps": 2234, "total_steps": 2457, "loss": 0.5152880549430847, "lr": 2.4520239715306325e-06, "epoch": 2.727716727716728, "percentage": 90.92, "elapsed_time": "2:06:15", "remaining_time": "0:12:36"} | |
| {"current_steps": 2236, "total_steps": 2457, "loss": 0.7832448482513428, "lr": 2.4440548978929678e-06, "epoch": 2.7301587301587302, "percentage": 91.01, "elapsed_time": "2:06:22", "remaining_time": "0:12:29"} | |
| {"current_steps": 2238, "total_steps": 2457, "loss": 0.376642107963562, "lr": 2.4361549197846914e-06, "epoch": 2.7326007326007327, "percentage": 91.09, "elapsed_time": "2:06:30", "remaining_time": "0:12:22"} | |
| {"current_steps": 2240, "total_steps": 2457, "loss": 0.26889967918395996, "lr": 2.42832410100392e-06, "epoch": 2.735042735042735, "percentage": 91.17, "elapsed_time": "2:06:37", "remaining_time": "0:12:15"} | |
| {"current_steps": 2242, "total_steps": 2457, "loss": 0.5269310474395752, "lr": 2.420562504790256e-06, "epoch": 2.7374847374847375, "percentage": 91.25, "elapsed_time": "2:06:43", "remaining_time": "0:12:09"} | |
| {"current_steps": 2244, "total_steps": 2457, "loss": 0.2715807557106018, "lr": 2.412870193824278e-06, "epoch": 2.73992673992674, "percentage": 91.33, "elapsed_time": "2:06:49", "remaining_time": "0:12:02"} | |
| {"current_steps": 2246, "total_steps": 2457, "loss": 0.2188037633895874, "lr": 2.4052472302270365e-06, "epoch": 2.7423687423687424, "percentage": 91.41, "elapsed_time": "2:06:55", "remaining_time": "0:11:55"} | |
| {"current_steps": 2248, "total_steps": 2457, "loss": 0.4869040846824646, "lr": 2.3976936755595533e-06, "epoch": 2.744810744810745, "percentage": 91.49, "elapsed_time": "2:07:02", "remaining_time": "0:11:48"} | |
| {"current_steps": 2250, "total_steps": 2457, "loss": 0.40255841612815857, "lr": 2.390209590822319e-06, "epoch": 2.7472527472527473, "percentage": 91.58, "elapsed_time": "2:07:09", "remaining_time": "0:11:41"} | |
| {"current_steps": 2252, "total_steps": 2457, "loss": 0.6289904117584229, "lr": 2.3827950364548034e-06, "epoch": 2.7496947496947497, "percentage": 91.66, "elapsed_time": "2:07:16", "remaining_time": "0:11:35"} | |
| {"current_steps": 2254, "total_steps": 2457, "loss": 0.5615298748016357, "lr": 2.375450072334972e-06, "epoch": 2.752136752136752, "percentage": 91.74, "elapsed_time": "2:07:22", "remaining_time": "0:11:28"} | |
| {"current_steps": 2256, "total_steps": 2457, "loss": 0.2363334745168686, "lr": 2.3681747577787924e-06, "epoch": 2.7545787545787546, "percentage": 91.82, "elapsed_time": "2:07:29", "remaining_time": "0:11:21"} | |
| {"current_steps": 2258, "total_steps": 2457, "loss": 0.4858379364013672, "lr": 2.3609691515397628e-06, "epoch": 2.757020757020757, "percentage": 91.9, "elapsed_time": "2:07:35", "remaining_time": "0:11:14"} | |
| {"current_steps": 2260, "total_steps": 2457, "loss": 0.5177884697914124, "lr": 2.3538333118084396e-06, "epoch": 2.7594627594627594, "percentage": 91.98, "elapsed_time": "2:07:41", "remaining_time": "0:11:07"} | |
| {"current_steps": 2262, "total_steps": 2457, "loss": 0.5373342037200928, "lr": 2.3467672962119565e-06, "epoch": 2.761904761904762, "percentage": 92.06, "elapsed_time": "2:07:46", "remaining_time": "0:11:00"} | |
| {"current_steps": 2264, "total_steps": 2457, "loss": 0.43640759587287903, "lr": 2.3397711618135725e-06, "epoch": 2.7643467643467643, "percentage": 92.14, "elapsed_time": "2:07:53", "remaining_time": "0:10:54"} | |
| {"current_steps": 2266, "total_steps": 2457, "loss": 0.3964022099971771, "lr": 2.332844965112201e-06, "epoch": 2.7667887667887667, "percentage": 92.23, "elapsed_time": "2:07:59", "remaining_time": "0:10:47"} | |
| {"current_steps": 2268, "total_steps": 2457, "loss": 0.3127731680870056, "lr": 2.3259887620419573e-06, "epoch": 2.769230769230769, "percentage": 92.31, "elapsed_time": "2:08:05", "remaining_time": "0:10:40"} | |
| {"current_steps": 2270, "total_steps": 2457, "loss": 0.2613333463668823, "lr": 2.3192026079717086e-06, "epoch": 2.7716727716727716, "percentage": 92.39, "elapsed_time": "2:08:11", "remaining_time": "0:10:33"} | |
| {"current_steps": 2272, "total_steps": 2457, "loss": 0.07839272171258926, "lr": 2.3124865577046252e-06, "epoch": 2.774114774114774, "percentage": 92.47, "elapsed_time": "2:08:16", "remaining_time": "0:10:26"} | |
| {"current_steps": 2274, "total_steps": 2457, "loss": 0.502284824848175, "lr": 2.3058406654777355e-06, "epoch": 2.7765567765567765, "percentage": 92.55, "elapsed_time": "2:08:22", "remaining_time": "0:10:19"} | |
| {"current_steps": 2276, "total_steps": 2457, "loss": 0.6292468905448914, "lr": 2.299264984961492e-06, "epoch": 2.778998778998779, "percentage": 92.63, "elapsed_time": "2:08:29", "remaining_time": "0:10:13"} | |
| {"current_steps": 2278, "total_steps": 2457, "loss": 0.3484017252922058, "lr": 2.2927595692593366e-06, "epoch": 2.7814407814407813, "percentage": 92.71, "elapsed_time": "2:08:34", "remaining_time": "0:10:06"} | |
| {"current_steps": 2280, "total_steps": 2457, "loss": 0.18759427964687347, "lr": 2.286324470907269e-06, "epoch": 2.7838827838827838, "percentage": 92.8, "elapsed_time": "2:08:40", "remaining_time": "0:09:59"} | |
| {"current_steps": 2282, "total_steps": 2457, "loss": 0.419060617685318, "lr": 2.279959741873426e-06, "epoch": 2.786324786324786, "percentage": 92.88, "elapsed_time": "2:08:47", "remaining_time": "0:09:52"} | |
| {"current_steps": 2284, "total_steps": 2457, "loss": 0.4783077836036682, "lr": 2.2736654335576634e-06, "epoch": 2.7887667887667886, "percentage": 92.96, "elapsed_time": "2:08:53", "remaining_time": "0:09:45"} | |
| {"current_steps": 2286, "total_steps": 2457, "loss": 0.4703105390071869, "lr": 2.267441596791132e-06, "epoch": 2.791208791208791, "percentage": 93.04, "elapsed_time": "2:09:00", "remaining_time": "0:09:39"} | |
| {"current_steps": 2288, "total_steps": 2457, "loss": 0.41585975885391235, "lr": 2.2612882818358784e-06, "epoch": 2.7936507936507935, "percentage": 93.12, "elapsed_time": "2:09:06", "remaining_time": "0:09:32"} | |
| {"current_steps": 2290, "total_steps": 2457, "loss": 0.08420296758413315, "lr": 2.2552055383844327e-06, "epoch": 2.796092796092796, "percentage": 93.2, "elapsed_time": "2:09:13", "remaining_time": "0:09:25"} | |
| {"current_steps": 2292, "total_steps": 2457, "loss": 0.35032370686531067, "lr": 2.2491934155594063e-06, "epoch": 2.7985347985347984, "percentage": 93.28, "elapsed_time": "2:09:19", "remaining_time": "0:09:18"} | |
| {"current_steps": 2294, "total_steps": 2457, "loss": 0.36088746786117554, "lr": 2.243251961913099e-06, "epoch": 2.800976800976801, "percentage": 93.37, "elapsed_time": "2:09:25", "remaining_time": "0:09:11"} | |
| {"current_steps": 2296, "total_steps": 2457, "loss": 0.42339953780174255, "lr": 2.2373812254271074e-06, "epoch": 2.8034188034188032, "percentage": 93.45, "elapsed_time": "2:09:31", "remaining_time": "0:09:04"} | |
| {"current_steps": 2298, "total_steps": 2457, "loss": 0.1882065087556839, "lr": 2.231581253511929e-06, "epoch": 2.8058608058608057, "percentage": 93.53, "elapsed_time": "2:09:38", "remaining_time": "0:08:58"} | |
| {"current_steps": 2300, "total_steps": 2457, "loss": 0.33834829926490784, "lr": 2.2258520930065902e-06, "epoch": 2.808302808302808, "percentage": 93.61, "elapsed_time": "2:09:46", "remaining_time": "0:08:51"} | |
| {"current_steps": 2302, "total_steps": 2457, "loss": 0.5746235847473145, "lr": 2.2201937901782632e-06, "epoch": 2.8107448107448105, "percentage": 93.69, "elapsed_time": "2:09:52", "remaining_time": "0:08:44"} | |
| {"current_steps": 2304, "total_steps": 2457, "loss": 0.2884528338909149, "lr": 2.2146063907218928e-06, "epoch": 2.813186813186813, "percentage": 93.77, "elapsed_time": "2:09:57", "remaining_time": "0:08:37"} | |
| {"current_steps": 2306, "total_steps": 2457, "loss": 0.34547799825668335, "lr": 2.2090899397598235e-06, "epoch": 2.8156288156288154, "percentage": 93.85, "elapsed_time": "2:10:04", "remaining_time": "0:08:31"} | |
| {"current_steps": 2308, "total_steps": 2457, "loss": 0.4068155288696289, "lr": 2.2036444818414424e-06, "epoch": 2.818070818070818, "percentage": 93.94, "elapsed_time": "2:10:09", "remaining_time": "0:08:24"} | |
| {"current_steps": 2310, "total_steps": 2457, "loss": 0.4539620876312256, "lr": 2.198270060942815e-06, "epoch": 2.8205128205128203, "percentage": 94.02, "elapsed_time": "2:10:15", "remaining_time": "0:08:17"} | |
| {"current_steps": 2312, "total_steps": 2457, "loss": 0.22723491489887238, "lr": 2.192966720466328e-06, "epoch": 2.8229548229548227, "percentage": 94.1, "elapsed_time": "2:10:24", "remaining_time": "0:08:10"} | |
| {"current_steps": 2314, "total_steps": 2457, "loss": 0.287578284740448, "lr": 2.1877345032403458e-06, "epoch": 2.825396825396825, "percentage": 94.18, "elapsed_time": "2:10:30", "remaining_time": "0:08:03"} | |
| {"current_steps": 2316, "total_steps": 2457, "loss": 0.4537888169288635, "lr": 2.182573451518859e-06, "epoch": 2.8278388278388276, "percentage": 94.26, "elapsed_time": "2:10:37", "remaining_time": "0:07:57"} | |
| {"current_steps": 2318, "total_steps": 2457, "loss": 0.3850943446159363, "lr": 2.1774836069811415e-06, "epoch": 2.8302808302808304, "percentage": 94.34, "elapsed_time": "2:10:43", "remaining_time": "0:07:50"} | |
| {"current_steps": 2320, "total_steps": 2457, "loss": 0.22680553793907166, "lr": 2.1724650107314217e-06, "epoch": 2.832722832722833, "percentage": 94.42, "elapsed_time": "2:10:49", "remaining_time": "0:07:43"} | |
| {"current_steps": 2322, "total_steps": 2457, "loss": 0.34959569573402405, "lr": 2.1675177032985435e-06, "epoch": 2.8351648351648353, "percentage": 94.51, "elapsed_time": "2:10:55", "remaining_time": "0:07:36"} | |
| {"current_steps": 2324, "total_steps": 2457, "loss": 0.08046525716781616, "lr": 2.1626417246356398e-06, "epoch": 2.8376068376068377, "percentage": 94.59, "elapsed_time": "2:11:00", "remaining_time": "0:07:29"} | |
| {"current_steps": 2326, "total_steps": 2457, "loss": 0.3989933431148529, "lr": 2.1578371141198154e-06, "epoch": 2.84004884004884, "percentage": 94.67, "elapsed_time": "2:11:07", "remaining_time": "0:07:23"} | |
| {"current_steps": 2328, "total_steps": 2457, "loss": 0.27708202600479126, "lr": 2.15310391055182e-06, "epoch": 2.8424908424908426, "percentage": 94.75, "elapsed_time": "2:11:14", "remaining_time": "0:07:16"} | |
| {"current_steps": 2330, "total_steps": 2457, "loss": 0.24901802837848663, "lr": 2.1484421521557453e-06, "epoch": 2.844932844932845, "percentage": 94.83, "elapsed_time": "2:11:19", "remaining_time": "0:07:09"} | |
| {"current_steps": 2332, "total_steps": 2457, "loss": 0.45619091391563416, "lr": 2.143851876578706e-06, "epoch": 2.8473748473748475, "percentage": 94.91, "elapsed_time": "2:11:25", "remaining_time": "0:07:02"} | |
| {"current_steps": 2334, "total_steps": 2457, "loss": 0.07932747900485992, "lr": 2.1393331208905436e-06, "epoch": 2.84981684981685, "percentage": 94.99, "elapsed_time": "2:11:30", "remaining_time": "0:06:55"} | |
| {"current_steps": 2336, "total_steps": 2457, "loss": 0.5910269021987915, "lr": 2.134885921583522e-06, "epoch": 2.8522588522588523, "percentage": 95.08, "elapsed_time": "2:11:36", "remaining_time": "0:06:49"} | |
| {"current_steps": 2338, "total_steps": 2457, "loss": 0.3153696656227112, "lr": 2.1305103145720383e-06, "epoch": 2.8547008547008548, "percentage": 95.16, "elapsed_time": "2:11:42", "remaining_time": "0:06:42"} | |
| {"current_steps": 2340, "total_steps": 2457, "loss": 0.47363409399986267, "lr": 2.1262063351923255e-06, "epoch": 2.857142857142857, "percentage": 95.24, "elapsed_time": "2:11:49", "remaining_time": "0:06:35"} | |
| {"current_steps": 2342, "total_steps": 2457, "loss": 0.48734188079833984, "lr": 2.121974018202172e-06, "epoch": 2.8595848595848596, "percentage": 95.32, "elapsed_time": "2:11:56", "remaining_time": "0:06:28"} | |
| {"current_steps": 2344, "total_steps": 2457, "loss": 0.19048890471458435, "lr": 2.1178133977806413e-06, "epoch": 2.862026862026862, "percentage": 95.4, "elapsed_time": "2:12:02", "remaining_time": "0:06:21"} | |
| {"current_steps": 2346, "total_steps": 2457, "loss": 0.6129634976387024, "lr": 2.113724507527794e-06, "epoch": 2.8644688644688645, "percentage": 95.48, "elapsed_time": "2:12:08", "remaining_time": "0:06:15"} | |
| {"current_steps": 2348, "total_steps": 2457, "loss": 0.2763885259628296, "lr": 2.1097073804644163e-06, "epoch": 2.866910866910867, "percentage": 95.56, "elapsed_time": "2:12:14", "remaining_time": "0:06:08"} | |
| {"current_steps": 2350, "total_steps": 2457, "loss": 0.2500677704811096, "lr": 2.105762049031753e-06, "epoch": 2.8693528693528694, "percentage": 95.65, "elapsed_time": "2:12:19", "remaining_time": "0:06:01"} | |
| {"current_steps": 2352, "total_steps": 2457, "loss": 0.45614075660705566, "lr": 2.1018885450912487e-06, "epoch": 2.871794871794872, "percentage": 95.73, "elapsed_time": "2:12:25", "remaining_time": "0:05:54"} | |
| {"current_steps": 2354, "total_steps": 2457, "loss": 0.3945198953151703, "lr": 2.098086899924288e-06, "epoch": 2.8742368742368742, "percentage": 95.81, "elapsed_time": "2:12:31", "remaining_time": "0:05:47"} | |
| {"current_steps": 2356, "total_steps": 2457, "loss": 0.49924108386039734, "lr": 2.0943571442319437e-06, "epoch": 2.8766788766788767, "percentage": 95.89, "elapsed_time": "2:12:38", "remaining_time": "0:05:41"} | |
| {"current_steps": 2358, "total_steps": 2457, "loss": 0.4753328263759613, "lr": 2.090699308134726e-06, "epoch": 2.879120879120879, "percentage": 95.97, "elapsed_time": "2:12:45", "remaining_time": "0:05:34"} | |
| {"current_steps": 2360, "total_steps": 2457, "loss": 0.23788021504878998, "lr": 2.0871134211723417e-06, "epoch": 2.8815628815628815, "percentage": 96.05, "elapsed_time": "2:12:50", "remaining_time": "0:05:27"} | |
| {"current_steps": 2362, "total_steps": 2457, "loss": 0.32568857073783875, "lr": 2.0835995123034603e-06, "epoch": 2.884004884004884, "percentage": 96.13, "elapsed_time": "2:12:56", "remaining_time": "0:05:20"} | |
| {"current_steps": 2364, "total_steps": 2457, "loss": 0.6228987574577332, "lr": 2.0801576099054696e-06, "epoch": 2.8864468864468864, "percentage": 96.21, "elapsed_time": "2:13:03", "remaining_time": "0:05:14"} | |
| {"current_steps": 2366, "total_steps": 2457, "loss": 0.39544668793678284, "lr": 2.0767877417742564e-06, "epoch": 2.888888888888889, "percentage": 96.3, "elapsed_time": "2:13:10", "remaining_time": "0:05:07"} | |
| {"current_steps": 2368, "total_steps": 2457, "loss": 0.3747745156288147, "lr": 2.0734899351239744e-06, "epoch": 2.8913308913308913, "percentage": 96.38, "elapsed_time": "2:13:16", "remaining_time": "0:05:00"} | |
| {"current_steps": 2370, "total_steps": 2457, "loss": 0.3083977997303009, "lr": 2.0702642165868326e-06, "epoch": 2.8937728937728937, "percentage": 96.46, "elapsed_time": "2:13:23", "remaining_time": "0:04:53"} | |
| {"current_steps": 2372, "total_steps": 2457, "loss": 0.388817697763443, "lr": 2.0671106122128717e-06, "epoch": 2.896214896214896, "percentage": 96.54, "elapsed_time": "2:13:30", "remaining_time": "0:04:47"} | |
| {"current_steps": 2374, "total_steps": 2457, "loss": 0.3050660490989685, "lr": 2.064029147469759e-06, "epoch": 2.8986568986568986, "percentage": 96.62, "elapsed_time": "2:13:35", "remaining_time": "0:04:40"} | |
| {"current_steps": 2376, "total_steps": 2457, "loss": 0.42830216884613037, "lr": 2.0610198472425817e-06, "epoch": 2.901098901098901, "percentage": 96.7, "elapsed_time": "2:13:41", "remaining_time": "0:04:33"} | |
| {"current_steps": 2378, "total_steps": 2457, "loss": 0.4124550223350525, "lr": 2.0580827358336447e-06, "epoch": 2.9035409035409034, "percentage": 96.78, "elapsed_time": "2:13:48", "remaining_time": "0:04:26"} | |
| {"current_steps": 2380, "total_steps": 2457, "loss": 0.34032320976257324, "lr": 2.055217836962276e-06, "epoch": 2.905982905982906, "percentage": 96.87, "elapsed_time": "2:13:54", "remaining_time": "0:04:19"} | |
| {"current_steps": 2382, "total_steps": 2457, "loss": 0.5842119455337524, "lr": 2.0524251737646367e-06, "epoch": 2.9084249084249083, "percentage": 96.95, "elapsed_time": "2:14:00", "remaining_time": "0:04:13"} | |
| {"current_steps": 2384, "total_steps": 2457, "loss": 0.308889776468277, "lr": 2.049704768793527e-06, "epoch": 2.9108669108669107, "percentage": 97.03, "elapsed_time": "2:14:07", "remaining_time": "0:04:06"} | |
| {"current_steps": 2386, "total_steps": 2457, "loss": 0.736882746219635, "lr": 2.0470566440182126e-06, "epoch": 2.913308913308913, "percentage": 97.11, "elapsed_time": "2:14:13", "remaining_time": "0:03:59"} | |
| {"current_steps": 2388, "total_steps": 2457, "loss": 0.3669341504573822, "lr": 2.0444808208242414e-06, "epoch": 2.9157509157509156, "percentage": 97.19, "elapsed_time": "2:14:19", "remaining_time": "0:03:52"} | |
| {"current_steps": 2390, "total_steps": 2457, "loss": 0.303989052772522, "lr": 2.041977320013275e-06, "epoch": 2.918192918192918, "percentage": 97.27, "elapsed_time": "2:14:24", "remaining_time": "0:03:46"} | |
| {"current_steps": 2392, "total_steps": 2457, "loss": 0.4449572265148163, "lr": 2.0395461618029175e-06, "epoch": 2.9206349206349205, "percentage": 97.35, "elapsed_time": "2:14:31", "remaining_time": "0:03:39"} | |
| {"current_steps": 2394, "total_steps": 2457, "loss": 0.31565719842910767, "lr": 2.0371873658265546e-06, "epoch": 2.9230769230769234, "percentage": 97.44, "elapsed_time": "2:14:36", "remaining_time": "0:03:32"} | |
| {"current_steps": 2396, "total_steps": 2457, "loss": 0.24595557153224945, "lr": 2.0349009511331912e-06, "epoch": 2.925518925518926, "percentage": 97.52, "elapsed_time": "2:14:42", "remaining_time": "0:03:25"} | |
| {"current_steps": 2398, "total_steps": 2457, "loss": 0.30839934945106506, "lr": 2.032686936187305e-06, "epoch": 2.927960927960928, "percentage": 97.6, "elapsed_time": "2:14:48", "remaining_time": "0:03:19"} | |
| {"current_steps": 2400, "total_steps": 2457, "loss": 0.32078707218170166, "lr": 2.0305453388686876e-06, "epoch": 2.9304029304029307, "percentage": 97.68, "elapsed_time": "2:14:54", "remaining_time": "0:03:12"} | |
| {"current_steps": 2402, "total_steps": 2457, "loss": 0.27718839049339294, "lr": 2.0284761764723087e-06, "epoch": 2.932844932844933, "percentage": 97.76, "elapsed_time": "2:14:59", "remaining_time": "0:03:05"} | |
| {"current_steps": 2404, "total_steps": 2457, "loss": 0.18042829632759094, "lr": 2.026479465708171e-06, "epoch": 2.9352869352869355, "percentage": 97.84, "elapsed_time": "2:15:05", "remaining_time": "0:02:58"} | |
| {"current_steps": 2406, "total_steps": 2457, "loss": 0.5652621984481812, "lr": 2.0245552227011777e-06, "epoch": 2.937728937728938, "percentage": 97.92, "elapsed_time": "2:15:11", "remaining_time": "0:02:51"} | |
| {"current_steps": 2408, "total_steps": 2457, "loss": 0.28077784180641174, "lr": 2.022703462991003e-06, "epoch": 2.9401709401709404, "percentage": 98.01, "elapsed_time": "2:15:18", "remaining_time": "0:02:45"} | |
| {"current_steps": 2410, "total_steps": 2457, "loss": 0.312043696641922, "lr": 2.0209242015319625e-06, "epoch": 2.942612942612943, "percentage": 98.09, "elapsed_time": "2:15:24", "remaining_time": "0:02:38"} | |
| {"current_steps": 2412, "total_steps": 2457, "loss": 0.42037639021873474, "lr": 2.0192174526928982e-06, "epoch": 2.9450549450549453, "percentage": 98.17, "elapsed_time": "2:15:30", "remaining_time": "0:02:31"} | |
| {"current_steps": 2414, "total_steps": 2457, "loss": 0.5173778533935547, "lr": 2.0175832302570575e-06, "epoch": 2.9474969474969477, "percentage": 98.25, "elapsed_time": "2:15:38", "remaining_time": "0:02:24"} | |
| {"current_steps": 2416, "total_steps": 2457, "loss": 0.46436506509780884, "lr": 2.016021547421984e-06, "epoch": 2.94993894993895, "percentage": 98.33, "elapsed_time": "2:15:44", "remaining_time": "0:02:18"} | |
| {"current_steps": 2418, "total_steps": 2457, "loss": 0.24875374138355255, "lr": 2.0145324167994134e-06, "epoch": 2.9523809523809526, "percentage": 98.41, "elapsed_time": "2:15:50", "remaining_time": "0:02:11"} | |
| {"current_steps": 2420, "total_steps": 2457, "loss": 0.35978463292121887, "lr": 2.0131158504151655e-06, "epoch": 2.954822954822955, "percentage": 98.49, "elapsed_time": "2:15:56", "remaining_time": "0:02:04"} | |
| {"current_steps": 2422, "total_steps": 2457, "loss": 0.3947286605834961, "lr": 2.0117718597090543e-06, "epoch": 2.9572649572649574, "percentage": 98.58, "elapsed_time": "2:16:02", "remaining_time": "0:01:57"} | |
| {"current_steps": 2424, "total_steps": 2457, "loss": 0.28263401985168457, "lr": 2.010500455534788e-06, "epoch": 2.95970695970696, "percentage": 98.66, "elapsed_time": "2:16:08", "remaining_time": "0:01:51"} | |
| {"current_steps": 2426, "total_steps": 2457, "loss": 0.5800071954727173, "lr": 2.0093016481598885e-06, "epoch": 2.9621489621489623, "percentage": 98.74, "elapsed_time": "2:16:16", "remaining_time": "0:01:44"} | |
| {"current_steps": 2428, "total_steps": 2457, "loss": 0.1977805346250534, "lr": 2.0081754472656034e-06, "epoch": 2.9645909645909647, "percentage": 98.82, "elapsed_time": "2:16:22", "remaining_time": "0:01:37"} | |
| {"current_steps": 2430, "total_steps": 2457, "loss": 0.3762721121311188, "lr": 2.0071218619468327e-06, "epoch": 2.967032967032967, "percentage": 98.9, "elapsed_time": "2:16:29", "remaining_time": "0:01:30"} | |
| {"current_steps": 2432, "total_steps": 2457, "loss": 0.3768196403980255, "lr": 2.0061409007120475e-06, "epoch": 2.9694749694749696, "percentage": 98.98, "elapsed_time": "2:16:34", "remaining_time": "0:01:24"} | |
| {"current_steps": 2434, "total_steps": 2457, "loss": 0.46781641244888306, "lr": 2.005232571483231e-06, "epoch": 2.971916971916972, "percentage": 99.06, "elapsed_time": "2:16:40", "remaining_time": "0:01:17"} | |
| {"current_steps": 2436, "total_steps": 2457, "loss": 0.25440388917922974, "lr": 2.0043968815958075e-06, "epoch": 2.9743589743589745, "percentage": 99.15, "elapsed_time": "2:16:45", "remaining_time": "0:01:10"} | |
| {"current_steps": 2438, "total_steps": 2457, "loss": 0.12983591854572296, "lr": 2.003633837798584e-06, "epoch": 2.976800976800977, "percentage": 99.23, "elapsed_time": "2:16:52", "remaining_time": "0:01:04"} | |
| {"current_steps": 2440, "total_steps": 2457, "loss": 0.43715769052505493, "lr": 2.0029434462537e-06, "epoch": 2.9792429792429793, "percentage": 99.31, "elapsed_time": "2:16:59", "remaining_time": "0:00:57"} | |
| {"current_steps": 2442, "total_steps": 2457, "loss": 0.4317605495452881, "lr": 2.002325712536572e-06, "epoch": 2.9816849816849818, "percentage": 99.39, "elapsed_time": "2:17:05", "remaining_time": "0:00:50"} | |
| {"current_steps": 2444, "total_steps": 2457, "loss": 0.39571458101272583, "lr": 2.001780641635854e-06, "epoch": 2.984126984126984, "percentage": 99.47, "elapsed_time": "2:17:10", "remaining_time": "0:00:43"} | |
| {"current_steps": 2446, "total_steps": 2457, "loss": 0.4417667090892792, "lr": 2.001308237953393e-06, "epoch": 2.9865689865689866, "percentage": 99.55, "elapsed_time": "2:17:17", "remaining_time": "0:00:37"} | |
| {"current_steps": 2448, "total_steps": 2457, "loss": 0.5195387601852417, "lr": 2.000908505304195e-06, "epoch": 2.989010989010989, "percentage": 99.63, "elapsed_time": "2:17:23", "remaining_time": "0:00:30"} | |
| {"current_steps": 2450, "total_steps": 2457, "loss": 0.19710102677345276, "lr": 2.0005814469163937e-06, "epoch": 2.9914529914529915, "percentage": 99.72, "elapsed_time": "2:17:30", "remaining_time": "0:00:23"} | |
| {"current_steps": 2452, "total_steps": 2457, "loss": 0.4630212187767029, "lr": 2.0003270654312266e-06, "epoch": 2.993894993894994, "percentage": 99.8, "elapsed_time": "2:17:36", "remaining_time": "0:00:16"} | |
| {"current_steps": 2454, "total_steps": 2457, "loss": 0.6292054057121277, "lr": 2.000145362903009e-06, "epoch": 2.9963369963369964, "percentage": 99.88, "elapsed_time": "2:17:42", "remaining_time": "0:00:10"} | |
| {"current_steps": 2456, "total_steps": 2457, "loss": 0.16045792400836945, "lr": 2.0000363407991222e-06, "epoch": 2.998778998778999, "percentage": 99.96, "elapsed_time": "2:17:48", "remaining_time": "0:00:03"} | |
| {"current_steps": 2457, "total_steps": 2457, "epoch": 3.0, "percentage": 100.0, "elapsed_time": "2:17:51", "remaining_time": "0:00:00"} | |
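
The rows above can be read back programmatically. The following is a minimal sketch, assuming each row keeps the `| {...} | |` shape shown in this log; the helper names `parse_log_rows` and `summarize` are hypothetical and are not part of any training tool. Note that the final record (step 2457) carries no `loss` or `lr` field, so the summary skips it when averaging.

```python
import json


def parse_log_rows(lines):
    """Extract the JSON object embedded in each table row ('| {...} | |')."""
    records = []
    for line in lines:
        start = line.find("{")
        end = line.rfind("}")
        if start == -1 or end == -1:
            continue  # skip anything that is not a log row
        records.append(json.loads(line[start:end + 1]))
    return records


def summarize(records):
    """Mean loss over records that report one, plus the last logged step."""
    losses = [r["loss"] for r in records if "loss" in r]
    return {
        "logged": len(losses),
        "mean_loss": sum(losses) / len(losses) if losses else None,
        "last_step": max((r["current_steps"] for r in records), default=None),
    }


# Two rows copied from the log above; the last one has no "loss" field.
rows = [
    '| {"current_steps": 2456, "total_steps": 2457, "loss": 0.16045792400836945, '
    '"lr": 2.0000363407991222e-06, "epoch": 2.998778998778999, "percentage": 99.96, '
    '"elapsed_time": "2:17:48", "remaining_time": "0:00:03"} | |',
    '| {"current_steps": 2457, "total_steps": 2457, "epoch": 3.0, "percentage": 100.0, '
    '"elapsed_time": "2:17:51", "remaining_time": "0:00:00"} | |',
]

records = parse_log_rows(rows)
summary = summarize(records)
print(summary)
```

Because individual step losses in this log swing widely from one record to the next, an average over a window of records is usually more informative than any single value.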