Image-Text-to-Text
Transformers
Safetensors
qwen3_5
llama-factory
full
Generated from Trainer
conversational
Instructions to use furproxy/9b-86 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use furproxy/9b-86 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-text-to-text", model="furproxy/9b-86") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] pipe(text=messages)# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("furproxy/9b-86") model = AutoModelForImageTextToText.from_pretrained("furproxy/9b-86") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] inputs = processor.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use furproxy/9b-86 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "furproxy/9b-86" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "furproxy/9b-86", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker
docker model run hf.co/furproxy/9b-86
- SGLang
How to use furproxy/9b-86 with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "furproxy/9b-86" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "furproxy/9b-86", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "furproxy/9b-86" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "furproxy/9b-86", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }' - Docker Model Runner
How to use furproxy/9b-86 with Docker Model Runner:
docker model run hf.co/furproxy/9b-86
| {"current_steps": 2, "total_steps": 1804, "loss": 1.8651859760284424, "lr": 5.494505494505495e-08, "epoch": 0.004434589800443459, "percentage": 0.11, "elapsed_time": "0:00:15", "remaining_time": "3:50:15"} | |
| {"current_steps": 4, "total_steps": 1804, "loss": 2.1317176818847656, "lr": 1.6483516483516484e-07, "epoch": 0.008869179600886918, "percentage": 0.22, "elapsed_time": "0:00:24", "remaining_time": "3:06:26"} | |
| {"current_steps": 6, "total_steps": 1804, "loss": 1.904492735862732, "lr": 2.7472527472527475e-07, "epoch": 0.013303769401330377, "percentage": 0.33, "elapsed_time": "0:00:34", "remaining_time": "2:50:54"} | |
| {"current_steps": 8, "total_steps": 1804, "loss": 1.8241561651229858, "lr": 3.846153846153847e-07, "epoch": 0.017738359201773836, "percentage": 0.44, "elapsed_time": "0:00:43", "remaining_time": "2:42:46"} | |
| {"current_steps": 10, "total_steps": 1804, "loss": 1.6350065469741821, "lr": 4.945054945054946e-07, "epoch": 0.022172949002217297, "percentage": 0.55, "elapsed_time": "0:00:50", "remaining_time": "2:30:57"} | |
| {"current_steps": 12, "total_steps": 1804, "loss": 2.1859989166259766, "lr": 6.043956043956044e-07, "epoch": 0.026607538802660754, "percentage": 0.67, "elapsed_time": "0:01:00", "remaining_time": "2:29:56"} | |
| {"current_steps": 14, "total_steps": 1804, "loss": 1.943555235862732, "lr": 7.142857142857143e-07, "epoch": 0.031042128603104215, "percentage": 0.78, "elapsed_time": "0:01:08", "remaining_time": "2:26:28"} | |
| {"current_steps": 16, "total_steps": 1804, "loss": 1.4925756454467773, "lr": 8.241758241758242e-07, "epoch": 0.03547671840354767, "percentage": 0.89, "elapsed_time": "0:01:15", "remaining_time": "2:20:32"} | |
| {"current_steps": 18, "total_steps": 1804, "loss": 1.4368367195129395, "lr": 9.340659340659342e-07, "epoch": 0.03991130820399113, "percentage": 1.0, "elapsed_time": "0:01:21", "remaining_time": "2:14:30"} | |
| {"current_steps": 20, "total_steps": 1804, "loss": 1.5969985723495483, "lr": 1.0439560439560442e-06, "epoch": 0.04434589800443459, "percentage": 1.11, "elapsed_time": "0:01:29", "remaining_time": "2:13:15"} | |
| {"current_steps": 22, "total_steps": 1804, "loss": 1.462158441543579, "lr": 1.153846153846154e-06, "epoch": 0.04878048780487805, "percentage": 1.22, "elapsed_time": "0:01:39", "remaining_time": "2:14:29"} | |
| {"current_steps": 24, "total_steps": 1804, "loss": 1.5942579507827759, "lr": 1.2637362637362637e-06, "epoch": 0.05321507760532151, "percentage": 1.33, "elapsed_time": "0:01:50", "remaining_time": "2:16:49"} | |
| {"current_steps": 26, "total_steps": 1804, "loss": 1.4908994436264038, "lr": 1.3736263736263738e-06, "epoch": 0.057649667405764965, "percentage": 1.44, "elapsed_time": "0:02:00", "remaining_time": "2:17:12"} | |
| {"current_steps": 28, "total_steps": 1804, "loss": 1.179060935974121, "lr": 1.4835164835164837e-06, "epoch": 0.06208425720620843, "percentage": 1.55, "elapsed_time": "0:02:07", "remaining_time": "2:14:57"} | |
| {"current_steps": 30, "total_steps": 1804, "loss": 1.3003807067871094, "lr": 1.5934065934065933e-06, "epoch": 0.06651884700665188, "percentage": 1.66, "elapsed_time": "0:02:17", "remaining_time": "2:15:33"} | |
| {"current_steps": 32, "total_steps": 1804, "loss": 1.2082353830337524, "lr": 1.7032967032967034e-06, "epoch": 0.07095343680709534, "percentage": 1.77, "elapsed_time": "0:02:27", "remaining_time": "2:15:58"} | |
| {"current_steps": 34, "total_steps": 1804, "loss": 1.5782675743103027, "lr": 1.8131868131868133e-06, "epoch": 0.07538802660753881, "percentage": 1.88, "elapsed_time": "0:02:36", "remaining_time": "2:16:06"} | |
| {"current_steps": 36, "total_steps": 1804, "loss": 1.342716932296753, "lr": 1.9230769230769234e-06, "epoch": 0.07982261640798226, "percentage": 2.0, "elapsed_time": "0:02:46", "remaining_time": "2:16:22"} | |
| {"current_steps": 38, "total_steps": 1804, "loss": 1.5886274576187134, "lr": 2.032967032967033e-06, "epoch": 0.08425720620842572, "percentage": 2.11, "elapsed_time": "0:02:56", "remaining_time": "2:16:42"} | |
| {"current_steps": 40, "total_steps": 1804, "loss": 0.8519856929779053, "lr": 2.1428571428571427e-06, "epoch": 0.08869179600886919, "percentage": 2.22, "elapsed_time": "0:03:02", "remaining_time": "2:13:53"} | |
| {"current_steps": 42, "total_steps": 1804, "loss": 1.4712433815002441, "lr": 2.252747252747253e-06, "epoch": 0.09312638580931264, "percentage": 2.33, "elapsed_time": "0:03:12", "remaining_time": "2:14:23"} | |
| {"current_steps": 44, "total_steps": 1804, "loss": 1.4157054424285889, "lr": 2.362637362637363e-06, "epoch": 0.0975609756097561, "percentage": 2.44, "elapsed_time": "0:03:21", "remaining_time": "2:14:01"} | |
| {"current_steps": 46, "total_steps": 1804, "loss": 1.428381323814392, "lr": 2.472527472527473e-06, "epoch": 0.10199556541019955, "percentage": 2.55, "elapsed_time": "0:03:28", "remaining_time": "2:12:58"} | |
| {"current_steps": 48, "total_steps": 1804, "loss": 1.6168309450149536, "lr": 2.582417582417583e-06, "epoch": 0.10643015521064302, "percentage": 2.66, "elapsed_time": "0:03:38", "remaining_time": "2:13:15"} | |
| {"current_steps": 50, "total_steps": 1804, "loss": 1.502392053604126, "lr": 2.6923076923076923e-06, "epoch": 0.11086474501108648, "percentage": 2.77, "elapsed_time": "0:03:48", "remaining_time": "2:13:34"} | |
| {"current_steps": 52, "total_steps": 1804, "loss": 1.4243361949920654, "lr": 2.8021978021978024e-06, "epoch": 0.11529933481152993, "percentage": 2.88, "elapsed_time": "0:03:58", "remaining_time": "2:13:57"} | |
| {"current_steps": 54, "total_steps": 1804, "loss": 1.4172306060791016, "lr": 2.9120879120879125e-06, "epoch": 0.1197339246119734, "percentage": 2.99, "elapsed_time": "0:04:08", "remaining_time": "2:13:58"} | |
| {"current_steps": 56, "total_steps": 1804, "loss": 1.1664988994598389, "lr": 3.021978021978022e-06, "epoch": 0.12416851441241686, "percentage": 3.1, "elapsed_time": "0:04:18", "remaining_time": "2:14:17"} | |
| {"current_steps": 58, "total_steps": 1804, "loss": 1.364538550376892, "lr": 3.1318681318681323e-06, "epoch": 0.1286031042128603, "percentage": 3.22, "elapsed_time": "0:04:28", "remaining_time": "2:14:32"} | |
| {"current_steps": 60, "total_steps": 1804, "loss": 1.415244221687317, "lr": 3.2417582417582424e-06, "epoch": 0.13303769401330376, "percentage": 3.33, "elapsed_time": "0:04:35", "remaining_time": "2:13:20"} | |
| {"current_steps": 62, "total_steps": 1804, "loss": 1.3695659637451172, "lr": 3.3516483516483516e-06, "epoch": 0.13747228381374724, "percentage": 3.44, "elapsed_time": "0:04:45", "remaining_time": "2:13:34"} | |
| {"current_steps": 64, "total_steps": 1804, "loss": 1.435889720916748, "lr": 3.4615384615384617e-06, "epoch": 0.1419068736141907, "percentage": 3.55, "elapsed_time": "0:04:55", "remaining_time": "2:13:49"} | |
| {"current_steps": 66, "total_steps": 1804, "loss": 1.4904870986938477, "lr": 3.5714285714285718e-06, "epoch": 0.14634146341463414, "percentage": 3.66, "elapsed_time": "0:05:03", "remaining_time": "2:13:20"} | |
| {"current_steps": 68, "total_steps": 1804, "loss": 1.3552013635635376, "lr": 3.681318681318682e-06, "epoch": 0.15077605321507762, "percentage": 3.77, "elapsed_time": "0:05:14", "remaining_time": "2:13:37"} | |
| {"current_steps": 70, "total_steps": 1804, "loss": 1.3096868991851807, "lr": 3.7912087912087915e-06, "epoch": 0.15521064301552107, "percentage": 3.88, "elapsed_time": "0:05:24", "remaining_time": "2:13:51"} | |
| {"current_steps": 72, "total_steps": 1804, "loss": 1.3637210130691528, "lr": 3.901098901098901e-06, "epoch": 0.15964523281596452, "percentage": 3.99, "elapsed_time": "0:05:34", "remaining_time": "2:14:07"} | |
| {"current_steps": 74, "total_steps": 1804, "loss": 1.2611558437347412, "lr": 4.010989010989012e-06, "epoch": 0.164079822616408, "percentage": 4.1, "elapsed_time": "0:05:44", "remaining_time": "2:14:10"} | |
| {"current_steps": 76, "total_steps": 1804, "loss": 1.3402124643325806, "lr": 4.120879120879121e-06, "epoch": 0.16851441241685144, "percentage": 4.21, "elapsed_time": "0:05:54", "remaining_time": "2:14:16"} | |
| {"current_steps": 78, "total_steps": 1804, "loss": 1.3869333267211914, "lr": 4.230769230769231e-06, "epoch": 0.1729490022172949, "percentage": 4.32, "elapsed_time": "0:06:04", "remaining_time": "2:14:24"} | |
| {"current_steps": 80, "total_steps": 1804, "loss": 1.3390436172485352, "lr": 4.340659340659341e-06, "epoch": 0.17738359201773837, "percentage": 4.43, "elapsed_time": "0:06:11", "remaining_time": "2:13:18"} | |
| {"current_steps": 82, "total_steps": 1804, "loss": 0.8444686532020569, "lr": 4.45054945054945e-06, "epoch": 0.18181818181818182, "percentage": 4.55, "elapsed_time": "0:06:18", "remaining_time": "2:12:20"} | |
| {"current_steps": 84, "total_steps": 1804, "loss": 1.0279695987701416, "lr": 4.560439560439561e-06, "epoch": 0.18625277161862527, "percentage": 4.66, "elapsed_time": "0:06:27", "remaining_time": "2:12:16"} | |
| {"current_steps": 86, "total_steps": 1804, "loss": 0.9698995351791382, "lr": 4.6703296703296706e-06, "epoch": 0.19068736141906872, "percentage": 4.77, "elapsed_time": "0:06:37", "remaining_time": "2:12:17"} | |
| {"current_steps": 88, "total_steps": 1804, "loss": 1.2624331712722778, "lr": 4.780219780219781e-06, "epoch": 0.1951219512195122, "percentage": 4.88, "elapsed_time": "0:06:47", "remaining_time": "2:12:23"} | |
| {"current_steps": 90, "total_steps": 1804, "loss": 1.392942190170288, "lr": 4.890109890109891e-06, "epoch": 0.19955654101995565, "percentage": 4.99, "elapsed_time": "0:06:55", "remaining_time": "2:11:58"} | |
| {"current_steps": 92, "total_steps": 1804, "loss": 1.229879379272461, "lr": 5e-06, "epoch": 0.2039911308203991, "percentage": 5.1, "elapsed_time": "0:07:05", "remaining_time": "2:12:06"} | |
| {"current_steps": 94, "total_steps": 1804, "loss": 1.2555553913116455, "lr": 4.999984864490455e-06, "epoch": 0.20842572062084258, "percentage": 5.21, "elapsed_time": "0:07:16", "remaining_time": "2:12:15"} | |
| {"current_steps": 96, "total_steps": 1804, "loss": 1.8031316995620728, "lr": 4.999939458165447e-06, "epoch": 0.21286031042128603, "percentage": 5.32, "elapsed_time": "0:07:25", "remaining_time": "2:12:13"} | |
| {"current_steps": 98, "total_steps": 1804, "loss": 0.950995922088623, "lr": 4.999863781635863e-06, "epoch": 0.21729490022172948, "percentage": 5.43, "elapsed_time": "0:07:33", "remaining_time": "2:11:30"} | |
| {"current_steps": 100, "total_steps": 1804, "loss": 1.1569544076919556, "lr": 4.999757835919841e-06, "epoch": 0.22172949002217296, "percentage": 5.54, "elapsed_time": "0:07:40", "remaining_time": "2:10:46"} | |
| {"current_steps": 102, "total_steps": 1804, "loss": 1.313308596611023, "lr": 4.9996216224427495e-06, "epoch": 0.2261640798226164, "percentage": 5.65, "elapsed_time": "0:07:50", "remaining_time": "2:10:43"} | |
| {"current_steps": 104, "total_steps": 1804, "loss": 1.0472400188446045, "lr": 4.999455143037178e-06, "epoch": 0.23059866962305986, "percentage": 5.76, "elapsed_time": "0:07:57", "remaining_time": "2:10:10"} | |
| {"current_steps": 106, "total_steps": 1804, "loss": 1.2840181589126587, "lr": 4.999258399942903e-06, "epoch": 0.23503325942350334, "percentage": 5.88, "elapsed_time": "0:08:07", "remaining_time": "2:10:10"} | |
| {"current_steps": 108, "total_steps": 1804, "loss": 1.48462975025177, "lr": 4.9990313958068645e-06, "epoch": 0.2394678492239468, "percentage": 5.99, "elapsed_time": "0:08:17", "remaining_time": "2:10:13"} | |
| {"current_steps": 110, "total_steps": 1804, "loss": 1.026124119758606, "lr": 4.998774133683127e-06, "epoch": 0.24390243902439024, "percentage": 6.1, "elapsed_time": "0:08:24", "remaining_time": "2:09:26"} | |
| {"current_steps": 112, "total_steps": 1804, "loss": 1.2828115224838257, "lr": 4.9984866170328426e-06, "epoch": 0.24833702882483372, "percentage": 6.21, "elapsed_time": "0:08:33", "remaining_time": "2:09:17"} | |
| {"current_steps": 114, "total_steps": 1804, "loss": 0.7794591188430786, "lr": 4.998168849724196e-06, "epoch": 0.25277161862527714, "percentage": 6.32, "elapsed_time": "0:08:37", "remaining_time": "2:07:53"} | |
| {"current_steps": 116, "total_steps": 1804, "loss": 1.3091580867767334, "lr": 4.997820836032363e-06, "epoch": 0.2572062084257206, "percentage": 6.43, "elapsed_time": "0:08:46", "remaining_time": "2:07:44"} | |
| {"current_steps": 118, "total_steps": 1804, "loss": 1.0651829242706299, "lr": 4.997442580639443e-06, "epoch": 0.2616407982261641, "percentage": 6.54, "elapsed_time": "0:08:53", "remaining_time": "2:07:06"} | |
| {"current_steps": 120, "total_steps": 1804, "loss": 1.2504687309265137, "lr": 4.997034088634404e-06, "epoch": 0.2660753880266075, "percentage": 6.65, "elapsed_time": "0:09:04", "remaining_time": "2:07:16"} | |
| {"current_steps": 122, "total_steps": 1804, "loss": 1.1688843965530396, "lr": 4.996595365513012e-06, "epoch": 0.270509977827051, "percentage": 6.76, "elapsed_time": "0:09:13", "remaining_time": "2:07:06"} | |
| {"current_steps": 124, "total_steps": 1804, "loss": 1.5667338371276855, "lr": 4.9961264171777515e-06, "epoch": 0.2749445676274945, "percentage": 6.87, "elapsed_time": "0:09:23", "remaining_time": "2:07:08"} | |
| {"current_steps": 126, "total_steps": 1804, "loss": 0.9175050258636475, "lr": 4.995627249937755e-06, "epoch": 0.2793791574279379, "percentage": 6.98, "elapsed_time": "0:09:30", "remaining_time": "2:06:31"} | |
| {"current_steps": 128, "total_steps": 1804, "loss": 1.209236741065979, "lr": 4.995097870508711e-06, "epoch": 0.2838137472283814, "percentage": 7.1, "elapsed_time": "0:09:39", "remaining_time": "2:06:24"} | |
| {"current_steps": 130, "total_steps": 1804, "loss": 0.9163856506347656, "lr": 4.994538286012777e-06, "epoch": 0.28824833702882485, "percentage": 7.21, "elapsed_time": "0:09:45", "remaining_time": "2:05:41"} | |
| {"current_steps": 132, "total_steps": 1804, "loss": 0.974137544631958, "lr": 4.993948503978484e-06, "epoch": 0.2926829268292683, "percentage": 7.32, "elapsed_time": "0:09:55", "remaining_time": "2:05:38"} | |
| {"current_steps": 134, "total_steps": 1804, "loss": 0.9840149283409119, "lr": 4.993328532340633e-06, "epoch": 0.29711751662971175, "percentage": 7.43, "elapsed_time": "0:10:04", "remaining_time": "2:05:34"} | |
| {"current_steps": 136, "total_steps": 1804, "loss": 1.110643982887268, "lr": 4.99267837944019e-06, "epoch": 0.30155210643015523, "percentage": 7.54, "elapsed_time": "0:10:12", "remaining_time": "2:05:06"} | |
| {"current_steps": 138, "total_steps": 1804, "loss": 0.7842212319374084, "lr": 4.991998054024172e-06, "epoch": 0.30598669623059865, "percentage": 7.65, "elapsed_time": "0:10:18", "remaining_time": "2:04:27"} | |
| {"current_steps": 140, "total_steps": 1804, "loss": 1.0406193733215332, "lr": 4.991287565245534e-06, "epoch": 0.31042128603104213, "percentage": 7.76, "elapsed_time": "0:10:22", "remaining_time": "2:03:20"} | |
| {"current_steps": 142, "total_steps": 1804, "loss": 0.9569450616836548, "lr": 4.990546922663039e-06, "epoch": 0.3148558758314856, "percentage": 7.87, "elapsed_time": "0:10:32", "remaining_time": "2:03:26"} | |
| {"current_steps": 144, "total_steps": 1804, "loss": 0.9646241068840027, "lr": 4.989776136241134e-06, "epoch": 0.31929046563192903, "percentage": 7.98, "elapsed_time": "0:10:42", "remaining_time": "2:03:28"} | |
| {"current_steps": 146, "total_steps": 1804, "loss": 0.9507364630699158, "lr": 4.988975216349814e-06, "epoch": 0.3237250554323725, "percentage": 8.09, "elapsed_time": "0:10:52", "remaining_time": "2:03:28"} | |
| {"current_steps": 148, "total_steps": 1804, "loss": 1.401615023612976, "lr": 4.988144173764486e-06, "epoch": 0.328159645232816, "percentage": 8.2, "elapsed_time": "0:11:02", "remaining_time": "2:03:33"} | |
| {"current_steps": 150, "total_steps": 1804, "loss": 1.1528998613357544, "lr": 4.987283019665817e-06, "epoch": 0.3325942350332594, "percentage": 8.31, "elapsed_time": "0:11:12", "remaining_time": "2:03:35"} | |
| {"current_steps": 152, "total_steps": 1804, "loss": 1.060468077659607, "lr": 4.986391765639592e-06, "epoch": 0.3370288248337029, "percentage": 8.43, "elapsed_time": "0:11:22", "remaining_time": "2:03:34"} | |
| {"current_steps": 154, "total_steps": 1804, "loss": 1.2685281038284302, "lr": 4.985470423676551e-06, "epoch": 0.34146341463414637, "percentage": 8.54, "elapsed_time": "0:11:32", "remaining_time": "2:03:38"} | |
| {"current_steps": 156, "total_steps": 1804, "loss": 1.2129993438720703, "lr": 4.984519006172232e-06, "epoch": 0.3458980044345898, "percentage": 8.65, "elapsed_time": "0:11:41", "remaining_time": "2:03:33"} | |
| {"current_steps": 158, "total_steps": 1804, "loss": 1.2925925254821777, "lr": 4.983537525926804e-06, "epoch": 0.35033259423503327, "percentage": 8.76, "elapsed_time": "0:11:51", "remaining_time": "2:03:35"} | |
| {"current_steps": 160, "total_steps": 1804, "loss": 1.1071832180023193, "lr": 4.982525996144891e-06, "epoch": 0.35476718403547675, "percentage": 8.87, "elapsed_time": "0:11:58", "remaining_time": "2:03:06"} | |
| {"current_steps": 162, "total_steps": 1804, "loss": 0.8754007816314697, "lr": 4.981484430435399e-06, "epoch": 0.35920177383592017, "percentage": 8.98, "elapsed_time": "0:12:05", "remaining_time": "2:02:34"} | |
| {"current_steps": 164, "total_steps": 1804, "loss": 0.9003241062164307, "lr": 4.98041284281133e-06, "epoch": 0.36363636363636365, "percentage": 9.09, "elapsed_time": "0:12:11", "remaining_time": "2:01:58"} | |
| {"current_steps": 166, "total_steps": 1804, "loss": 1.273066759109497, "lr": 4.979311247689596e-06, "epoch": 0.36807095343680707, "percentage": 9.2, "elapsed_time": "0:12:21", "remaining_time": "2:01:57"} | |
| {"current_steps": 168, "total_steps": 1804, "loss": 1.1863045692443848, "lr": 4.978179659890821e-06, "epoch": 0.37250554323725055, "percentage": 9.31, "elapsed_time": "0:12:32", "remaining_time": "2:02:03"} | |
| {"current_steps": 170, "total_steps": 1804, "loss": 1.2364041805267334, "lr": 4.977018094639146e-06, "epoch": 0.376940133037694, "percentage": 9.42, "elapsed_time": "0:12:41", "remaining_time": "2:01:57"} | |
| {"current_steps": 172, "total_steps": 1804, "loss": 0.7797529101371765, "lr": 4.975826567562023e-06, "epoch": 0.38137472283813745, "percentage": 9.53, "elapsed_time": "0:12:50", "remaining_time": "2:01:49"} | |
| {"current_steps": 174, "total_steps": 1804, "loss": 1.6287654638290405, "lr": 4.97460509469e-06, "epoch": 0.3858093126385809, "percentage": 9.65, "elapsed_time": "0:12:58", "remaining_time": "2:01:28"} | |
| {"current_steps": 176, "total_steps": 1804, "loss": 1.26861572265625, "lr": 4.973353692456513e-06, "epoch": 0.3902439024390244, "percentage": 9.76, "elapsed_time": "0:13:07", "remaining_time": "2:01:23"} | |
| {"current_steps": 178, "total_steps": 1804, "loss": 1.3224495649337769, "lr": 4.972072377697661e-06, "epoch": 0.3946784922394678, "percentage": 9.87, "elapsed_time": "0:13:17", "remaining_time": "2:01:25"} | |
| {"current_steps": 180, "total_steps": 1804, "loss": 1.05917227268219, "lr": 4.9707611676519775e-06, "epoch": 0.3991130820399113, "percentage": 9.98, "elapsed_time": "0:13:25", "remaining_time": "2:01:03"} | |
| {"current_steps": 182, "total_steps": 1804, "loss": 1.2697656154632568, "lr": 4.969420079960203e-06, "epoch": 0.4035476718403548, "percentage": 10.09, "elapsed_time": "0:13:35", "remaining_time": "2:01:05"} | |
| {"current_steps": 184, "total_steps": 1804, "loss": 0.884378969669342, "lr": 4.968049132665045e-06, "epoch": 0.4079822616407982, "percentage": 10.2, "elapsed_time": "0:13:44", "remaining_time": "2:01:00"} | |
| {"current_steps": 186, "total_steps": 1804, "loss": 0.9717956781387329, "lr": 4.966648344210936e-06, "epoch": 0.4124168514412417, "percentage": 10.31, "elapsed_time": "0:13:53", "remaining_time": "2:00:51"} | |
| {"current_steps": 188, "total_steps": 1804, "loss": 0.9312400817871094, "lr": 4.965217733443782e-06, "epoch": 0.41685144124168516, "percentage": 10.42, "elapsed_time": "0:14:03", "remaining_time": "2:00:47"} | |
| {"current_steps": 190, "total_steps": 1804, "loss": 0.9861539602279663, "lr": 4.963757319610716e-06, "epoch": 0.4212860310421286, "percentage": 10.53, "elapsed_time": "0:14:10", "remaining_time": "2:00:22"} | |
| {"current_steps": 192, "total_steps": 1804, "loss": 0.8997060656547546, "lr": 4.962267122359835e-06, "epoch": 0.42572062084257206, "percentage": 10.64, "elapsed_time": "0:14:16", "remaining_time": "1:59:51"} | |
| {"current_steps": 194, "total_steps": 1804, "loss": 1.2791428565979004, "lr": 4.960747161739931e-06, "epoch": 0.43015521064301554, "percentage": 10.75, "elapsed_time": "0:14:26", "remaining_time": "1:59:54"} | |
| {"current_steps": 196, "total_steps": 1804, "loss": 1.549714207649231, "lr": 4.9591974582002324e-06, "epoch": 0.43458980044345896, "percentage": 10.86, "elapsed_time": "0:14:36", "remaining_time": "1:59:51"} | |
| {"current_steps": 198, "total_steps": 1804, "loss": 1.2765225172042847, "lr": 4.957618032590118e-06, "epoch": 0.43902439024390244, "percentage": 10.98, "elapsed_time": "0:14:46", "remaining_time": "1:59:50"} | |
| {"current_steps": 200, "total_steps": 1804, "loss": 1.1299937963485718, "lr": 4.956008906158842e-06, "epoch": 0.4434589800443459, "percentage": 11.09, "elapsed_time": "0:14:54", "remaining_time": "1:59:30"} | |
| {"current_steps": 202, "total_steps": 1804, "loss": 1.2492018938064575, "lr": 4.954370100555249e-06, "epoch": 0.44789356984478934, "percentage": 11.2, "elapsed_time": "0:15:03", "remaining_time": "1:59:23"} | |
| {"current_steps": 204, "total_steps": 1804, "loss": 1.216017484664917, "lr": 4.952701637827476e-06, "epoch": 0.4523281596452328, "percentage": 11.31, "elapsed_time": "0:15:13", "remaining_time": "1:59:26"} | |
| {"current_steps": 206, "total_steps": 1804, "loss": 1.0710757970809937, "lr": 4.951003540422668e-06, "epoch": 0.4567627494456763, "percentage": 11.42, "elapsed_time": "0:15:20", "remaining_time": "1:59:01"} | |
| {"current_steps": 208, "total_steps": 1804, "loss": 1.044965147972107, "lr": 4.949275831186663e-06, "epoch": 0.4611973392461197, "percentage": 11.53, "elapsed_time": "0:15:27", "remaining_time": "1:58:38"} | |
| {"current_steps": 210, "total_steps": 1804, "loss": 0.645362377166748, "lr": 4.947518533363691e-06, "epoch": 0.4656319290465632, "percentage": 11.64, "elapsed_time": "0:15:36", "remaining_time": "1:58:31"} | |
| {"current_steps": 212, "total_steps": 1804, "loss": 0.8511308431625366, "lr": 4.945731670596062e-06, "epoch": 0.4700665188470067, "percentage": 11.75, "elapsed_time": "0:15:46", "remaining_time": "1:58:29"} | |
| {"current_steps": 214, "total_steps": 1804, "loss": 1.0683618783950806, "lr": 4.943915266923845e-06, "epoch": 0.4745011086474501, "percentage": 11.86, "elapsed_time": "0:15:53", "remaining_time": "1:58:02"} | |
| {"current_steps": 216, "total_steps": 1804, "loss": 1.08345627784729, "lr": 4.942069346784547e-06, "epoch": 0.4789356984478936, "percentage": 11.97, "elapsed_time": "0:16:00", "remaining_time": "1:57:40"} | |
| {"current_steps": 218, "total_steps": 1804, "loss": 1.1189590692520142, "lr": 4.940193935012785e-06, "epoch": 0.48337028824833705, "percentage": 12.08, "elapsed_time": "0:16:10", "remaining_time": "1:57:40"} | |
| {"current_steps": 220, "total_steps": 1804, "loss": 1.2146114110946655, "lr": 4.938289056839946e-06, "epoch": 0.4878048780487805, "percentage": 12.2, "elapsed_time": "0:16:20", "remaining_time": "1:57:39"} | |
| {"current_steps": 222, "total_steps": 1804, "loss": 1.2259554862976074, "lr": 4.936354737893854e-06, "epoch": 0.49223946784922396, "percentage": 12.31, "elapsed_time": "0:16:30", "remaining_time": "1:57:41"} | |
| {"current_steps": 224, "total_steps": 1804, "loss": 1.2009336948394775, "lr": 4.934391004198424e-06, "epoch": 0.49667405764966743, "percentage": 12.42, "elapsed_time": "0:16:41", "remaining_time": "1:57:43"} | |
| {"current_steps": 226, "total_steps": 1804, "loss": 1.2059407234191895, "lr": 4.932397882173307e-06, "epoch": 0.5011086474501109, "percentage": 12.53, "elapsed_time": "0:16:51", "remaining_time": "1:57:42"} | |
| {"current_steps": 228, "total_steps": 1804, "loss": 1.2553625106811523, "lr": 4.930375398633543e-06, "epoch": 0.5055432372505543, "percentage": 12.64, "elapsed_time": "0:17:01", "remaining_time": "1:57:42"} | |
| {"current_steps": 230, "total_steps": 1804, "loss": 1.8459392786026, "lr": 4.928323580789192e-06, "epoch": 0.5099778270509978, "percentage": 12.75, "elapsed_time": "0:17:11", "remaining_time": "1:57:38"} | |
| {"current_steps": 232, "total_steps": 1804, "loss": 0.8564022779464722, "lr": 4.926242456244973e-06, "epoch": 0.5144124168514412, "percentage": 12.86, "elapsed_time": "0:17:15", "remaining_time": "1:56:59"} | |
| {"current_steps": 234, "total_steps": 1804, "loss": 1.2610244750976562, "lr": 4.924132052999892e-06, "epoch": 0.5188470066518847, "percentage": 12.97, "elapsed_time": "0:17:26", "remaining_time": "1:57:01"} | |
| {"current_steps": 236, "total_steps": 1804, "loss": 0.9529590010643005, "lr": 4.921992399446861e-06, "epoch": 0.5232815964523282, "percentage": 13.08, "elapsed_time": "0:17:33", "remaining_time": "1:56:42"} | |
| {"current_steps": 238, "total_steps": 1804, "loss": 0.9686543941497803, "lr": 4.919823524372323e-06, "epoch": 0.5277161862527716, "percentage": 13.19, "elapsed_time": "0:17:41", "remaining_time": "1:56:22"} | |
| {"current_steps": 240, "total_steps": 1804, "loss": 1.258904218673706, "lr": 4.91762545695586e-06, "epoch": 0.532150776053215, "percentage": 13.3, "elapsed_time": "0:17:51", "remaining_time": "1:56:19"} | |
| {"current_steps": 242, "total_steps": 1804, "loss": 1.3248710632324219, "lr": 4.9153982267698e-06, "epoch": 0.5365853658536586, "percentage": 13.41, "elapsed_time": "0:18:01", "remaining_time": "1:56:19"} | |
| {"current_steps": 244, "total_steps": 1804, "loss": 0.9705762267112732, "lr": 4.913141863778822e-06, "epoch": 0.541019955654102, "percentage": 13.53, "elapsed_time": "0:18:11", "remaining_time": "1:56:16"} | |
| {"current_steps": 246, "total_steps": 1804, "loss": 1.2580170631408691, "lr": 4.910856398339553e-06, "epoch": 0.5454545454545454, "percentage": 13.64, "elapsed_time": "0:18:21", "remaining_time": "1:56:13"} | |
| {"current_steps": 248, "total_steps": 1804, "loss": 1.4803242683410645, "lr": 4.9085418612001545e-06, "epoch": 0.549889135254989, "percentage": 13.75, "elapsed_time": "0:18:31", "remaining_time": "1:56:10"} | |
| {"current_steps": 250, "total_steps": 1804, "loss": 1.2252423763275146, "lr": 4.906198283499916e-06, "epoch": 0.5543237250554324, "percentage": 13.86, "elapsed_time": "0:18:38", "remaining_time": "1:55:53"} | |
| {"current_steps": 252, "total_steps": 1804, "loss": 0.6616644263267517, "lr": 4.903825696768829e-06, "epoch": 0.5587583148558758, "percentage": 13.97, "elapsed_time": "0:18:45", "remaining_time": "1:55:29"} | |
| {"current_steps": 254, "total_steps": 1804, "loss": 1.474183440208435, "lr": 4.901424132927172e-06, "epoch": 0.5631929046563193, "percentage": 14.08, "elapsed_time": "0:18:55", "remaining_time": "1:55:26"} | |
| {"current_steps": 256, "total_steps": 1804, "loss": 1.316019892692566, "lr": 4.898993624285069e-06, "epoch": 0.5676274944567627, "percentage": 14.19, "elapsed_time": "0:19:05", "remaining_time": "1:55:24"} | |
| {"current_steps": 258, "total_steps": 1804, "loss": 1.2566733360290527, "lr": 4.896534203542062e-06, "epoch": 0.5720620842572062, "percentage": 14.3, "elapsed_time": "0:19:12", "remaining_time": "1:55:04"} | |
| {"current_steps": 260, "total_steps": 1804, "loss": 1.2865486145019531, "lr": 4.894045903786675e-06, "epoch": 0.5764966740576497, "percentage": 14.41, "elapsed_time": "0:19:20", "remaining_time": "1:54:54"} | |
| {"current_steps": 262, "total_steps": 1804, "loss": 0.7123095989227295, "lr": 4.891528758495961e-06, "epoch": 0.5809312638580931, "percentage": 14.52, "elapsed_time": "0:19:30", "remaining_time": "1:54:50"} | |
| {"current_steps": 264, "total_steps": 1804, "loss": 1.360097050666809, "lr": 4.888982801535053e-06, "epoch": 0.5853658536585366, "percentage": 14.63, "elapsed_time": "0:19:40", "remaining_time": "1:54:46"} | |
| {"current_steps": 266, "total_steps": 1804, "loss": 1.0533033609390259, "lr": 4.886408067156712e-06, "epoch": 0.5898004434589801, "percentage": 14.75, "elapsed_time": "0:19:46", "remaining_time": "1:54:23"} | |
| {"current_steps": 268, "total_steps": 1804, "loss": 1.550133228302002, "lr": 4.883804590000865e-06, "epoch": 0.5942350332594235, "percentage": 14.86, "elapsed_time": "0:19:56", "remaining_time": "1:54:19"} | |
| {"current_steps": 270, "total_steps": 1804, "loss": 1.1676595211029053, "lr": 4.881172405094138e-06, "epoch": 0.5986696230598669, "percentage": 14.97, "elapsed_time": "0:20:06", "remaining_time": "1:54:15"} | |
| {"current_steps": 272, "total_steps": 1804, "loss": 1.2335383892059326, "lr": 4.878511547849383e-06, "epoch": 0.6031042128603105, "percentage": 15.08, "elapsed_time": "0:20:16", "remaining_time": "1:54:10"} | |
| {"current_steps": 274, "total_steps": 1804, "loss": 1.2187210321426392, "lr": 4.875822054065203e-06, "epoch": 0.6075388026607539, "percentage": 15.19, "elapsed_time": "0:20:26", "remaining_time": "1:54:06"} | |
| {"current_steps": 276, "total_steps": 1804, "loss": 1.226876974105835, "lr": 4.8731039599254754e-06, "epoch": 0.6119733924611973, "percentage": 15.3, "elapsed_time": "0:20:36", "remaining_time": "1:54:03"} | |
| {"current_steps": 278, "total_steps": 1804, "loss": 1.205104112625122, "lr": 4.870357301998856e-06, "epoch": 0.6164079822616408, "percentage": 15.41, "elapsed_time": "0:20:46", "remaining_time": "1:54:00"} | |
| {"current_steps": 280, "total_steps": 1804, "loss": 1.287103295326233, "lr": 4.867582117238294e-06, "epoch": 0.6208425720620843, "percentage": 15.52, "elapsed_time": "0:20:56", "remaining_time": "1:53:56"} | |
| {"current_steps": 282, "total_steps": 1804, "loss": 0.8597289323806763, "lr": 4.864778442980532e-06, "epoch": 0.6252771618625277, "percentage": 15.63, "elapsed_time": "0:21:02", "remaining_time": "1:53:35"} | |
| {"current_steps": 284, "total_steps": 1804, "loss": 1.2538930177688599, "lr": 4.861946316945605e-06, "epoch": 0.6297117516629712, "percentage": 15.74, "elapsed_time": "0:21:14", "remaining_time": "1:53:40"} | |
| {"current_steps": 286, "total_steps": 1804, "loss": 1.330883264541626, "lr": 4.859085777236331e-06, "epoch": 0.6341463414634146, "percentage": 15.85, "elapsed_time": "0:21:24", "remaining_time": "1:53:39"} | |
| {"current_steps": 288, "total_steps": 1804, "loss": 1.252233862876892, "lr": 4.8561968623377985e-06, "epoch": 0.6385809312638581, "percentage": 15.96, "elapsed_time": "0:21:34", "remaining_time": "1:53:35"} | |
| {"current_steps": 290, "total_steps": 1804, "loss": 1.2359025478363037, "lr": 4.853279611116852e-06, "epoch": 0.6430155210643016, "percentage": 16.08, "elapsed_time": "0:21:44", "remaining_time": "1:53:29"} | |
| {"current_steps": 292, "total_steps": 1804, "loss": 1.3491101264953613, "lr": 4.850334062821566e-06, "epoch": 0.647450110864745, "percentage": 16.19, "elapsed_time": "0:21:54", "remaining_time": "1:53:24"} | |
| {"current_steps": 294, "total_steps": 1804, "loss": 0.9202826619148254, "lr": 4.8473602570807185e-06, "epoch": 0.6518847006651884, "percentage": 16.3, "elapsed_time": "0:22:03", "remaining_time": "1:53:19"} | |
| {"current_steps": 296, "total_steps": 1804, "loss": 0.9892662167549133, "lr": 4.844358233903254e-06, "epoch": 0.656319290465632, "percentage": 16.41, "elapsed_time": "0:22:13", "remaining_time": "1:53:16"} | |
| {"current_steps": 298, "total_steps": 1804, "loss": 1.2050740718841553, "lr": 4.841328033677753e-06, "epoch": 0.6607538802660754, "percentage": 16.52, "elapsed_time": "0:22:25", "remaining_time": "1:53:17"} | |
| {"current_steps": 300, "total_steps": 1804, "loss": 1.2547260522842407, "lr": 4.83826969717188e-06, "epoch": 0.6651884700665188, "percentage": 16.63, "elapsed_time": "0:22:34", "remaining_time": "1:53:08"} | |
| {"current_steps": 302, "total_steps": 1804, "loss": 1.1995564699172974, "lr": 4.835183265531843e-06, "epoch": 0.6696230598669624, "percentage": 16.74, "elapsed_time": "0:22:44", "remaining_time": "1:53:06"} | |
| {"current_steps": 304, "total_steps": 1804, "loss": 1.2650580406188965, "lr": 4.832068780281831e-06, "epoch": 0.6740576496674058, "percentage": 16.85, "elapsed_time": "0:22:54", "remaining_time": "1:53:01"} | |
| {"current_steps": 306, "total_steps": 1804, "loss": 1.2397799491882324, "lr": 4.828926283323464e-06, "epoch": 0.6784922394678492, "percentage": 16.96, "elapsed_time": "0:23:04", "remaining_time": "1:52:57"} | |
| {"current_steps": 308, "total_steps": 1804, "loss": 0.8677200675010681, "lr": 4.8257558169352254e-06, "epoch": 0.6829268292682927, "percentage": 17.07, "elapsed_time": "0:23:14", "remaining_time": "1:52:53"} | |
| {"current_steps": 310, "total_steps": 1804, "loss": 1.2440086603164673, "lr": 4.8225574237718906e-06, "epoch": 0.6873614190687362, "percentage": 17.18, "elapsed_time": "0:23:23", "remaining_time": "1:52:46"} | |
| {"current_steps": 312, "total_steps": 1804, "loss": 1.2062432765960693, "lr": 4.819331146863958e-06, "epoch": 0.6917960088691796, "percentage": 17.29, "elapsed_time": "0:23:34", "remaining_time": "1:52:43"} | |
| {"current_steps": 314, "total_steps": 1804, "loss": 1.2283505201339722, "lr": 4.8160770296170685e-06, "epoch": 0.6962305986696231, "percentage": 17.41, "elapsed_time": "0:23:44", "remaining_time": "1:52:39"} | |
| {"current_steps": 316, "total_steps": 1804, "loss": 1.3305617570877075, "lr": 4.812795115811419e-06, "epoch": 0.7006651884700665, "percentage": 17.52, "elapsed_time": "0:23:52", "remaining_time": "1:52:23"} | |
| {"current_steps": 318, "total_steps": 1804, "loss": 0.9975929856300354, "lr": 4.809485449601177e-06, "epoch": 0.70509977827051, "percentage": 17.63, "elapsed_time": "0:23:58", "remaining_time": "1:52:03"} | |
| {"current_steps": 320, "total_steps": 1804, "loss": 0.9766805768013, "lr": 4.806148075513883e-06, "epoch": 0.7095343680709535, "percentage": 17.74, "elapsed_time": "0:24:05", "remaining_time": "1:51:45"} | |
| {"current_steps": 322, "total_steps": 1804, "loss": 0.9863343834877014, "lr": 4.802783038449857e-06, "epoch": 0.7139689578713969, "percentage": 17.85, "elapsed_time": "0:24:15", "remaining_time": "1:51:38"} | |
| {"current_steps": 324, "total_steps": 1804, "loss": 1.0997380018234253, "lr": 4.799390383681587e-06, "epoch": 0.7184035476718403, "percentage": 17.96, "elapsed_time": "0:24:22", "remaining_time": "1:51:18"} | |
| {"current_steps": 326, "total_steps": 1804, "loss": 1.077536702156067, "lr": 4.795970156853124e-06, "epoch": 0.7228381374722838, "percentage": 18.07, "elapsed_time": "0:24:31", "remaining_time": "1:51:13"} | |
| {"current_steps": 328, "total_steps": 1804, "loss": 0.8034789562225342, "lr": 4.792522403979471e-06, "epoch": 0.7272727272727273, "percentage": 18.18, "elapsed_time": "0:24:38", "remaining_time": "1:50:55"} | |
| {"current_steps": 330, "total_steps": 1804, "loss": 0.8247131109237671, "lr": 4.789047171445957e-06, "epoch": 0.7317073170731707, "percentage": 18.29, "elapsed_time": "0:24:45", "remaining_time": "1:50:35"} | |
| {"current_steps": 332, "total_steps": 1804, "loss": 1.3770612478256226, "lr": 4.785544506007619e-06, "epoch": 0.7361419068736141, "percentage": 18.4, "elapsed_time": "0:24:55", "remaining_time": "1:50:28"} | |
| {"current_steps": 334, "total_steps": 1804, "loss": 1.1880348920822144, "lr": 4.782014454788566e-06, "epoch": 0.7405764966740577, "percentage": 18.51, "elapsed_time": "0:25:05", "remaining_time": "1:50:24"} | |
| {"current_steps": 336, "total_steps": 1804, "loss": 1.2278774976730347, "lr": 4.778457065281355e-06, "epoch": 0.7450110864745011, "percentage": 18.63, "elapsed_time": "0:25:15", "remaining_time": "1:50:19"} | |
| {"current_steps": 338, "total_steps": 1804, "loss": 0.6877051591873169, "lr": 4.774872385346345e-06, "epoch": 0.7494456762749445, "percentage": 18.74, "elapsed_time": "0:25:19", "remaining_time": "1:49:48"} | |
| {"current_steps": 340, "total_steps": 1804, "loss": 0.3212733566761017, "lr": 4.7712604632110524e-06, "epoch": 0.753880266075388, "percentage": 18.85, "elapsed_time": "0:25:22", "remaining_time": "1:49:15"} | |
| {"current_steps": 342, "total_steps": 1804, "loss": 0.8571130633354187, "lr": 4.767621347469506e-06, "epoch": 0.7583148558758315, "percentage": 18.96, "elapsed_time": "0:25:29", "remaining_time": "1:48:57"} | |
| {"current_steps": 344, "total_steps": 1804, "loss": 1.2813206911087036, "lr": 4.7639550870815895e-06, "epoch": 0.7627494456762749, "percentage": 19.07, "elapsed_time": "0:25:38", "remaining_time": "1:48:48"} | |
| {"current_steps": 346, "total_steps": 1804, "loss": 1.2372010946273804, "lr": 4.760261731372388e-06, "epoch": 0.7671840354767184, "percentage": 19.18, "elapsed_time": "0:25:48", "remaining_time": "1:48:44"} | |
| {"current_steps": 348, "total_steps": 1804, "loss": 1.0866163969039917, "lr": 4.75654133003152e-06, "epoch": 0.7716186252771619, "percentage": 19.29, "elapsed_time": "0:25:56", "remaining_time": "1:48:33"} | |
| {"current_steps": 350, "total_steps": 1804, "loss": 1.2248578071594238, "lr": 4.752793933112469e-06, "epoch": 0.7760532150776053, "percentage": 19.4, "elapsed_time": "0:26:06", "remaining_time": "1:48:28"} | |
| {"current_steps": 352, "total_steps": 1804, "loss": 1.144696831703186, "lr": 4.749019591031914e-06, "epoch": 0.7804878048780488, "percentage": 19.51, "elapsed_time": "0:26:16", "remaining_time": "1:48:23"} | |
| {"current_steps": 354, "total_steps": 1804, "loss": 1.0152348279953003, "lr": 4.745218354569045e-06, "epoch": 0.7849223946784922, "percentage": 19.62, "elapsed_time": "0:26:26", "remaining_time": "1:48:18"} | |
| {"current_steps": 356, "total_steps": 1804, "loss": 1.1802353858947754, "lr": 4.741390274864885e-06, "epoch": 0.7893569844789357, "percentage": 19.73, "elapsed_time": "0:26:35", "remaining_time": "1:48:11"} | |
| {"current_steps": 358, "total_steps": 1804, "loss": 1.2564811706542969, "lr": 4.737535403421601e-06, "epoch": 0.7937915742793792, "percentage": 19.84, "elapsed_time": "0:26:45", "remaining_time": "1:48:05"} | |
| {"current_steps": 360, "total_steps": 1804, "loss": 1.2239693403244019, "lr": 4.733653792101809e-06, "epoch": 0.7982261640798226, "percentage": 19.96, "elapsed_time": "0:26:55", "remaining_time": "1:48:00"} | |
| {"current_steps": 362, "total_steps": 1804, "loss": 0.5834329724311829, "lr": 4.729745493127878e-06, "epoch": 0.802660753880266, "percentage": 20.07, "elapsed_time": "0:27:01", "remaining_time": "1:47:39"} | |
| {"current_steps": 364, "total_steps": 1804, "loss": 1.324857234954834, "lr": 4.725810559081227e-06, "epoch": 0.8070953436807096, "percentage": 20.18, "elapsed_time": "0:27:11", "remaining_time": "1:47:33"} | |
| {"current_steps": 366, "total_steps": 1804, "loss": 1.2007935047149658, "lr": 4.7218490429016175e-06, "epoch": 0.811529933481153, "percentage": 20.29, "elapsed_time": "0:27:21", "remaining_time": "1:47:29"} | |
| {"current_steps": 368, "total_steps": 1804, "loss": 0.9981905817985535, "lr": 4.717860997886442e-06, "epoch": 0.8159645232815964, "percentage": 20.4, "elapsed_time": "0:27:30", "remaining_time": "1:47:21"} | |
| {"current_steps": 370, "total_steps": 1804, "loss": 0.8222334384918213, "lr": 4.713846477690005e-06, "epoch": 0.8203991130820399, "percentage": 20.51, "elapsed_time": "0:27:37", "remaining_time": "1:47:03"} | |
| {"current_steps": 372, "total_steps": 1804, "loss": 1.22462797164917, "lr": 4.709805536322804e-06, "epoch": 0.8248337028824834, "percentage": 20.62, "elapsed_time": "0:27:47", "remaining_time": "1:46:59"} | |
| {"current_steps": 374, "total_steps": 1804, "loss": 1.2273211479187012, "lr": 4.7057382281508e-06, "epoch": 0.8292682926829268, "percentage": 20.73, "elapsed_time": "0:27:57", "remaining_time": "1:46:54"} | |
| {"current_steps": 376, "total_steps": 1804, "loss": 1.1893556118011475, "lr": 4.701644607894687e-06, "epoch": 0.8337028824833703, "percentage": 20.84, "elapsed_time": "0:28:06", "remaining_time": "1:46:44"} | |
| {"current_steps": 378, "total_steps": 1804, "loss": 1.215989589691162, "lr": 4.697524730629159e-06, "epoch": 0.8381374722838137, "percentage": 20.95, "elapsed_time": "0:28:11", "remaining_time": "1:46:19"} | |
| {"current_steps": 380, "total_steps": 1804, "loss": 0.7287262082099915, "lr": 4.693378651782162e-06, "epoch": 0.8425720620842572, "percentage": 21.06, "elapsed_time": "0:28:17", "remaining_time": "1:46:02"} | |
| {"current_steps": 382, "total_steps": 1804, "loss": 1.2984516620635986, "lr": 4.689206427134155e-06, "epoch": 0.8470066518847007, "percentage": 21.18, "elapsed_time": "0:28:25", "remaining_time": "1:45:49"} | |
| {"current_steps": 384, "total_steps": 1804, "loss": 1.1311615705490112, "lr": 4.6850081128173595e-06, "epoch": 0.8514412416851441, "percentage": 21.29, "elapsed_time": "0:28:34", "remaining_time": "1:45:40"} | |
| {"current_steps": 386, "total_steps": 1804, "loss": 1.246683955192566, "lr": 4.680783765314994e-06, "epoch": 0.8558758314855875, "percentage": 21.4, "elapsed_time": "0:28:44", "remaining_time": "1:45:34"} | |
| {"current_steps": 388, "total_steps": 1804, "loss": 1.1455856561660767, "lr": 4.6765334414605315e-06, "epoch": 0.8603104212860311, "percentage": 21.51, "elapsed_time": "0:28:51", "remaining_time": "1:45:20"} | |
| {"current_steps": 390, "total_steps": 1804, "loss": 1.2500605583190918, "lr": 4.672257198436918e-06, "epoch": 0.8647450110864745, "percentage": 21.62, "elapsed_time": "0:29:02", "remaining_time": "1:45:17"} | |
| {"current_steps": 392, "total_steps": 1804, "loss": 0.9110448360443115, "lr": 4.667955093775814e-06, "epoch": 0.8691796008869179, "percentage": 21.73, "elapsed_time": "0:29:09", "remaining_time": "1:45:00"} | |
| {"current_steps": 394, "total_steps": 1804, "loss": 1.2182717323303223, "lr": 4.663627185356818e-06, "epoch": 0.8736141906873615, "percentage": 21.84, "elapsed_time": "0:29:18", "remaining_time": "1:44:53"} | |
| {"current_steps": 396, "total_steps": 1804, "loss": 1.2174220085144043, "lr": 4.65927353140668e-06, "epoch": 0.8780487804878049, "percentage": 21.95, "elapsed_time": "0:29:28", "remaining_time": "1:44:48"} | |
| {"current_steps": 398, "total_steps": 1804, "loss": 1.210001826286316, "lr": 4.654894190498534e-06, "epoch": 0.8824833702882483, "percentage": 22.06, "elapsed_time": "0:29:38", "remaining_time": "1:44:43"} | |
| {"current_steps": 400, "total_steps": 1804, "loss": 0.45290517807006836, "lr": 4.650489221551095e-06, "epoch": 0.8869179600886918, "percentage": 22.17, "elapsed_time": "0:29:42", "remaining_time": "1:44:15"} | |
| {"current_steps": 402, "total_steps": 1804, "loss": 1.045676350593567, "lr": 4.646058683827874e-06, "epoch": 0.8913525498891353, "percentage": 22.28, "elapsed_time": "0:29:51", "remaining_time": "1:44:09"} | |
| {"current_steps": 404, "total_steps": 1804, "loss": 0.9281713366508484, "lr": 4.641602636936378e-06, "epoch": 0.8957871396895787, "percentage": 22.39, "elapsed_time": "0:29:59", "remaining_time": "1:43:54"} | |
| {"current_steps": 406, "total_steps": 1804, "loss": 1.2655534744262695, "lr": 4.637121140827311e-06, "epoch": 0.9002217294900222, "percentage": 22.51, "elapsed_time": "0:30:08", "remaining_time": "1:43:47"} | |
| {"current_steps": 408, "total_steps": 1804, "loss": 1.1325139999389648, "lr": 4.632614255793762e-06, "epoch": 0.9046563192904656, "percentage": 22.62, "elapsed_time": "0:30:18", "remaining_time": "1:43:41"} | |
| {"current_steps": 410, "total_steps": 1804, "loss": 1.069734811782837, "lr": 4.6280820424704e-06, "epoch": 0.9090909090909091, "percentage": 22.73, "elapsed_time": "0:30:22", "remaining_time": "1:43:17"} | |
| {"current_steps": 412, "total_steps": 1804, "loss": 1.2320454120635986, "lr": 4.623524561832653e-06, "epoch": 0.9135254988913526, "percentage": 22.84, "elapsed_time": "0:30:32", "remaining_time": "1:43:10"} | |
| {"current_steps": 414, "total_steps": 1804, "loss": 1.2557101249694824, "lr": 4.618941875195893e-06, "epoch": 0.917960088691796, "percentage": 22.95, "elapsed_time": "0:30:42", "remaining_time": "1:43:06"} | |
| {"current_steps": 416, "total_steps": 1804, "loss": 0.9693298935890198, "lr": 4.614334044214606e-06, "epoch": 0.9223946784922394, "percentage": 23.06, "elapsed_time": "0:30:52", "remaining_time": "1:43:00"} | |
| {"current_steps": 418, "total_steps": 1804, "loss": 1.2573974132537842, "lr": 4.6097011308815645e-06, "epoch": 0.926829268292683, "percentage": 23.17, "elapsed_time": "0:31:01", "remaining_time": "1:42:52"} | |
| {"current_steps": 420, "total_steps": 1804, "loss": 0.7741899490356445, "lr": 4.605043197526996e-06, "epoch": 0.9312638580931264, "percentage": 23.28, "elapsed_time": "0:31:11", "remaining_time": "1:42:47"} | |
| {"current_steps": 422, "total_steps": 1804, "loss": 1.45621657371521, "lr": 4.600360306817738e-06, "epoch": 0.9356984478935698, "percentage": 23.39, "elapsed_time": "0:31:21", "remaining_time": "1:42:40"} | |
| {"current_steps": 424, "total_steps": 1804, "loss": 1.0499638319015503, "lr": 4.595652521756403e-06, "epoch": 0.9401330376940134, "percentage": 23.5, "elapsed_time": "0:31:27", "remaining_time": "1:42:24"} | |
| {"current_steps": 426, "total_steps": 1804, "loss": 1.202383041381836, "lr": 4.590919905680524e-06, "epoch": 0.9445676274944568, "percentage": 23.61, "elapsed_time": "0:31:37", "remaining_time": "1:42:17"} | |
| {"current_steps": 428, "total_steps": 1804, "loss": 1.0747066736221313, "lr": 4.5861625222617065e-06, "epoch": 0.9490022172949002, "percentage": 23.73, "elapsed_time": "0:31:45", "remaining_time": "1:42:04"} | |
| {"current_steps": 430, "total_steps": 1804, "loss": 0.6646397709846497, "lr": 4.58138043550477e-06, "epoch": 0.9534368070953437, "percentage": 23.84, "elapsed_time": "0:31:53", "remaining_time": "1:41:54"} | |
| {"current_steps": 432, "total_steps": 1804, "loss": 1.2203632593154907, "lr": 4.576573709746887e-06, "epoch": 0.9578713968957872, "percentage": 23.95, "elapsed_time": "0:32:03", "remaining_time": "1:41:48"} | |
| {"current_steps": 434, "total_steps": 1804, "loss": 1.0556832551956177, "lr": 4.5717424096567205e-06, "epoch": 0.9623059866962306, "percentage": 24.06, "elapsed_time": "0:32:08", "remaining_time": "1:41:26"} | |
| {"current_steps": 436, "total_steps": 1804, "loss": 1.2815701961517334, "lr": 4.566886600233547e-06, "epoch": 0.9667405764966741, "percentage": 24.17, "elapsed_time": "0:32:18", "remaining_time": "1:41:21"} | |
| {"current_steps": 438, "total_steps": 1804, "loss": 1.233439564704895, "lr": 4.56200634680639e-06, "epoch": 0.9711751662971175, "percentage": 24.28, "elapsed_time": "0:32:28", "remaining_time": "1:41:16"} | |
| {"current_steps": 440, "total_steps": 1804, "loss": 0.7945879697799683, "lr": 4.557101715033136e-06, "epoch": 0.975609756097561, "percentage": 24.39, "elapsed_time": "0:32:35", "remaining_time": "1:41:02"} | |
| {"current_steps": 442, "total_steps": 1804, "loss": 1.0334054231643677, "lr": 4.552172770899652e-06, "epoch": 0.9800443458980045, "percentage": 24.5, "elapsed_time": "0:32:42", "remaining_time": "1:40:48"} | |
| {"current_steps": 444, "total_steps": 1804, "loss": 1.2736181020736694, "lr": 4.547219580718899e-06, "epoch": 0.9844789356984479, "percentage": 24.61, "elapsed_time": "0:32:51", "remaining_time": "1:40:39"} | |
| {"current_steps": 446, "total_steps": 1804, "loss": 1.2092833518981934, "lr": 4.542242211130039e-06, "epoch": 0.9889135254988913, "percentage": 24.72, "elapsed_time": "0:33:01", "remaining_time": "1:40:33"} | |
| {"current_steps": 448, "total_steps": 1804, "loss": 1.2434141635894775, "lr": 4.537240729097539e-06, "epoch": 0.9933481152993349, "percentage": 24.83, "elapsed_time": "0:33:11", "remaining_time": "1:40:28"} | |
| {"current_steps": 450, "total_steps": 1804, "loss": 1.0325958728790283, "lr": 4.532215201910269e-06, "epoch": 0.9977827050997783, "percentage": 24.94, "elapsed_time": "0:33:21", "remaining_time": "1:40:22"} | |
| {"current_steps": 452, "total_steps": 1804, "loss": 1.0579339265823364, "lr": 4.527165697180598e-06, "epoch": 1.0022172949002217, "percentage": 25.06, "elapsed_time": "0:33:28", "remaining_time": "1:40:09"} | |
| {"current_steps": 454, "total_steps": 1804, "loss": 1.3362075090408325, "lr": 4.522092282843481e-06, "epoch": 1.0066518847006651, "percentage": 25.17, "elapsed_time": "0:33:38", "remaining_time": "1:40:02"} | |
| {"current_steps": 456, "total_steps": 1804, "loss": 1.2156652212142944, "lr": 4.516995027155554e-06, "epoch": 1.0110864745011086, "percentage": 25.28, "elapsed_time": "0:33:48", "remaining_time": "1:39:56"} | |
| {"current_steps": 458, "total_steps": 1804, "loss": 0.8079202175140381, "lr": 4.511873998694204e-06, "epoch": 1.0155210643015522, "percentage": 25.39, "elapsed_time": "0:33:57", "remaining_time": "1:39:47"} | |
| {"current_steps": 460, "total_steps": 1804, "loss": 0.8377529978752136, "lr": 4.506729266356651e-06, "epoch": 1.0199556541019956, "percentage": 25.5, "elapsed_time": "0:34:04", "remaining_time": "1:39:34"} | |
| {"current_steps": 462, "total_steps": 1804, "loss": 0.5483433604240417, "lr": 4.5015608993590276e-06, "epoch": 1.024390243902439, "percentage": 25.61, "elapsed_time": "0:34:10", "remaining_time": "1:39:17"} | |
| {"current_steps": 464, "total_steps": 1804, "loss": 0.9224250316619873, "lr": 4.4963689672354375e-06, "epoch": 1.0288248337028825, "percentage": 25.72, "elapsed_time": "0:34:20", "remaining_time": "1:39:11"} | |
| {"current_steps": 466, "total_steps": 1804, "loss": 0.7165282964706421, "lr": 4.491153539837026e-06, "epoch": 1.033259423503326, "percentage": 25.83, "elapsed_time": "0:34:29", "remaining_time": "1:39:02"} | |
| {"current_steps": 468, "total_steps": 1804, "loss": 1.0422555208206177, "lr": 4.4859146873310375e-06, "epoch": 1.0376940133037693, "percentage": 25.94, "elapsed_time": "0:34:39", "remaining_time": "1:38:56"} | |
| {"current_steps": 470, "total_steps": 1804, "loss": 0.6359959244728088, "lr": 4.480652480199873e-06, "epoch": 1.042128603104213, "percentage": 26.05, "elapsed_time": "0:34:46", "remaining_time": "1:38:41"} | |
| {"current_steps": 472, "total_steps": 1804, "loss": 0.9994376301765442, "lr": 4.475366989240147e-06, "epoch": 1.0465631929046564, "percentage": 26.16, "elapsed_time": "0:34:52", "remaining_time": "1:38:25"} | |
| {"current_steps": 474, "total_steps": 1804, "loss": 1.0376862287521362, "lr": 4.470058285561721e-06, "epoch": 1.0509977827050998, "percentage": 26.27, "elapsed_time": "0:35:02", "remaining_time": "1:38:20"} | |
| {"current_steps": 476, "total_steps": 1804, "loss": 1.076725959777832, "lr": 4.464726440586761e-06, "epoch": 1.0554323725055432, "percentage": 26.39, "elapsed_time": "0:35:12", "remaining_time": "1:38:14"} | |
| {"current_steps": 478, "total_steps": 1804, "loss": 1.1006007194519043, "lr": 4.45937152604877e-06, "epoch": 1.0598669623059866, "percentage": 26.5, "elapsed_time": "0:35:22", "remaining_time": "1:38:07"} | |
| {"current_steps": 480, "total_steps": 1804, "loss": 0.6436704397201538, "lr": 4.453993613991622e-06, "epoch": 1.06430155210643, "percentage": 26.61, "elapsed_time": "0:35:30", "remaining_time": "1:37:56"} | |
| {"current_steps": 482, "total_steps": 1804, "loss": 1.153225064277649, "lr": 4.4485927767685995e-06, "epoch": 1.0687361419068737, "percentage": 26.72, "elapsed_time": "0:35:40", "remaining_time": "1:37:51"} | |
| {"current_steps": 484, "total_steps": 1804, "loss": 0.9155857563018799, "lr": 4.443169087041409e-06, "epoch": 1.0731707317073171, "percentage": 26.83, "elapsed_time": "0:35:47", "remaining_time": "1:37:36"} | |
| {"current_steps": 486, "total_steps": 1804, "loss": 1.0506926774978638, "lr": 4.4377226177792145e-06, "epoch": 1.0776053215077606, "percentage": 26.94, "elapsed_time": "0:35:57", "remaining_time": "1:37:30"} | |
| {"current_steps": 488, "total_steps": 1804, "loss": 0.7985799312591553, "lr": 4.432253442257649e-06, "epoch": 1.082039911308204, "percentage": 27.05, "elapsed_time": "0:36:07", "remaining_time": "1:37:24"} | |
| {"current_steps": 490, "total_steps": 1804, "loss": 0.9610664248466492, "lr": 4.426761634057831e-06, "epoch": 1.0864745011086474, "percentage": 27.16, "elapsed_time": "0:36:17", "remaining_time": "1:37:18"} | |
| {"current_steps": 492, "total_steps": 1804, "loss": 1.1698113679885864, "lr": 4.421247267065375e-06, "epoch": 1.0909090909090908, "percentage": 27.27, "elapsed_time": "0:36:27", "remaining_time": "1:37:12"} | |
| {"current_steps": 494, "total_steps": 1804, "loss": 1.0748765468597412, "lr": 4.415710415469394e-06, "epoch": 1.0953436807095343, "percentage": 27.38, "elapsed_time": "0:36:36", "remaining_time": "1:37:05"} | |
| {"current_steps": 496, "total_steps": 1804, "loss": 0.8963067531585693, "lr": 4.410151153761506e-06, "epoch": 1.099778270509978, "percentage": 27.49, "elapsed_time": "0:36:46", "remaining_time": "1:36:58"} | |
| {"current_steps": 498, "total_steps": 1804, "loss": 1.023295521736145, "lr": 4.404569556734832e-06, "epoch": 1.1042128603104213, "percentage": 27.61, "elapsed_time": "0:36:56", "remaining_time": "1:36:52"} | |
| {"current_steps": 500, "total_steps": 1804, "loss": 0.9625403881072998, "lr": 4.398965699482984e-06, "epoch": 1.1086474501108647, "percentage": 27.72, "elapsed_time": "0:37:03", "remaining_time": "1:36:38"} | |
| {"current_steps": 502, "total_steps": 1804, "loss": 1.0802518129348755, "lr": 4.39333965739906e-06, "epoch": 1.1130820399113082, "percentage": 27.83, "elapsed_time": "0:37:13", "remaining_time": "1:36:31"} | |
| {"current_steps": 504, "total_steps": 1804, "loss": 0.9319751262664795, "lr": 4.3876915061746275e-06, "epoch": 1.1175166297117516, "percentage": 27.94, "elapsed_time": "0:37:22", "remaining_time": "1:36:23"} | |
| {"current_steps": 506, "total_steps": 1804, "loss": 1.0706431865692139, "lr": 4.382021321798707e-06, "epoch": 1.1219512195121952, "percentage": 28.05, "elapsed_time": "0:37:31", "remaining_time": "1:36:16"} | |
| {"current_steps": 508, "total_steps": 1804, "loss": 0.790678858757019, "lr": 4.376329180556745e-06, "epoch": 1.1263858093126387, "percentage": 28.16, "elapsed_time": "0:37:38", "remaining_time": "1:36:02"} | |
| {"current_steps": 510, "total_steps": 1804, "loss": 1.1257532835006714, "lr": 4.370615159029594e-06, "epoch": 1.130820399113082, "percentage": 28.27, "elapsed_time": "0:37:48", "remaining_time": "1:35:57"} | |
| {"current_steps": 512, "total_steps": 1804, "loss": 0.5582272410392761, "lr": 4.36487933409248e-06, "epoch": 1.1352549889135255, "percentage": 28.38, "elapsed_time": "0:37:58", "remaining_time": "1:35:48"} | |
| {"current_steps": 514, "total_steps": 1804, "loss": 0.541851282119751, "lr": 4.359121782913964e-06, "epoch": 1.139689578713969, "percentage": 28.49, "elapsed_time": "0:38:02", "remaining_time": "1:35:28"} | |
| {"current_steps": 516, "total_steps": 1804, "loss": 1.0328795909881592, "lr": 4.3533425829549085e-06, "epoch": 1.1441241685144123, "percentage": 28.6, "elapsed_time": "0:38:09", "remaining_time": "1:35:15"} | |
| {"current_steps": 518, "total_steps": 1804, "loss": 1.163663625717163, "lr": 4.347541811967436e-06, "epoch": 1.1485587583148558, "percentage": 28.71, "elapsed_time": "0:38:19", "remaining_time": "1:35:08"} | |
| {"current_steps": 520, "total_steps": 1804, "loss": 1.1469664573669434, "lr": 4.341719547993879e-06, "epoch": 1.1529933481152994, "percentage": 28.82, "elapsed_time": "0:38:30", "remaining_time": "1:35:05"} | |
| {"current_steps": 522, "total_steps": 1804, "loss": 0.5811082124710083, "lr": 4.335875869365732e-06, "epoch": 1.1574279379157428, "percentage": 28.94, "elapsed_time": "0:38:37", "remaining_time": "1:34:51"} | |
| {"current_steps": 524, "total_steps": 1804, "loss": 1.0397084951400757, "lr": 4.330010854702598e-06, "epoch": 1.1618625277161863, "percentage": 29.05, "elapsed_time": "0:38:47", "remaining_time": "1:34:45"} | |
| {"current_steps": 526, "total_steps": 1804, "loss": 1.1979455947875977, "lr": 4.3241245829111324e-06, "epoch": 1.1662971175166297, "percentage": 29.16, "elapsed_time": "0:38:57", "remaining_time": "1:34:39"} | |
| {"current_steps": 528, "total_steps": 1804, "loss": 0.6363497376441956, "lr": 4.318217133183978e-06, "epoch": 1.170731707317073, "percentage": 29.27, "elapsed_time": "0:39:03", "remaining_time": "1:34:24"} | |
| {"current_steps": 530, "total_steps": 1804, "loss": 0.8744889497756958, "lr": 4.312288584998697e-06, "epoch": 1.1751662971175167, "percentage": 29.38, "elapsed_time": "0:39:13", "remaining_time": "1:34:17"} | |
| {"current_steps": 532, "total_steps": 1804, "loss": 0.96029132604599, "lr": 4.306339018116714e-06, "epoch": 1.1796008869179602, "percentage": 29.49, "elapsed_time": "0:39:21", "remaining_time": "1:34:06"} | |
| {"current_steps": 534, "total_steps": 1804, "loss": 1.0959749221801758, "lr": 4.300368512582227e-06, "epoch": 1.1840354767184036, "percentage": 29.6, "elapsed_time": "0:39:31", "remaining_time": "1:34:00"} | |
| {"current_steps": 536, "total_steps": 1804, "loss": 0.82485431432724, "lr": 4.294377148721144e-06, "epoch": 1.188470066518847, "percentage": 29.71, "elapsed_time": "0:39:38", "remaining_time": "1:33:47"} | |
| {"current_steps": 538, "total_steps": 1804, "loss": 1.1424366235733032, "lr": 4.288365007139991e-06, "epoch": 1.1929046563192904, "percentage": 29.82, "elapsed_time": "0:39:48", "remaining_time": "1:33:41"} | |
| {"current_steps": 540, "total_steps": 1804, "loss": 0.6629378795623779, "lr": 4.2823321687248386e-06, "epoch": 1.1973392461197339, "percentage": 29.93, "elapsed_time": "0:39:54", "remaining_time": "1:33:24"} | |
| {"current_steps": 542, "total_steps": 1804, "loss": 0.6427868008613586, "lr": 4.276278714640203e-06, "epoch": 1.2017738359201773, "percentage": 30.04, "elapsed_time": "0:40:03", "remaining_time": "1:33:16"} | |
| {"current_steps": 544, "total_steps": 1804, "loss": 1.1916182041168213, "lr": 4.270204726327963e-06, "epoch": 1.206208425720621, "percentage": 30.16, "elapsed_time": "0:40:12", "remaining_time": "1:33:08"} | |
| {"current_steps": 546, "total_steps": 1804, "loss": 0.8375392556190491, "lr": 4.264110285506259e-06, "epoch": 1.2106430155210643, "percentage": 30.27, "elapsed_time": "0:40:20", "remaining_time": "1:32:56"} | |
| {"current_steps": 548, "total_steps": 1804, "loss": 1.216051459312439, "lr": 4.257995474168395e-06, "epoch": 1.2150776053215078, "percentage": 30.38, "elapsed_time": "0:40:30", "remaining_time": "1:32:49"} | |
| {"current_steps": 550, "total_steps": 1804, "loss": 0.598793625831604, "lr": 4.251860374581736e-06, "epoch": 1.2195121951219512, "percentage": 30.49, "elapsed_time": "0:40:38", "remaining_time": "1:32:39"} | |
| {"current_steps": 552, "total_steps": 1804, "loss": 1.076664686203003, "lr": 4.245705069286601e-06, "epoch": 1.2239467849223946, "percentage": 30.6, "elapsed_time": "0:40:48", "remaining_time": "1:32:33"} | |
| {"current_steps": 554, "total_steps": 1804, "loss": 0.7446794509887695, "lr": 4.239529641095149e-06, "epoch": 1.2283813747228383, "percentage": 30.71, "elapsed_time": "0:40:58", "remaining_time": "1:32:27"} | |
| {"current_steps": 556, "total_steps": 1804, "loss": 0.7689218521118164, "lr": 4.233334173090274e-06, "epoch": 1.2328159645232817, "percentage": 30.82, "elapsed_time": "0:41:05", "remaining_time": "1:32:14"} | |
| {"current_steps": 558, "total_steps": 1804, "loss": 0.6857209205627441, "lr": 4.227118748624478e-06, "epoch": 1.237250554323725, "percentage": 30.93, "elapsed_time": "0:41:14", "remaining_time": "1:32:05"} | |
| {"current_steps": 560, "total_steps": 1804, "loss": 1.3058849573135376, "lr": 4.220883451318753e-06, "epoch": 1.2416851441241685, "percentage": 31.04, "elapsed_time": "0:41:24", "remaining_time": "1:31:59"} | |
| {"current_steps": 562, "total_steps": 1804, "loss": 0.9943916201591492, "lr": 4.2146283650614545e-06, "epoch": 1.246119733924612, "percentage": 31.15, "elapsed_time": "0:41:32", "remaining_time": "1:31:47"} | |
| {"current_steps": 564, "total_steps": 1804, "loss": 0.6252534985542297, "lr": 4.208353574007179e-06, "epoch": 1.2505543237250554, "percentage": 31.26, "elapsed_time": "0:41:39", "remaining_time": "1:31:34"} | |
| {"current_steps": 566, "total_steps": 1804, "loss": 0.9947891235351562, "lr": 4.202059162575622e-06, "epoch": 1.2549889135254988, "percentage": 31.37, "elapsed_time": "0:41:48", "remaining_time": "1:31:27"} | |
| {"current_steps": 568, "total_steps": 1804, "loss": 0.9397783279418945, "lr": 4.195745215450451e-06, "epoch": 1.2594235033259422, "percentage": 31.49, "elapsed_time": "0:41:57", "remaining_time": "1:31:18"} | |
| {"current_steps": 570, "total_steps": 1804, "loss": 1.136919379234314, "lr": 4.189411817578159e-06, "epoch": 1.2638580931263859, "percentage": 31.6, "elapsed_time": "0:42:06", "remaining_time": "1:31:09"} | |
| {"current_steps": 572, "total_steps": 1804, "loss": 0.7842409610748291, "lr": 4.1830590541669304e-06, "epoch": 1.2682926829268293, "percentage": 31.71, "elapsed_time": "0:42:16", "remaining_time": "1:31:02"} | |
| {"current_steps": 574, "total_steps": 1804, "loss": 1.3783859014511108, "lr": 4.176687010685484e-06, "epoch": 1.2727272727272727, "percentage": 31.82, "elapsed_time": "0:42:24", "remaining_time": "1:30:51"} | |
| {"current_steps": 576, "total_steps": 1804, "loss": 1.0433826446533203, "lr": 4.170295772861931e-06, "epoch": 1.2771618625277161, "percentage": 31.93, "elapsed_time": "0:42:34", "remaining_time": "1:30:45"} | |
| {"current_steps": 578, "total_steps": 1804, "loss": 1.1605374813079834, "lr": 4.163885426682619e-06, "epoch": 1.2815964523281598, "percentage": 32.04, "elapsed_time": "0:42:44", "remaining_time": "1:30:38"} | |
| {"current_steps": 580, "total_steps": 1804, "loss": 1.121392011642456, "lr": 4.157456058390977e-06, "epoch": 1.2860310421286032, "percentage": 32.15, "elapsed_time": "0:42:53", "remaining_time": "1:30:31"} | |
| {"current_steps": 582, "total_steps": 1804, "loss": 1.3379415273666382, "lr": 4.151007754486351e-06, "epoch": 1.2904656319290466, "percentage": 32.26, "elapsed_time": "0:43:03", "remaining_time": "1:30:24"} | |
| {"current_steps": 584, "total_steps": 1804, "loss": 0.7737810015678406, "lr": 4.144540601722843e-06, "epoch": 1.29490022172949, "percentage": 32.37, "elapsed_time": "0:43:10", "remaining_time": "1:30:11"} | |
| {"current_steps": 586, "total_steps": 1804, "loss": 0.8205963373184204, "lr": 4.138054687108143e-06, "epoch": 1.2993348115299335, "percentage": 32.48, "elapsed_time": "0:43:17", "remaining_time": "1:29:59"} | |
| {"current_steps": 588, "total_steps": 1804, "loss": 0.47316789627075195, "lr": 4.131550097902361e-06, "epoch": 1.3037694013303769, "percentage": 32.59, "elapsed_time": "0:43:24", "remaining_time": "1:29:45"} | |
| {"current_steps": 590, "total_steps": 1804, "loss": 0.9959248900413513, "lr": 4.125026921616852e-06, "epoch": 1.3082039911308203, "percentage": 32.71, "elapsed_time": "0:43:34", "remaining_time": "1:29:38"} | |
| {"current_steps": 592, "total_steps": 1804, "loss": 1.1072802543640137, "lr": 4.118485246013031e-06, "epoch": 1.3126385809312637, "percentage": 32.82, "elapsed_time": "0:43:44", "remaining_time": "1:29:32"} | |
| {"current_steps": 594, "total_steps": 1804, "loss": 1.0438408851623535, "lr": 4.111925159101208e-06, "epoch": 1.3170731707317074, "percentage": 32.93, "elapsed_time": "0:43:54", "remaining_time": "1:29:26"} | |
| {"current_steps": 596, "total_steps": 1804, "loss": 1.0767079591751099, "lr": 4.1053467491393864e-06, "epoch": 1.3215077605321508, "percentage": 33.04, "elapsed_time": "0:44:03", "remaining_time": "1:29:17"} | |
| {"current_steps": 598, "total_steps": 1804, "loss": 1.160161018371582, "lr": 4.098750104632091e-06, "epoch": 1.3259423503325942, "percentage": 33.15, "elapsed_time": "0:44:13", "remaining_time": "1:29:10"} | |
| {"current_steps": 600, "total_steps": 1804, "loss": 0.47440510988235474, "lr": 4.092135314329165e-06, "epoch": 1.3303769401330376, "percentage": 33.26, "elapsed_time": "0:44:19", "remaining_time": "1:28:57"} | |
| {"current_steps": 602, "total_steps": 1804, "loss": 1.0348572731018066, "lr": 4.085502467224583e-06, "epoch": 1.3348115299334813, "percentage": 33.37, "elapsed_time": "0:44:26", "remaining_time": "1:28:43"} | |
| {"current_steps": 604, "total_steps": 1804, "loss": 0.8366844654083252, "lr": 4.078851652555254e-06, "epoch": 1.3392461197339247, "percentage": 33.48, "elapsed_time": "0:44:35", "remaining_time": "1:28:36"} | |
| {"current_steps": 606, "total_steps": 1804, "loss": 0.782626211643219, "lr": 4.072182959799816e-06, "epoch": 1.3436807095343681, "percentage": 33.59, "elapsed_time": "0:44:42", "remaining_time": "1:28:23"} | |
| {"current_steps": 608, "total_steps": 1804, "loss": 1.113935112953186, "lr": 4.065496478677436e-06, "epoch": 1.3481152993348116, "percentage": 33.7, "elapsed_time": "0:44:52", "remaining_time": "1:28:17"} | |
| {"current_steps": 610, "total_steps": 1804, "loss": 1.1273419857025146, "lr": 4.058792299146602e-06, "epoch": 1.352549889135255, "percentage": 33.81, "elapsed_time": "0:45:02", "remaining_time": "1:28:09"} | |
| {"current_steps": 612, "total_steps": 1804, "loss": 0.7277010679244995, "lr": 4.052070511403912e-06, "epoch": 1.3569844789356984, "percentage": 33.92, "elapsed_time": "0:45:09", "remaining_time": "1:27:56"} | |
| {"current_steps": 614, "total_steps": 1804, "loss": 1.1359970569610596, "lr": 4.045331205882863e-06, "epoch": 1.3614190687361418, "percentage": 34.04, "elapsed_time": "0:45:19", "remaining_time": "1:27:50"} | |
| {"current_steps": 616, "total_steps": 1804, "loss": 0.7449517846107483, "lr": 4.038574473252629e-06, "epoch": 1.3658536585365852, "percentage": 34.15, "elapsed_time": "0:45:26", "remaining_time": "1:27:37"} | |
| {"current_steps": 618, "total_steps": 1804, "loss": 1.0706074237823486, "lr": 4.031800404416849e-06, "epoch": 1.370288248337029, "percentage": 34.26, "elapsed_time": "0:45:33", "remaining_time": "1:27:26"} | |
| {"current_steps": 620, "total_steps": 1804, "loss": 0.09774535149335861, "lr": 4.025009090512394e-06, "epoch": 1.3747228381374723, "percentage": 34.37, "elapsed_time": "0:45:37", "remaining_time": "1:27:07"} | |
| {"current_steps": 622, "total_steps": 1804, "loss": 1.0203994512557983, "lr": 4.018200622908153e-06, "epoch": 1.3791574279379157, "percentage": 34.48, "elapsed_time": "0:45:47", "remaining_time": "1:27:00"} | |
| {"current_steps": 624, "total_steps": 1804, "loss": 0.949788510799408, "lr": 4.011375093203793e-06, "epoch": 1.3835920177383592, "percentage": 34.59, "elapsed_time": "0:45:56", "remaining_time": "1:26:53"} | |
| {"current_steps": 626, "total_steps": 1804, "loss": 0.9569694995880127, "lr": 4.004532593228531e-06, "epoch": 1.3880266075388026, "percentage": 34.7, "elapsed_time": "0:46:05", "remaining_time": "1:26:44"} | |
| {"current_steps": 628, "total_steps": 1804, "loss": 1.0896062850952148, "lr": 3.997673215039899e-06, "epoch": 1.3924611973392462, "percentage": 34.81, "elapsed_time": "0:46:14", "remaining_time": "1:26:35"} | |
| {"current_steps": 630, "total_steps": 1804, "loss": 1.084737777709961, "lr": 3.990797050922506e-06, "epoch": 1.3968957871396896, "percentage": 34.92, "elapsed_time": "0:46:24", "remaining_time": "1:26:28"} | |
| {"current_steps": 632, "total_steps": 1804, "loss": 0.8085731863975525, "lr": 3.9839041933867954e-06, "epoch": 1.401330376940133, "percentage": 35.03, "elapsed_time": "0:46:32", "remaining_time": "1:26:17"} | |
| {"current_steps": 634, "total_steps": 1804, "loss": 1.0447653532028198, "lr": 3.976994735167796e-06, "epoch": 1.4057649667405765, "percentage": 35.14, "elapsed_time": "0:46:42", "remaining_time": "1:26:11"} | |
| {"current_steps": 636, "total_steps": 1804, "loss": 1.110194206237793, "lr": 3.970068769223884e-06, "epoch": 1.41019955654102, "percentage": 35.25, "elapsed_time": "0:46:52", "remaining_time": "1:26:05"} | |
| {"current_steps": 638, "total_steps": 1804, "loss": 0.8597381711006165, "lr": 3.963126388735525e-06, "epoch": 1.4146341463414633, "percentage": 35.37, "elapsed_time": "0:47:02", "remaining_time": "1:25:57"} | |
| {"current_steps": 640, "total_steps": 1804, "loss": 0.7458611130714417, "lr": 3.956167687104021e-06, "epoch": 1.4190687361419068, "percentage": 35.48, "elapsed_time": "0:47:09", "remaining_time": "1:25:45"} | |
| {"current_steps": 642, "total_steps": 1804, "loss": 0.8080941438674927, "lr": 3.9491927579502584e-06, "epoch": 1.4235033259423504, "percentage": 35.59, "elapsed_time": "0:47:18", "remaining_time": "1:25:38"} | |
| {"current_steps": 644, "total_steps": 1804, "loss": 0.6925735473632812, "lr": 3.9422016951134415e-06, "epoch": 1.4279379157427938, "percentage": 35.7, "elapsed_time": "0:47:25", "remaining_time": "1:25:26"} | |
| {"current_steps": 646, "total_steps": 1804, "loss": 1.2162237167358398, "lr": 3.935194592649836e-06, "epoch": 1.4323725055432373, "percentage": 35.81, "elapsed_time": "0:47:35", "remaining_time": "1:25:19"} | |
| {"current_steps": 648, "total_steps": 1804, "loss": 1.1060457229614258, "lr": 3.928171544831501e-06, "epoch": 1.4368070953436807, "percentage": 35.92, "elapsed_time": "0:47:46", "remaining_time": "1:25:12"} | |
| {"current_steps": 650, "total_steps": 1804, "loss": 1.1656242609024048, "lr": 3.921132646145019e-06, "epoch": 1.441241685144124, "percentage": 36.03, "elapsed_time": "0:47:55", "remaining_time": "1:25:05"} | |
| {"current_steps": 652, "total_steps": 1804, "loss": 0.9107663035392761, "lr": 3.914077991290232e-06, "epoch": 1.4456762749445677, "percentage": 36.14, "elapsed_time": "0:48:05", "remaining_time": "1:24:59"} | |
| {"current_steps": 654, "total_steps": 1804, "loss": 1.0581393241882324, "lr": 3.907007675178956e-06, "epoch": 1.4501108647450112, "percentage": 36.25, "elapsed_time": "0:48:15", "remaining_time": "1:24:51"} | |
| {"current_steps": 656, "total_steps": 1804, "loss": 0.8163521885871887, "lr": 3.899921792933713e-06, "epoch": 1.4545454545454546, "percentage": 36.36, "elapsed_time": "0:48:22", "remaining_time": "1:24:38"} | |
| {"current_steps": 658, "total_steps": 1804, "loss": 0.8494032621383667, "lr": 3.892820439886448e-06, "epoch": 1.458980044345898, "percentage": 36.47, "elapsed_time": "0:48:29", "remaining_time": "1:24:26"} | |
| {"current_steps": 660, "total_steps": 1804, "loss": 1.0738056898117065, "lr": 3.885703711577249e-06, "epoch": 1.4634146341463414, "percentage": 36.59, "elapsed_time": "0:48:39", "remaining_time": "1:24:20"} | |
| {"current_steps": 662, "total_steps": 1804, "loss": 1.078277587890625, "lr": 3.8785717037530555e-06, "epoch": 1.4678492239467849, "percentage": 36.7, "elapsed_time": "0:48:49", "remaining_time": "1:24:14"} | |
| {"current_steps": 664, "total_steps": 1804, "loss": 0.8074089288711548, "lr": 3.871424512366377e-06, "epoch": 1.4722838137472283, "percentage": 36.81, "elapsed_time": "0:48:59", "remaining_time": "1:24:06"} | |
| {"current_steps": 666, "total_steps": 1804, "loss": 0.5039446949958801, "lr": 3.864262233574e-06, "epoch": 1.476718403547672, "percentage": 36.92, "elapsed_time": "0:49:02", "remaining_time": "1:23:48"} | |
| {"current_steps": 668, "total_steps": 1804, "loss": 0.6357030868530273, "lr": 3.857084963735689e-06, "epoch": 1.4811529933481153, "percentage": 37.03, "elapsed_time": "0:49:09", "remaining_time": "1:23:35"} | |
| {"current_steps": 670, "total_steps": 1804, "loss": 1.0553703308105469, "lr": 3.849892799412902e-06, "epoch": 1.4855875831485588, "percentage": 37.14, "elapsed_time": "0:49:18", "remaining_time": "1:23:27"} | |
| {"current_steps": 672, "total_steps": 1804, "loss": 1.1253726482391357, "lr": 3.84268583736748e-06, "epoch": 1.4900221729490022, "percentage": 37.25, "elapsed_time": "0:49:27", "remaining_time": "1:23:19"} | |
| {"current_steps": 674, "total_steps": 1804, "loss": 0.6612215042114258, "lr": 3.835464174560349e-06, "epoch": 1.4944567627494456, "percentage": 37.36, "elapsed_time": "0:49:34", "remaining_time": "1:23:06"} | |
| {"current_steps": 676, "total_steps": 1804, "loss": 0.753847599029541, "lr": 3.828227908150217e-06, "epoch": 1.4988913525498893, "percentage": 37.47, "elapsed_time": "0:49:44", "remaining_time": "1:23:00"} | |
| {"current_steps": 678, "total_steps": 1804, "loss": 1.095574140548706, "lr": 3.820977135492266e-06, "epoch": 1.5033259423503327, "percentage": 37.58, "elapsed_time": "0:49:54", "remaining_time": "1:22:53"} | |
| {"current_steps": 680, "total_steps": 1804, "loss": 0.6039291620254517, "lr": 3.8137119541368415e-06, "epoch": 1.507760532150776, "percentage": 37.69, "elapsed_time": "0:50:00", "remaining_time": "1:22:39"} | |
| {"current_steps": 682, "total_steps": 1804, "loss": 0.5733712315559387, "lr": 3.80643246182814e-06, "epoch": 1.5121951219512195, "percentage": 37.8, "elapsed_time": "0:50:06", "remaining_time": "1:22:25"} | |
| {"current_steps": 684, "total_steps": 1804, "loss": 1.0893672704696655, "lr": 3.7991387565028963e-06, "epoch": 1.516629711751663, "percentage": 37.92, "elapsed_time": "0:50:15", "remaining_time": "1:22:18"} | |
| {"current_steps": 686, "total_steps": 1804, "loss": 1.0852909088134766, "lr": 3.791830936289062e-06, "epoch": 1.5210643015521064, "percentage": 38.03, "elapsed_time": "0:50:25", "remaining_time": "1:22:11"} | |
| {"current_steps": 688, "total_steps": 1804, "loss": 0.6314648985862732, "lr": 3.784509099504488e-06, "epoch": 1.5254988913525498, "percentage": 38.14, "elapsed_time": "0:50:32", "remaining_time": "1:21:59"} | |
| {"current_steps": 690, "total_steps": 1804, "loss": 0.3787440061569214, "lr": 3.7771733446556025e-06, "epoch": 1.5299334811529932, "percentage": 38.25, "elapsed_time": "0:50:36", "remaining_time": "1:21:42"} | |
| {"current_steps": 692, "total_steps": 1804, "loss": 0.8831952214241028, "lr": 3.7698237704360826e-06, "epoch": 1.5343680709534369, "percentage": 38.36, "elapsed_time": "0:50:43", "remaining_time": "1:21:29"} | |
| {"current_steps": 694, "total_steps": 1804, "loss": 0.9063498377799988, "lr": 3.7624604757255297e-06, "epoch": 1.5388026607538803, "percentage": 38.47, "elapsed_time": "0:50:51", "remaining_time": "1:21:21"} | |
| {"current_steps": 696, "total_steps": 1804, "loss": 0.6573284268379211, "lr": 3.7550835595881365e-06, "epoch": 1.5432372505543237, "percentage": 38.58, "elapsed_time": "0:50:58", "remaining_time": "1:21:08"} | |
| {"current_steps": 698, "total_steps": 1804, "loss": 1.1525388956069946, "lr": 3.747693121271355e-06, "epoch": 1.5476718403547673, "percentage": 38.69, "elapsed_time": "0:51:08", "remaining_time": "1:21:02"} | |
| {"current_steps": 700, "total_steps": 1804, "loss": 1.0669758319854736, "lr": 3.740289260204565e-06, "epoch": 1.5521064301552108, "percentage": 38.8, "elapsed_time": "0:51:18", "remaining_time": "1:20:54"} | |
| {"current_steps": 702, "total_steps": 1804, "loss": 1.1033045053482056, "lr": 3.732872075997729e-06, "epoch": 1.5565410199556542, "percentage": 38.91, "elapsed_time": "0:51:27", "remaining_time": "1:20:45"} | |
| {"current_steps": 704, "total_steps": 1804, "loss": 0.8427386283874512, "lr": 3.725441668440058e-06, "epoch": 1.5609756097560976, "percentage": 39.02, "elapsed_time": "0:51:36", "remaining_time": "1:20:38"} | |
| {"current_steps": 706, "total_steps": 1804, "loss": 0.4659649431705475, "lr": 3.7179981374986683e-06, "epoch": 1.565410199556541, "percentage": 39.14, "elapsed_time": "0:51:43", "remaining_time": "1:20:26"} | |
| {"current_steps": 708, "total_steps": 1804, "loss": 1.1070295572280884, "lr": 3.710541583317233e-06, "epoch": 1.5698447893569845, "percentage": 39.25, "elapsed_time": "0:51:53", "remaining_time": "1:20:20"} | |
| {"current_steps": 710, "total_steps": 1804, "loss": 0.8662485480308533, "lr": 3.70307210621464e-06, "epoch": 1.5742793791574279, "percentage": 39.36, "elapsed_time": "0:52:03", "remaining_time": "1:20:13"} | |
| {"current_steps": 712, "total_steps": 1804, "loss": 0.6354061365127563, "lr": 3.695589806683636e-06, "epoch": 1.5787139689578713, "percentage": 39.47, "elapsed_time": "0:52:13", "remaining_time": "1:20:05"} | |
| {"current_steps": 714, "total_steps": 1804, "loss": 1.0189735889434814, "lr": 3.68809478538948e-06, "epoch": 1.5831485587583147, "percentage": 39.58, "elapsed_time": "0:52:22", "remaining_time": "1:19:56"} | |
| {"current_steps": 716, "total_steps": 1804, "loss": 1.1094708442687988, "lr": 3.6805871431685875e-06, "epoch": 1.5875831485587582, "percentage": 39.69, "elapsed_time": "0:52:32", "remaining_time": "1:19:50"} | |
| {"current_steps": 718, "total_steps": 1804, "loss": 1.3121333122253418, "lr": 3.6730669810271707e-06, "epoch": 1.5920177383592018, "percentage": 39.8, "elapsed_time": "0:52:41", "remaining_time": "1:19:42"} | |
| {"current_steps": 720, "total_steps": 1804, "loss": 0.5672956109046936, "lr": 3.665534400139885e-06, "epoch": 1.5964523281596452, "percentage": 39.91, "elapsed_time": "0:52:51", "remaining_time": "1:19:35"} | |
| {"current_steps": 722, "total_steps": 1804, "loss": 0.5884336233139038, "lr": 3.6579895018484635e-06, "epoch": 1.6008869179600886, "percentage": 40.02, "elapsed_time": "0:52:58", "remaining_time": "1:19:23"} | |
| {"current_steps": 724, "total_steps": 1804, "loss": 1.0861414670944214, "lr": 3.650432387660354e-06, "epoch": 1.6053215077605323, "percentage": 40.13, "elapsed_time": "0:53:09", "remaining_time": "1:19:17"} | |
| {"current_steps": 726, "total_steps": 1804, "loss": 1.0698777437210083, "lr": 3.6428631592473584e-06, "epoch": 1.6097560975609757, "percentage": 40.24, "elapsed_time": "0:53:19", "remaining_time": "1:19:10"} | |
| {"current_steps": 728, "total_steps": 1804, "loss": 0.7489967346191406, "lr": 3.6352819184442552e-06, "epoch": 1.6141906873614191, "percentage": 40.35, "elapsed_time": "0:53:26", "remaining_time": "1:18:58"} | |
| {"current_steps": 730, "total_steps": 1804, "loss": 1.0391978025436401, "lr": 3.6276887672474374e-06, "epoch": 1.6186252771618626, "percentage": 40.47, "elapsed_time": "0:53:36", "remaining_time": "1:18:52"} | |
| {"current_steps": 732, "total_steps": 1804, "loss": 1.090510368347168, "lr": 3.620083807813541e-06, "epoch": 1.623059866962306, "percentage": 40.58, "elapsed_time": "0:53:45", "remaining_time": "1:18:44"} | |
| {"current_steps": 734, "total_steps": 1804, "loss": 1.0251963138580322, "lr": 3.6124671424580633e-06, "epoch": 1.6274944567627494, "percentage": 40.69, "elapsed_time": "0:53:55", "remaining_time": "1:18:35"} | |
| {"current_steps": 736, "total_steps": 1804, "loss": 0.5789291262626648, "lr": 3.604838873653991e-06, "epoch": 1.6319290465631928, "percentage": 40.8, "elapsed_time": "0:53:59", "remaining_time": "1:18:20"} | |
| {"current_steps": 738, "total_steps": 1804, "loss": 1.1960307359695435, "lr": 3.597199104030424e-06, "epoch": 1.6363636363636362, "percentage": 40.91, "elapsed_time": "0:54:10", "remaining_time": "1:18:14"} | |
| {"current_steps": 740, "total_steps": 1804, "loss": 1.0560542345046997, "lr": 3.589547936371189e-06, "epoch": 1.6407982261640797, "percentage": 41.02, "elapsed_time": "0:54:19", "remaining_time": "1:18:06"} | |
| {"current_steps": 742, "total_steps": 1804, "loss": 1.1990491151809692, "lr": 3.58188547361346e-06, "epoch": 1.6452328159645233, "percentage": 41.13, "elapsed_time": "0:54:29", "remaining_time": "1:17:59"} | |
| {"current_steps": 744, "total_steps": 1804, "loss": 0.8513088226318359, "lr": 3.574211818846374e-06, "epoch": 1.6496674057649667, "percentage": 41.24, "elapsed_time": "0:54:38", "remaining_time": "1:17:51"} | |
| {"current_steps": 746, "total_steps": 1804, "loss": 0.9072363972663879, "lr": 3.566527075309641e-06, "epoch": 1.6541019955654102, "percentage": 41.35, "elapsed_time": "0:54:48", "remaining_time": "1:17:43"} | |
| {"current_steps": 748, "total_steps": 1804, "loss": 0.8129432201385498, "lr": 3.558831346392159e-06, "epoch": 1.6585365853658538, "percentage": 41.46, "elapsed_time": "0:54:55", "remaining_time": "1:17:32"} | |
| {"current_steps": 750, "total_steps": 1804, "loss": 1.1549206972122192, "lr": 3.5511247356306205e-06, "epoch": 1.6629711751662972, "percentage": 41.57, "elapsed_time": "0:55:05", "remaining_time": "1:17:25"} | |
| {"current_steps": 752, "total_steps": 1804, "loss": 0.8654111623764038, "lr": 3.5434073467081183e-06, "epoch": 1.6674057649667406, "percentage": 41.69, "elapsed_time": "0:55:14", "remaining_time": "1:17:16"} | |
| {"current_steps": 754, "total_steps": 1804, "loss": 0.5958826541900635, "lr": 3.5356792834527533e-06, "epoch": 1.671840354767184, "percentage": 41.8, "elapsed_time": "0:55:20", "remaining_time": "1:17:04"} | |
| {"current_steps": 756, "total_steps": 1804, "loss": 0.9339615106582642, "lr": 3.527940649836238e-06, "epoch": 1.6762749445676275, "percentage": 41.91, "elapsed_time": "0:55:27", "remaining_time": "1:16:52"} | |
| {"current_steps": 758, "total_steps": 1804, "loss": 1.0644793510437012, "lr": 3.520191549972494e-06, "epoch": 1.680709534368071, "percentage": 42.02, "elapsed_time": "0:55:37", "remaining_time": "1:16:45"} | |
| {"current_steps": 760, "total_steps": 1804, "loss": 0.9741522669792175, "lr": 3.512432088116255e-06, "epoch": 1.6851441241685143, "percentage": 42.13, "elapsed_time": "0:55:47", "remaining_time": "1:16:38"} | |
| {"current_steps": 762, "total_steps": 1804, "loss": 1.0623693466186523, "lr": 3.5046623686616627e-06, "epoch": 1.6895787139689578, "percentage": 42.24, "elapsed_time": "0:55:57", "remaining_time": "1:16:30"} | |
| {"current_steps": 764, "total_steps": 1804, "loss": 1.0941108465194702, "lr": 3.496882496140861e-06, "epoch": 1.6940133037694012, "percentage": 42.35, "elapsed_time": "0:56:07", "remaining_time": "1:16:23"} | |
| {"current_steps": 766, "total_steps": 1804, "loss": 0.9967592358589172, "lr": 3.4890925752225935e-06, "epoch": 1.6984478935698448, "percentage": 42.46, "elapsed_time": "0:56:17", "remaining_time": "1:16:16"} | |
| {"current_steps": 768, "total_steps": 1804, "loss": 0.8642722964286804, "lr": 3.48129271071079e-06, "epoch": 1.7028824833702882, "percentage": 42.57, "elapsed_time": "0:56:26", "remaining_time": "1:16:08"} | |
| {"current_steps": 770, "total_steps": 1804, "loss": 1.0309913158416748, "lr": 3.4734830075431605e-06, "epoch": 1.7073170731707317, "percentage": 42.68, "elapsed_time": "0:56:36", "remaining_time": "1:16:01"} | |
| {"current_steps": 772, "total_steps": 1804, "loss": 1.0626434087753296, "lr": 3.4656635707897823e-06, "epoch": 1.7117516629711753, "percentage": 42.79, "elapsed_time": "0:56:46", "remaining_time": "1:15:54"} | |
| {"current_steps": 774, "total_steps": 1804, "loss": 1.0065494775772095, "lr": 3.457834505651687e-06, "epoch": 1.7161862527716187, "percentage": 42.9, "elapsed_time": "0:56:59", "remaining_time": "1:15:50"} | |
| {"current_steps": 776, "total_steps": 1804, "loss": 1.1634471416473389, "lr": 3.449995917459442e-06, "epoch": 1.7206208425720622, "percentage": 43.02, "elapsed_time": "0:57:32", "remaining_time": "1:16:13"} | |
| {"current_steps": 778, "total_steps": 1804, "loss": 1.0542564392089844, "lr": 3.4421479116717394e-06, "epoch": 1.7250554323725056, "percentage": 43.13, "elapsed_time": "0:57:47", "remaining_time": "1:16:13"} | |
| {"current_steps": 780, "total_steps": 1804, "loss": 0.71857750415802, "lr": 3.4342905938739707e-06, "epoch": 1.729490022172949, "percentage": 43.24, "elapsed_time": "0:57:56", "remaining_time": "1:16:03"} | |
| {"current_steps": 782, "total_steps": 1804, "loss": 1.039180040359497, "lr": 3.4264240697768096e-06, "epoch": 1.7339246119733924, "percentage": 43.35, "elapsed_time": "0:58:07", "remaining_time": "1:15:57"} | |
| {"current_steps": 784, "total_steps": 1804, "loss": 0.7089306712150574, "lr": 3.418548445214791e-06, "epoch": 1.7383592017738358, "percentage": 43.46, "elapsed_time": "0:58:16", "remaining_time": "1:15:49"} | |
| {"current_steps": 786, "total_steps": 1804, "loss": 0.8006396293640137, "lr": 3.410663826144884e-06, "epoch": 1.7427937915742793, "percentage": 43.57, "elapsed_time": "0:58:25", "remaining_time": "1:15:39"} | |
| {"current_steps": 788, "total_steps": 1804, "loss": 0.6931395530700684, "lr": 3.4027703186450672e-06, "epoch": 1.7472283813747227, "percentage": 43.68, "elapsed_time": "0:58:31", "remaining_time": "1:15:28"} | |
| {"current_steps": 790, "total_steps": 1804, "loss": 0.6739537119865417, "lr": 3.394868028912906e-06, "epoch": 1.7516629711751663, "percentage": 43.79, "elapsed_time": "0:58:38", "remaining_time": "1:15:15"} | |
| {"current_steps": 792, "total_steps": 1804, "loss": 0.7071681618690491, "lr": 3.386957063264115e-06, "epoch": 1.7560975609756098, "percentage": 43.9, "elapsed_time": "0:58:45", "remaining_time": "1:15:05"} | |
| {"current_steps": 794, "total_steps": 1804, "loss": 0.8816713094711304, "lr": 3.3790375281311355e-06, "epoch": 1.7605321507760532, "percentage": 44.01, "elapsed_time": "0:58:55", "remaining_time": "1:14:57"} | |
| {"current_steps": 796, "total_steps": 1804, "loss": 0.8279204368591309, "lr": 3.3711095300617015e-06, "epoch": 1.7649667405764968, "percentage": 44.12, "elapsed_time": "0:59:05", "remaining_time": "1:14:49"} | |
| {"current_steps": 798, "total_steps": 1804, "loss": 1.0542290210723877, "lr": 3.3631731757174048e-06, "epoch": 1.7694013303769403, "percentage": 44.24, "elapsed_time": "0:59:15", "remaining_time": "1:14:42"} | |
| {"current_steps": 800, "total_steps": 1804, "loss": 1.0976489782333374, "lr": 3.3552285718722593e-06, "epoch": 1.7738359201773837, "percentage": 44.35, "elapsed_time": "0:59:27", "remaining_time": "1:14:36"} | |
| {"current_steps": 802, "total_steps": 1804, "loss": 1.0607342720031738, "lr": 3.3472758254112662e-06, "epoch": 1.778270509977827, "percentage": 44.46, "elapsed_time": "0:59:38", "remaining_time": "1:14:30"} | |
| {"current_steps": 804, "total_steps": 1804, "loss": 1.1424230337142944, "lr": 3.3393150433289795e-06, "epoch": 1.7827050997782705, "percentage": 44.57, "elapsed_time": "0:59:49", "remaining_time": "1:14:24"} | |
| {"current_steps": 806, "total_steps": 1804, "loss": 0.7997353672981262, "lr": 3.3313463327280576e-06, "epoch": 1.787139689578714, "percentage": 44.68, "elapsed_time": "1:00:01", "remaining_time": "1:14:19"} | |
| {"current_steps": 808, "total_steps": 1804, "loss": 1.1001020669937134, "lr": 3.3233698008178306e-06, "epoch": 1.7915742793791574, "percentage": 44.79, "elapsed_time": "1:00:12", "remaining_time": "1:14:12"} | |
| {"current_steps": 810, "total_steps": 1804, "loss": 0.60479736328125, "lr": 3.3153855549128537e-06, "epoch": 1.7960088691796008, "percentage": 44.9, "elapsed_time": "1:00:20", "remaining_time": "1:14:02"} | |
| {"current_steps": 812, "total_steps": 1804, "loss": 0.6236993074417114, "lr": 3.3073937024314647e-06, "epoch": 1.8004434589800442, "percentage": 45.01, "elapsed_time": "1:00:26", "remaining_time": "1:13:50"} | |
| {"current_steps": 814, "total_steps": 1804, "loss": 1.2215160131454468, "lr": 3.2993943508943386e-06, "epoch": 1.8048780487804879, "percentage": 45.12, "elapsed_time": "1:00:36", "remaining_time": "1:13:42"} | |
| {"current_steps": 816, "total_steps": 1804, "loss": 0.9674443006515503, "lr": 3.291387607923041e-06, "epoch": 1.8093126385809313, "percentage": 45.23, "elapsed_time": "1:00:46", "remaining_time": "1:13:35"} | |
| {"current_steps": 818, "total_steps": 1804, "loss": 0.49516186118125916, "lr": 3.283373581238582e-06, "epoch": 1.8137472283813747, "percentage": 45.34, "elapsed_time": "1:00:53", "remaining_time": "1:13:23"} | |
| {"current_steps": 820, "total_steps": 1804, "loss": 1.1064543724060059, "lr": 3.2753523786599618e-06, "epoch": 1.8181818181818183, "percentage": 45.45, "elapsed_time": "1:01:03", "remaining_time": "1:13:16"} | |
| {"current_steps": 822, "total_steps": 1804, "loss": 1.086673617362976, "lr": 3.2673241081027263e-06, "epoch": 1.8226164079822618, "percentage": 45.57, "elapsed_time": "1:01:11", "remaining_time": "1:13:06"} | |
| {"current_steps": 824, "total_steps": 1804, "loss": 1.1535236835479736, "lr": 3.259288877577512e-06, "epoch": 1.8270509977827052, "percentage": 45.68, "elapsed_time": "1:01:21", "remaining_time": "1:12:58"} | |
| {"current_steps": 826, "total_steps": 1804, "loss": 0.9606926441192627, "lr": 3.251246795188592e-06, "epoch": 1.8314855875831486, "percentage": 45.79, "elapsed_time": "1:01:30", "remaining_time": "1:12:49"} | |
| {"current_steps": 828, "total_steps": 1804, "loss": 0.8494828343391418, "lr": 3.243197969132425e-06, "epoch": 1.835920177383592, "percentage": 45.9, "elapsed_time": "1:01:40", "remaining_time": "1:12:42"} | |
| {"current_steps": 830, "total_steps": 1804, "loss": 1.0482726097106934, "lr": 3.2351425076961957e-06, "epoch": 1.8403547671840355, "percentage": 46.01, "elapsed_time": "1:01:50", "remaining_time": "1:12:34"} | |
| {"current_steps": 832, "total_steps": 1804, "loss": 1.0887196063995361, "lr": 3.22708051925636e-06, "epoch": 1.8447893569844789, "percentage": 46.12, "elapsed_time": "1:02:00", "remaining_time": "1:12:27"} | |
| {"current_steps": 834, "total_steps": 1804, "loss": 1.0047810077667236, "lr": 3.219012112277189e-06, "epoch": 1.8492239467849223, "percentage": 46.23, "elapsed_time": "1:02:10", "remaining_time": "1:12:19"} | |
| {"current_steps": 836, "total_steps": 1804, "loss": 1.151458501815796, "lr": 3.210937395309304e-06, "epoch": 1.8536585365853657, "percentage": 46.34, "elapsed_time": "1:02:20", "remaining_time": "1:12:11"} | |
| {"current_steps": 838, "total_steps": 1804, "loss": 1.0137677192687988, "lr": 3.202856476988222e-06, "epoch": 1.8580931263858091, "percentage": 46.45, "elapsed_time": "1:02:28", "remaining_time": "1:12:01"} | |
| {"current_steps": 840, "total_steps": 1804, "loss": 1.1807574033737183, "lr": 3.1947694660328914e-06, "epoch": 1.8625277161862528, "percentage": 46.56, "elapsed_time": "1:02:37", "remaining_time": "1:11:51"} | |
| {"current_steps": 842, "total_steps": 1804, "loss": 0.6505411863327026, "lr": 3.1866764712442273e-06, "epoch": 1.8669623059866962, "percentage": 46.67, "elapsed_time": "1:02:44", "remaining_time": "1:11:40"} | |
| {"current_steps": 844, "total_steps": 1804, "loss": 0.771373987197876, "lr": 3.1785776015036533e-06, "epoch": 1.8713968957871396, "percentage": 46.78, "elapsed_time": "1:02:53", "remaining_time": "1:11:32"} | |
| {"current_steps": 846, "total_steps": 1804, "loss": 0.8910792469978333, "lr": 3.1704729657716314e-06, "epoch": 1.8758314855875833, "percentage": 46.9, "elapsed_time": "1:03:06", "remaining_time": "1:11:27"} | |
| {"current_steps": 848, "total_steps": 1804, "loss": 1.0148093700408936, "lr": 3.1623626730861996e-06, "epoch": 1.8802660753880267, "percentage": 47.01, "elapsed_time": "1:03:21", "remaining_time": "1:11:25"} | |
| {"current_steps": 850, "total_steps": 1804, "loss": 1.359726071357727, "lr": 3.1542468325615e-06, "epoch": 1.8847006651884701, "percentage": 47.12, "elapsed_time": "1:03:34", "remaining_time": "1:11:21"} | |
| {"current_steps": 852, "total_steps": 1804, "loss": 0.7917972207069397, "lr": 3.1461255533863183e-06, "epoch": 1.8891352549889135, "percentage": 47.23, "elapsed_time": "1:03:48", "remaining_time": "1:11:17"} | |
| {"current_steps": 854, "total_steps": 1804, "loss": 1.073644757270813, "lr": 3.1379989448226077e-06, "epoch": 1.893569844789357, "percentage": 47.34, "elapsed_time": "1:04:02", "remaining_time": "1:11:14"} | |
| {"current_steps": 856, "total_steps": 1804, "loss": 0.9803899526596069, "lr": 3.1298671162040236e-06, "epoch": 1.8980044345898004, "percentage": 47.45, "elapsed_time": "1:04:13", "remaining_time": "1:11:07"} | |
| {"current_steps": 858, "total_steps": 1804, "loss": 0.7556897401809692, "lr": 3.1217301769344488e-06, "epoch": 1.9024390243902438, "percentage": 47.56, "elapsed_time": "1:04:22", "remaining_time": "1:10:58"} | |
| {"current_steps": 860, "total_steps": 1804, "loss": 0.8003832101821899, "lr": 3.1135882364865262e-06, "epoch": 1.9068736141906872, "percentage": 47.67, "elapsed_time": "1:04:32", "remaining_time": "1:10:50"} | |
| {"current_steps": 862, "total_steps": 1804, "loss": 1.0547810792922974, "lr": 3.105441404400183e-06, "epoch": 1.9113082039911307, "percentage": 47.78, "elapsed_time": "1:04:41", "remaining_time": "1:10:41"} | |
| {"current_steps": 864, "total_steps": 1804, "loss": 0.8129977583885193, "lr": 3.097289790281155e-06, "epoch": 1.9157427937915743, "percentage": 47.89, "elapsed_time": "1:04:51", "remaining_time": "1:10:33"} | |
| {"current_steps": 866, "total_steps": 1804, "loss": 0.6351861953735352, "lr": 3.089133503799517e-06, "epoch": 1.9201773835920177, "percentage": 48.0, "elapsed_time": "1:05:01", "remaining_time": "1:10:25"} | |
| {"current_steps": 868, "total_steps": 1804, "loss": 1.1457592248916626, "lr": 3.0809726546882045e-06, "epoch": 1.9246119733924612, "percentage": 48.12, "elapsed_time": "1:05:11", "remaining_time": "1:10:18"} | |
| {"current_steps": 870, "total_steps": 1804, "loss": 1.0072696208953857, "lr": 3.0728073527415376e-06, "epoch": 1.9290465631929048, "percentage": 48.23, "elapsed_time": "1:05:22", "remaining_time": "1:10:10"} | |
| {"current_steps": 872, "total_steps": 1804, "loss": 0.7055673599243164, "lr": 3.0646377078137424e-06, "epoch": 1.9334811529933482, "percentage": 48.34, "elapsed_time": "1:05:30", "remaining_time": "1:10:01"} | |
| {"current_steps": 874, "total_steps": 1804, "loss": 0.6644902229309082, "lr": 3.056463829817475e-06, "epoch": 1.9379157427937916, "percentage": 48.45, "elapsed_time": "1:05:39", "remaining_time": "1:09:52"} | |
| {"current_steps": 876, "total_steps": 1804, "loss": 0.7358293533325195, "lr": 3.048285828722345e-06, "epoch": 1.942350332594235, "percentage": 48.56, "elapsed_time": "1:05:47", "remaining_time": "1:09:42"} | |
| {"current_steps": 878, "total_steps": 1804, "loss": 0.8098105788230896, "lr": 3.0401038145534297e-06, "epoch": 1.9467849223946785, "percentage": 48.67, "elapsed_time": "1:05:55", "remaining_time": "1:09:31"} | |
| {"current_steps": 880, "total_steps": 1804, "loss": 0.8312487602233887, "lr": 3.031917897389799e-06, "epoch": 1.951219512195122, "percentage": 48.78, "elapsed_time": "1:06:05", "remaining_time": "1:09:23"} | |
| {"current_steps": 882, "total_steps": 1804, "loss": 0.6966821551322937, "lr": 3.0237281873630335e-06, "epoch": 1.9556541019955653, "percentage": 48.89, "elapsed_time": "1:06:17", "remaining_time": "1:09:18"} | |
| {"current_steps": 884, "total_steps": 1804, "loss": 0.8802693486213684, "lr": 3.0155347946557407e-06, "epoch": 1.9600886917960088, "percentage": 49.0, "elapsed_time": "1:06:59", "remaining_time": "1:09:43"} | |
| {"current_steps": 886, "total_steps": 1804, "loss": 1.0593414306640625, "lr": 3.007337829500075e-06, "epoch": 1.9645232815964522, "percentage": 49.11, "elapsed_time": "1:07:10", "remaining_time": "1:09:36"} | |
| {"current_steps": 888, "total_steps": 1804, "loss": 1.0278575420379639, "lr": 2.999137402176255e-06, "epoch": 1.9689578713968958, "percentage": 49.22, "elapsed_time": "1:07:21", "remaining_time": "1:09:29"} | |
| {"current_steps": 890, "total_steps": 1804, "loss": 1.0861477851867676, "lr": 2.9909336230110747e-06, "epoch": 1.9733924611973392, "percentage": 49.33, "elapsed_time": "1:07:31", "remaining_time": "1:09:20"} | |
| {"current_steps": 892, "total_steps": 1804, "loss": 1.1251873970031738, "lr": 2.9827266023764274e-06, "epoch": 1.9778270509977827, "percentage": 49.45, "elapsed_time": "1:07:41", "remaining_time": "1:09:12"} | |
| {"current_steps": 894, "total_steps": 1804, "loss": 1.1135941743850708, "lr": 2.9745164506878134e-06, "epoch": 1.9822616407982263, "percentage": 49.56, "elapsed_time": "1:07:51", "remaining_time": "1:09:04"} | |
| {"current_steps": 896, "total_steps": 1804, "loss": 0.8289718627929688, "lr": 2.9663032784028596e-06, "epoch": 1.9866962305986697, "percentage": 49.67, "elapsed_time": "1:07:58", "remaining_time": "1:08:52"} | |
| {"current_steps": 898, "total_steps": 1804, "loss": 1.0270378589630127, "lr": 2.9580871960198297e-06, "epoch": 1.9911308203991132, "percentage": 49.78, "elapsed_time": "1:08:08", "remaining_time": "1:08:45"} | |
| {"current_steps": 900, "total_steps": 1804, "loss": 0.792127251625061, "lr": 2.949868314076142e-06, "epoch": 1.9955654101995566, "percentage": 49.89, "elapsed_time": "1:08:18", "remaining_time": "1:08:36"} | |
| {"current_steps": 902, "total_steps": 1804, "loss": 1.0664583444595337, "lr": 2.941646743146875e-06, "epoch": 2.0, "percentage": 50.0, "elapsed_time": "1:08:28", "remaining_time": "1:08:28"} | |
| {"current_steps": 904, "total_steps": 1804, "loss": 0.8489485383033752, "lr": 2.9334225938432868e-06, "epoch": 2.0044345898004434, "percentage": 50.11, "elapsed_time": "1:08:39", "remaining_time": "1:08:20"} | |
| {"current_steps": 906, "total_steps": 1804, "loss": 0.9491725564002991, "lr": 2.925195976811326e-06, "epoch": 2.008869179600887, "percentage": 50.22, "elapsed_time": "1:08:49", "remaining_time": "1:08:13"} | |
| {"current_steps": 908, "total_steps": 1804, "loss": 1.0454566478729248, "lr": 2.9169670027301387e-06, "epoch": 2.0133037694013303, "percentage": 50.33, "elapsed_time": "1:08:59", "remaining_time": "1:08:05"} | |
| {"current_steps": 910, "total_steps": 1804, "loss": 0.9005047082901001, "lr": 2.9087357823105843e-06, "epoch": 2.0177383592017737, "percentage": 50.44, "elapsed_time": "1:09:09", "remaining_time": "1:07:56"} | |
| {"current_steps": 912, "total_steps": 1804, "loss": 0.6029907464981079, "lr": 2.9005024262937427e-06, "epoch": 2.022172949002217, "percentage": 50.55, "elapsed_time": "1:09:17", "remaining_time": "1:07:46"} | |
| {"current_steps": 914, "total_steps": 1804, "loss": 0.656877875328064, "lr": 2.8922670454494247e-06, "epoch": 2.0266075388026605, "percentage": 50.67, "elapsed_time": "1:09:29", "remaining_time": "1:07:39"} | |
| {"current_steps": 916, "total_steps": 1804, "loss": 0.6500725746154785, "lr": 2.8840297505746843e-06, "epoch": 2.0310421286031044, "percentage": 50.78, "elapsed_time": "1:09:38", "remaining_time": "1:07:30"} | |
| {"current_steps": 918, "total_steps": 1804, "loss": 0.8578327298164368, "lr": 2.8757906524923286e-06, "epoch": 2.035476718403548, "percentage": 50.89, "elapsed_time": "1:09:50", "remaining_time": "1:07:24"} | |
| {"current_steps": 920, "total_steps": 1804, "loss": 0.6973807215690613, "lr": 2.867549862049419e-06, "epoch": 2.0399113082039912, "percentage": 51.0, "elapsed_time": "1:10:01", "remaining_time": "1:07:16"} | |
| {"current_steps": 922, "total_steps": 1804, "loss": 0.7475817799568176, "lr": 2.859307490115791e-06, "epoch": 2.0443458980044347, "percentage": 51.11, "elapsed_time": "1:10:13", "remaining_time": "1:07:10"} | |
| {"current_steps": 924, "total_steps": 1804, "loss": 0.24903425574302673, "lr": 2.8510636475825533e-06, "epoch": 2.048780487804878, "percentage": 51.22, "elapsed_time": "1:10:17", "remaining_time": "1:06:56"} | |
| {"current_steps": 926, "total_steps": 1804, "loss": 0.849932074546814, "lr": 2.8428184453606027e-06, "epoch": 2.0532150776053215, "percentage": 51.33, "elapsed_time": "1:10:27", "remaining_time": "1:06:48"} | |
| {"current_steps": 928, "total_steps": 1804, "loss": 0.6786062121391296, "lr": 2.8345719943791266e-06, "epoch": 2.057649667405765, "percentage": 51.44, "elapsed_time": "1:10:35", "remaining_time": "1:06:38"} | |
| {"current_steps": 930, "total_steps": 1804, "loss": 0.6096989512443542, "lr": 2.826324405584114e-06, "epoch": 2.0620842572062084, "percentage": 51.55, "elapsed_time": "1:10:42", "remaining_time": "1:06:27"} | |
| {"current_steps": 932, "total_steps": 1804, "loss": 0.7550086379051208, "lr": 2.818075789936863e-06, "epoch": 2.066518847006652, "percentage": 51.66, "elapsed_time": "1:10:52", "remaining_time": "1:06:19"} | |
| {"current_steps": 934, "total_steps": 1804, "loss": 0.8016495704650879, "lr": 2.8098262584124834e-06, "epoch": 2.070953436807095, "percentage": 51.77, "elapsed_time": "1:11:02", "remaining_time": "1:06:10"} | |
| {"current_steps": 936, "total_steps": 1804, "loss": 0.9375801682472229, "lr": 2.801575921998411e-06, "epoch": 2.0753880266075386, "percentage": 51.88, "elapsed_time": "1:11:13", "remaining_time": "1:06:02"} | |
| {"current_steps": 938, "total_steps": 1804, "loss": 0.6009020805358887, "lr": 2.7933248916929066e-06, "epoch": 2.079822616407982, "percentage": 52.0, "elapsed_time": "1:11:22", "remaining_time": "1:05:53"} | |
| {"current_steps": 940, "total_steps": 1804, "loss": 0.6308031678199768, "lr": 2.7850732785035705e-06, "epoch": 2.084257206208426, "percentage": 52.11, "elapsed_time": "1:11:31", "remaining_time": "1:05:44"} | |
| {"current_steps": 942, "total_steps": 1804, "loss": 0.8890527486801147, "lr": 2.7768211934458417e-06, "epoch": 2.0886917960088693, "percentage": 52.22, "elapsed_time": "1:11:41", "remaining_time": "1:05:36"} | |
| {"current_steps": 944, "total_steps": 1804, "loss": 0.5294168591499329, "lr": 2.768568747541509e-06, "epoch": 2.0931263858093128, "percentage": 52.33, "elapsed_time": "1:11:48", "remaining_time": "1:05:25"} | |
| {"current_steps": 946, "total_steps": 1804, "loss": 0.8766027688980103, "lr": 2.7603160518172152e-06, "epoch": 2.097560975609756, "percentage": 52.44, "elapsed_time": "1:11:59", "remaining_time": "1:05:17"} | |
| {"current_steps": 948, "total_steps": 1804, "loss": 0.9853121042251587, "lr": 2.752063217302966e-06, "epoch": 2.1019955654101996, "percentage": 52.55, "elapsed_time": "1:12:08", "remaining_time": "1:05:08"} | |
| {"current_steps": 950, "total_steps": 1804, "loss": 0.6440744996070862, "lr": 2.743810355030631e-06, "epoch": 2.106430155210643, "percentage": 52.66, "elapsed_time": "1:12:16", "remaining_time": "1:04:57"} | |
| {"current_steps": 952, "total_steps": 1804, "loss": 0.7647169828414917, "lr": 2.735557576032458e-06, "epoch": 2.1108647450110865, "percentage": 52.77, "elapsed_time": "1:12:24", "remaining_time": "1:04:47"} | |
| {"current_steps": 954, "total_steps": 1804, "loss": 0.8036750555038452, "lr": 2.727304991339569e-06, "epoch": 2.11529933481153, "percentage": 52.88, "elapsed_time": "1:12:35", "remaining_time": "1:04:40"} | |
| {"current_steps": 956, "total_steps": 1804, "loss": 0.8988800644874573, "lr": 2.7190527119804762e-06, "epoch": 2.1197339246119733, "percentage": 52.99, "elapsed_time": "1:12:47", "remaining_time": "1:04:33"} | |
| {"current_steps": 958, "total_steps": 1804, "loss": 0.7791066765785217, "lr": 2.710800848979582e-06, "epoch": 2.1241685144124167, "percentage": 53.1, "elapsed_time": "1:12:57", "remaining_time": "1:04:25"} | |
| {"current_steps": 960, "total_steps": 1804, "loss": 0.7849658131599426, "lr": 2.702549513355687e-06, "epoch": 2.12860310421286, "percentage": 53.22, "elapsed_time": "1:13:07", "remaining_time": "1:04:17"} | |
| {"current_steps": 962, "total_steps": 1804, "loss": 0.5346379280090332, "lr": 2.694298816120497e-06, "epoch": 2.1330376940133036, "percentage": 53.33, "elapsed_time": "1:13:23", "remaining_time": "1:04:13"} | |
| {"current_steps": 964, "total_steps": 1804, "loss": 1.0068566799163818, "lr": 2.6860488682771306e-06, "epoch": 2.1374722838137474, "percentage": 53.44, "elapsed_time": "1:13:35", "remaining_time": "1:04:07"} | |
| {"current_steps": 966, "total_steps": 1804, "loss": 0.9128319025039673, "lr": 2.67779978081862e-06, "epoch": 2.141906873614191, "percentage": 53.55, "elapsed_time": "1:13:51", "remaining_time": "1:04:04"} | |
| {"current_steps": 968, "total_steps": 1804, "loss": 0.6138067245483398, "lr": 2.669551664726428e-06, "epoch": 2.1463414634146343, "percentage": 53.66, "elapsed_time": "1:13:58", "remaining_time": "1:03:53"} | |
| {"current_steps": 970, "total_steps": 1804, "loss": 0.7260922193527222, "lr": 2.6613046309689433e-06, "epoch": 2.1507760532150777, "percentage": 53.77, "elapsed_time": "1:14:05", "remaining_time": "1:03:42"} | |
| {"current_steps": 972, "total_steps": 1804, "loss": 0.5952677726745605, "lr": 2.6530587904999966e-06, "epoch": 2.155210643015521, "percentage": 53.88, "elapsed_time": "1:14:15", "remaining_time": "1:03:33"} | |
| {"current_steps": 974, "total_steps": 1804, "loss": 0.933496356010437, "lr": 2.6448142542573624e-06, "epoch": 2.1596452328159645, "percentage": 53.99, "elapsed_time": "1:14:26", "remaining_time": "1:03:26"} | |
| {"current_steps": 976, "total_steps": 1804, "loss": 0.906917929649353, "lr": 2.6365711331612692e-06, "epoch": 2.164079822616408, "percentage": 54.1, "elapsed_time": "1:14:36", "remaining_time": "1:03:17"} | |
| {"current_steps": 978, "total_steps": 1804, "loss": 0.6723505258560181, "lr": 2.6283295381129066e-06, "epoch": 2.1685144124168514, "percentage": 54.21, "elapsed_time": "1:14:45", "remaining_time": "1:03:08"} | |
| {"current_steps": 980, "total_steps": 1804, "loss": 0.9519558548927307, "lr": 2.620089579992933e-06, "epoch": 2.172949002217295, "percentage": 54.32, "elapsed_time": "1:14:56", "remaining_time": "1:03:00"} | |
| {"current_steps": 982, "total_steps": 1804, "loss": 0.5118272304534912, "lr": 2.6118513696599823e-06, "epoch": 2.1773835920177382, "percentage": 54.43, "elapsed_time": "1:15:04", "remaining_time": "1:02:50"} | |
| {"current_steps": 984, "total_steps": 1804, "loss": 0.31727081537246704, "lr": 2.603615017949178e-06, "epoch": 2.1818181818181817, "percentage": 54.55, "elapsed_time": "1:15:12", "remaining_time": "1:02:40"} | |
| {"current_steps": 986, "total_steps": 1804, "loss": 0.6958884596824646, "lr": 2.595380635670634e-06, "epoch": 2.186252771618625, "percentage": 54.66, "elapsed_time": "1:15:23", "remaining_time": "1:02:33"} | |
| {"current_steps": 988, "total_steps": 1804, "loss": 0.4852255582809448, "lr": 2.5871483336079694e-06, "epoch": 2.1906873614190685, "percentage": 54.77, "elapsed_time": "1:15:31", "remaining_time": "1:02:22"} | |
| {"current_steps": 990, "total_steps": 1804, "loss": 0.8706328868865967, "lr": 2.578918222516818e-06, "epoch": 2.1951219512195124, "percentage": 54.88, "elapsed_time": "1:15:38", "remaining_time": "1:02:11"} | |
| {"current_steps": 992, "total_steps": 1804, "loss": 1.0283949375152588, "lr": 2.5706904131233336e-06, "epoch": 2.199556541019956, "percentage": 54.99, "elapsed_time": "1:15:48", "remaining_time": "1:02:03"} | |
| {"current_steps": 994, "total_steps": 1804, "loss": 0.8138683438301086, "lr": 2.5624650161227073e-06, "epoch": 2.203991130820399, "percentage": 55.1, "elapsed_time": "1:15:56", "remaining_time": "1:01:53"} | |
| {"current_steps": 996, "total_steps": 1804, "loss": 0.9206845760345459, "lr": 2.5542421421776696e-06, "epoch": 2.2084257206208426, "percentage": 55.21, "elapsed_time": "1:16:06", "remaining_time": "1:01:44"} | |
| {"current_steps": 998, "total_steps": 1804, "loss": 0.8530703783035278, "lr": 2.5460219019170097e-06, "epoch": 2.212860310421286, "percentage": 55.32, "elapsed_time": "1:16:16", "remaining_time": "1:01:36"} | |
| {"current_steps": 1000, "total_steps": 1804, "loss": 0.7263464331626892, "lr": 2.5378044059340845e-06, "epoch": 2.2172949002217295, "percentage": 55.43, "elapsed_time": "1:16:26", "remaining_time": "1:01:27"} | |
| {"current_steps": 1002, "total_steps": 1804, "loss": 0.42510873079299927, "lr": 2.5295897647853283e-06, "epoch": 2.221729490022173, "percentage": 55.54, "elapsed_time": "1:16:33", "remaining_time": "1:01:16"} | |
| {"current_steps": 1004, "total_steps": 1804, "loss": 0.9547646641731262, "lr": 2.521378088988767e-06, "epoch": 2.2261640798226163, "percentage": 55.65, "elapsed_time": "1:16:43", "remaining_time": "1:01:07"} | |
| {"current_steps": 1006, "total_steps": 1804, "loss": 0.8016040921211243, "lr": 2.513169489022531e-06, "epoch": 2.2305986696230597, "percentage": 55.76, "elapsed_time": "1:16:50", "remaining_time": "1:00:57"} | |
| {"current_steps": 1008, "total_steps": 1804, "loss": 0.8940417766571045, "lr": 2.5049640753233705e-06, "epoch": 2.235033259423503, "percentage": 55.88, "elapsed_time": "1:17:00", "remaining_time": "1:00:48"} | |
| {"current_steps": 1010, "total_steps": 1804, "loss": 0.5136449933052063, "lr": 2.496761958285167e-06, "epoch": 2.2394678492239466, "percentage": 55.99, "elapsed_time": "1:17:07", "remaining_time": "1:00:38"} | |
| {"current_steps": 1012, "total_steps": 1804, "loss": 0.8877573609352112, "lr": 2.488563248257451e-06, "epoch": 2.2439024390243905, "percentage": 56.1, "elapsed_time": "1:17:17", "remaining_time": "1:00:29"} | |
| {"current_steps": 1014, "total_steps": 1804, "loss": 0.769023060798645, "lr": 2.4803680555439136e-06, "epoch": 2.248337028824834, "percentage": 56.21, "elapsed_time": "1:17:27", "remaining_time": "1:00:21"} | |
| {"current_steps": 1016, "total_steps": 1804, "loss": 0.8959583640098572, "lr": 2.4721764904009272e-06, "epoch": 2.2527716186252773, "percentage": 56.32, "elapsed_time": "1:17:38", "remaining_time": "1:00:12"} | |
| {"current_steps": 1018, "total_steps": 1804, "loss": 0.5399938821792603, "lr": 2.4639886630360574e-06, "epoch": 2.2572062084257207, "percentage": 56.43, "elapsed_time": "1:17:44", "remaining_time": "1:00:01"} | |
| {"current_steps": 1020, "total_steps": 1804, "loss": 0.9447622895240784, "lr": 2.455804683606584e-06, "epoch": 2.261640798226164, "percentage": 56.54, "elapsed_time": "1:17:55", "remaining_time": "0:59:53"} | |
| {"current_steps": 1022, "total_steps": 1804, "loss": 0.4857233166694641, "lr": 2.4476246622180174e-06, "epoch": 2.2660753880266076, "percentage": 56.65, "elapsed_time": "1:18:01", "remaining_time": "0:59:42"} | |
| {"current_steps": 1024, "total_steps": 1804, "loss": 1.283814787864685, "lr": 2.4394487089226158e-06, "epoch": 2.270509977827051, "percentage": 56.76, "elapsed_time": "1:18:11", "remaining_time": "0:59:33"} | |
| {"current_steps": 1026, "total_steps": 1804, "loss": 0.6131373643875122, "lr": 2.43127693371791e-06, "epoch": 2.2749445676274944, "percentage": 56.87, "elapsed_time": "1:18:20", "remaining_time": "0:59:24"} | |
| {"current_steps": 1028, "total_steps": 1804, "loss": 0.9470140337944031, "lr": 2.423109446545213e-06, "epoch": 2.279379157427938, "percentage": 56.98, "elapsed_time": "1:18:30", "remaining_time": "0:59:15"} | |
| {"current_steps": 1030, "total_steps": 1804, "loss": 0.9135305881500244, "lr": 2.4149463572881537e-06, "epoch": 2.2838137472283813, "percentage": 57.1, "elapsed_time": "1:18:40", "remaining_time": "0:59:07"} | |
| {"current_steps": 1032, "total_steps": 1804, "loss": 0.6614438891410828, "lr": 2.4067877757711907e-06, "epoch": 2.2882483370288247, "percentage": 57.21, "elapsed_time": "1:18:50", "remaining_time": "0:58:58"} | |
| {"current_steps": 1034, "total_steps": 1804, "loss": 0.7243059277534485, "lr": 2.3986338117581357e-06, "epoch": 2.292682926829268, "percentage": 57.32, "elapsed_time": "1:18:58", "remaining_time": "0:58:48"} | |
| {"current_steps": 1036, "total_steps": 1804, "loss": 0.9390950798988342, "lr": 2.390484574950677e-06, "epoch": 2.2971175166297115, "percentage": 57.43, "elapsed_time": "1:19:08", "remaining_time": "0:58:40"} | |
| {"current_steps": 1038, "total_steps": 1804, "loss": 0.6997823715209961, "lr": 2.382340174986906e-06, "epoch": 2.3015521064301554, "percentage": 57.54, "elapsed_time": "1:19:16", "remaining_time": "0:58:30"} | |
| {"current_steps": 1040, "total_steps": 1804, "loss": 0.9242331385612488, "lr": 2.374200721439837e-06, "epoch": 2.305986696230599, "percentage": 57.65, "elapsed_time": "1:19:26", "remaining_time": "0:58:21"} | |
| {"current_steps": 1042, "total_steps": 1804, "loss": 1.012819766998291, "lr": 2.3660663238159405e-06, "epoch": 2.3104212860310422, "percentage": 57.76, "elapsed_time": "1:19:51", "remaining_time": "0:58:24"} | |
| {"current_steps": 1044, "total_steps": 1804, "loss": 0.6412765383720398, "lr": 2.357937091553662e-06, "epoch": 2.3148558758314857, "percentage": 57.87, "elapsed_time": "1:20:21", "remaining_time": "0:58:30"} | |
| {"current_steps": 1046, "total_steps": 1804, "loss": 0.9358582496643066, "lr": 2.3498131340219554e-06, "epoch": 2.319290465631929, "percentage": 57.98, "elapsed_time": "1:20:32", "remaining_time": "0:58:21"} | |
| {"current_steps": 1048, "total_steps": 1804, "loss": 0.8715764880180359, "lr": 2.341694560518809e-06, "epoch": 2.3237250554323725, "percentage": 58.09, "elapsed_time": "1:20:42", "remaining_time": "0:58:13"} | |
| {"current_steps": 1050, "total_steps": 1804, "loss": 0.9279825687408447, "lr": 2.333581480269776e-06, "epoch": 2.328159645232816, "percentage": 58.2, "elapsed_time": "1:21:00", "remaining_time": "0:58:10"} | |
| {"current_steps": 1052, "total_steps": 1804, "loss": 0.8819960951805115, "lr": 2.325474002426503e-06, "epoch": 2.3325942350332594, "percentage": 58.31, "elapsed_time": "1:21:10", "remaining_time": "0:58:01"} | |
| {"current_steps": 1054, "total_steps": 1804, "loss": 0.6645777821540833, "lr": 2.3173722360652644e-06, "epoch": 2.337028824833703, "percentage": 58.43, "elapsed_time": "1:21:48", "remaining_time": "0:58:12"} | |
| {"current_steps": 1056, "total_steps": 1804, "loss": 0.8606789708137512, "lr": 2.309276290185494e-06, "epoch": 2.341463414634146, "percentage": 58.54, "elapsed_time": "1:21:58", "remaining_time": "0:58:03"} | |
| {"current_steps": 1058, "total_steps": 1804, "loss": 0.5443306565284729, "lr": 2.3011862737083162e-06, "epoch": 2.3458980044345896, "percentage": 58.65, "elapsed_time": "1:22:23", "remaining_time": "0:58:05"} | |
| {"current_steps": 1060, "total_steps": 1804, "loss": 1.009533405303955, "lr": 2.2931022954750843e-06, "epoch": 2.3503325942350335, "percentage": 58.76, "elapsed_time": "1:22:32", "remaining_time": "0:57:56"} | |
| {"current_steps": 1062, "total_steps": 1804, "loss": 0.4540158808231354, "lr": 2.285024464245912e-06, "epoch": 2.354767184035477, "percentage": 58.87, "elapsed_time": "1:22:39", "remaining_time": "0:57:44"} | |
| {"current_steps": 1064, "total_steps": 1804, "loss": 0.49174901843070984, "lr": 2.2769528886982158e-06, "epoch": 2.3592017738359203, "percentage": 58.98, "elapsed_time": "1:22:46", "remaining_time": "0:57:34"} | |
| {"current_steps": 1066, "total_steps": 1804, "loss": 0.7375195026397705, "lr": 2.268887677425248e-06, "epoch": 2.3636363636363638, "percentage": 59.09, "elapsed_time": "1:23:06", "remaining_time": "0:57:32"} | |
| {"current_steps": 1068, "total_steps": 1804, "loss": 1.0150161981582642, "lr": 2.2608289389346362e-06, "epoch": 2.368070953436807, "percentage": 59.2, "elapsed_time": "1:23:33", "remaining_time": "0:57:34"} | |
| {"current_steps": 1070, "total_steps": 1804, "loss": 0.8399034142494202, "lr": 2.2527767816469263e-06, "epoch": 2.3725055432372506, "percentage": 59.31, "elapsed_time": "1:24:01", "remaining_time": "0:57:38"} | |
| {"current_steps": 1072, "total_steps": 1804, "loss": 0.87836754322052, "lr": 2.244731313894121e-06, "epoch": 2.376940133037694, "percentage": 59.42, "elapsed_time": "1:24:11", "remaining_time": "0:57:29"} | |
| {"current_steps": 1074, "total_steps": 1804, "loss": 0.040761929005384445, "lr": 2.236692643918224e-06, "epoch": 2.3813747228381374, "percentage": 59.53, "elapsed_time": "1:24:15", "remaining_time": "0:57:16"} | |
| {"current_steps": 1076, "total_steps": 1804, "loss": 0.5826558470726013, "lr": 2.2286608798697834e-06, "epoch": 2.385809312638581, "percentage": 59.65, "elapsed_time": "1:24:23", "remaining_time": "0:57:05"} | |
| {"current_steps": 1078, "total_steps": 1804, "loss": 0.29340535402297974, "lr": 2.2206361298064343e-06, "epoch": 2.3902439024390243, "percentage": 59.76, "elapsed_time": "1:24:28", "remaining_time": "0:56:53"} | |
| {"current_steps": 1080, "total_steps": 1804, "loss": 0.8513185381889343, "lr": 2.2126185016914515e-06, "epoch": 2.3946784922394677, "percentage": 59.87, "elapsed_time": "1:24:38", "remaining_time": "0:56:44"} | |
| {"current_steps": 1082, "total_steps": 1804, "loss": 0.5718480348587036, "lr": 2.2046081033922884e-06, "epoch": 2.399113082039911, "percentage": 59.98, "elapsed_time": "1:24:45", "remaining_time": "0:56:33"} | |
| {"current_steps": 1084, "total_steps": 1804, "loss": 0.7035835385322571, "lr": 2.1966050426791325e-06, "epoch": 2.4035476718403546, "percentage": 60.09, "elapsed_time": "1:24:55", "remaining_time": "0:56:24"} | |
| {"current_steps": 1086, "total_steps": 1804, "loss": 0.906488299369812, "lr": 2.1886094272234508e-06, "epoch": 2.4079822616407984, "percentage": 60.2, "elapsed_time": "1:25:04", "remaining_time": "0:56:14"} | |
| {"current_steps": 1088, "total_steps": 1804, "loss": 0.29125481843948364, "lr": 2.1806213645965457e-06, "epoch": 2.412416851441242, "percentage": 60.31, "elapsed_time": "1:25:11", "remaining_time": "0:56:03"} | |
| {"current_steps": 1090, "total_steps": 1804, "loss": 0.9137746691703796, "lr": 2.172640962268104e-06, "epoch": 2.4168514412416853, "percentage": 60.42, "elapsed_time": "1:25:21", "remaining_time": "0:55:54"} | |
| {"current_steps": 1092, "total_steps": 1804, "loss": 0.9136351943016052, "lr": 2.1646683276047525e-06, "epoch": 2.4212860310421287, "percentage": 60.53, "elapsed_time": "1:25:30", "remaining_time": "0:55:45"} | |
| {"current_steps": 1094, "total_steps": 1804, "loss": 0.4814295470714569, "lr": 2.156703567868615e-06, "epoch": 2.425720620842572, "percentage": 60.64, "elapsed_time": "1:25:36", "remaining_time": "0:55:33"} | |
| {"current_steps": 1096, "total_steps": 1804, "loss": 0.5724620819091797, "lr": 2.148746790215866e-06, "epoch": 2.4301552106430155, "percentage": 60.75, "elapsed_time": "1:25:45", "remaining_time": "0:55:24"} | |
| {"current_steps": 1098, "total_steps": 1804, "loss": 0.45596760511398315, "lr": 2.140798101695291e-06, "epoch": 2.434589800443459, "percentage": 60.86, "elapsed_time": "1:25:52", "remaining_time": "0:55:12"} | |
| {"current_steps": 1100, "total_steps": 1804, "loss": 1.0661569833755493, "lr": 2.1328576092468476e-06, "epoch": 2.4390243902439024, "percentage": 60.98, "elapsed_time": "1:26:01", "remaining_time": "0:55:03"} | |
| {"current_steps": 1102, "total_steps": 1804, "loss": 0.9525973796844482, "lr": 2.124925419700223e-06, "epoch": 2.443458980044346, "percentage": 61.09, "elapsed_time": "1:26:12", "remaining_time": "0:54:54"} | |
| {"current_steps": 1104, "total_steps": 1804, "loss": 0.45727112889289856, "lr": 2.1170016397734e-06, "epoch": 2.4478935698447892, "percentage": 61.2, "elapsed_time": "1:26:21", "remaining_time": "0:54:45"} | |
| {"current_steps": 1106, "total_steps": 1804, "loss": 0.940199077129364, "lr": 2.109086376071221e-06, "epoch": 2.4523281596452327, "percentage": 61.31, "elapsed_time": "1:26:30", "remaining_time": "0:54:35"} | |
| {"current_steps": 1108, "total_steps": 1804, "loss": 0.8561551570892334, "lr": 2.1011797350839513e-06, "epoch": 2.4567627494456765, "percentage": 61.42, "elapsed_time": "1:26:41", "remaining_time": "0:54:27"} | |
| {"current_steps": 1110, "total_steps": 1804, "loss": 0.9693219661712646, "lr": 2.093281823185848e-06, "epoch": 2.4611973392461195, "percentage": 61.53, "elapsed_time": "1:26:50", "remaining_time": "0:54:17"} | |
| {"current_steps": 1112, "total_steps": 1804, "loss": 0.6467586755752563, "lr": 2.0853927466337315e-06, "epoch": 2.4656319290465634, "percentage": 61.64, "elapsed_time": "1:27:00", "remaining_time": "0:54:08"} | |
| {"current_steps": 1114, "total_steps": 1804, "loss": 0.870927095413208, "lr": 2.077512611565551e-06, "epoch": 2.470066518847007, "percentage": 61.75, "elapsed_time": "1:27:10", "remaining_time": "0:53:59"} | |
| {"current_steps": 1116, "total_steps": 1804, "loss": 0.325950562953949, "lr": 2.0696415239989593e-06, "epoch": 2.47450110864745, "percentage": 61.86, "elapsed_time": "1:27:16", "remaining_time": "0:53:48"} | |
| {"current_steps": 1118, "total_steps": 1804, "loss": 0.8618558645248413, "lr": 2.0617795898298855e-06, "epoch": 2.4789356984478936, "percentage": 61.97, "elapsed_time": "1:27:27", "remaining_time": "0:53:39"} | |
| {"current_steps": 1120, "total_steps": 1804, "loss": 0.8484733700752258, "lr": 2.053926914831112e-06, "epoch": 2.483370288248337, "percentage": 62.08, "elapsed_time": "1:27:35", "remaining_time": "0:53:29"} | |
| {"current_steps": 1122, "total_steps": 1804, "loss": 0.9692363142967224, "lr": 2.04608360465085e-06, "epoch": 2.4878048780487805, "percentage": 62.2, "elapsed_time": "1:27:45", "remaining_time": "0:53:20"} | |
| {"current_steps": 1124, "total_steps": 1804, "loss": 0.9982001781463623, "lr": 2.038249764811318e-06, "epoch": 2.492239467849224, "percentage": 62.31, "elapsed_time": "1:27:55", "remaining_time": "0:53:11"} | |
| {"current_steps": 1126, "total_steps": 1804, "loss": 0.9597415328025818, "lr": 2.0304255007073227e-06, "epoch": 2.4966740576496673, "percentage": 62.42, "elapsed_time": "1:28:05", "remaining_time": "0:53:02"} | |
| {"current_steps": 1128, "total_steps": 1804, "loss": 0.6873862147331238, "lr": 2.022610917604842e-06, "epoch": 2.5011086474501107, "percentage": 62.53, "elapsed_time": "1:28:15", "remaining_time": "0:52:53"} | |
| {"current_steps": 1130, "total_steps": 1804, "loss": 0.6469390392303467, "lr": 2.014806120639605e-06, "epoch": 2.505543237250554, "percentage": 62.64, "elapsed_time": "1:28:25", "remaining_time": "0:52:44"} | |
| {"current_steps": 1132, "total_steps": 1804, "loss": 0.7718120813369751, "lr": 2.007011214815684e-06, "epoch": 2.5099778270509976, "percentage": 62.75, "elapsed_time": "1:28:34", "remaining_time": "0:52:35"} | |
| {"current_steps": 1134, "total_steps": 1804, "loss": 0.5093148350715637, "lr": 1.9992263050040737e-06, "epoch": 2.5144124168514415, "percentage": 62.86, "elapsed_time": "1:28:41", "remaining_time": "0:52:24"} | |
| {"current_steps": 1136, "total_steps": 1804, "loss": 1.002577543258667, "lr": 1.991451495941289e-06, "epoch": 2.5188470066518844, "percentage": 62.97, "elapsed_time": "1:28:51", "remaining_time": "0:52:15"} | |
| {"current_steps": 1138, "total_steps": 1804, "loss": 0.7795249223709106, "lr": 1.983686892227948e-06, "epoch": 2.5232815964523283, "percentage": 63.08, "elapsed_time": "1:29:00", "remaining_time": "0:52:05"} | |
| {"current_steps": 1140, "total_steps": 1804, "loss": 0.9324785470962524, "lr": 1.975932598327369e-06, "epoch": 2.5277161862527717, "percentage": 63.19, "elapsed_time": "1:29:10", "remaining_time": "0:51:56"} | |
| {"current_steps": 1142, "total_steps": 1804, "loss": 0.4589383602142334, "lr": 1.9681887185641646e-06, "epoch": 2.532150776053215, "percentage": 63.3, "elapsed_time": "1:29:18", "remaining_time": "0:51:46"} | |
| {"current_steps": 1144, "total_steps": 1804, "loss": 0.5734773874282837, "lr": 1.9604553571228395e-06, "epoch": 2.5365853658536586, "percentage": 63.41, "elapsed_time": "1:29:25", "remaining_time": "0:51:35"} | |
| {"current_steps": 1146, "total_steps": 1804, "loss": 0.8844019770622253, "lr": 1.9527326180463855e-06, "epoch": 2.541019955654102, "percentage": 63.53, "elapsed_time": "1:29:36", "remaining_time": "0:51:26"} | |
| {"current_steps": 1148, "total_steps": 1804, "loss": 0.8842139840126038, "lr": 1.9450206052348823e-06, "epoch": 2.5454545454545454, "percentage": 63.64, "elapsed_time": "1:29:46", "remaining_time": "0:51:17"} | |
| {"current_steps": 1150, "total_steps": 1804, "loss": 0.8934570550918579, "lr": 1.9373194224441028e-06, "epoch": 2.549889135254989, "percentage": 63.75, "elapsed_time": "1:29:56", "remaining_time": "0:51:08"} | |
| {"current_steps": 1152, "total_steps": 1804, "loss": 0.63346928358078, "lr": 1.929629173284114e-06, "epoch": 2.5543237250554323, "percentage": 63.86, "elapsed_time": "1:30:04", "remaining_time": "0:50:58"} | |
| {"current_steps": 1154, "total_steps": 1804, "loss": 0.30481529235839844, "lr": 1.9219499612178836e-06, "epoch": 2.5587583148558757, "percentage": 63.97, "elapsed_time": "1:30:08", "remaining_time": "0:50:46"} | |
| {"current_steps": 1156, "total_steps": 1804, "loss": 0.5541834831237793, "lr": 1.9142818895598908e-06, "epoch": 2.5631929046563195, "percentage": 64.08, "elapsed_time": "1:30:14", "remaining_time": "0:50:35"} | |
| {"current_steps": 1158, "total_steps": 1804, "loss": 0.6360606551170349, "lr": 1.9066250614747317e-06, "epoch": 2.5676274944567625, "percentage": 64.19, "elapsed_time": "1:30:24", "remaining_time": "0:50:26"} | |
| {"current_steps": 1160, "total_steps": 1804, "loss": 0.824614405632019, "lr": 1.8989795799757348e-06, "epoch": 2.5720620842572064, "percentage": 64.3, "elapsed_time": "1:30:34", "remaining_time": "0:50:16"} | |
| {"current_steps": 1162, "total_steps": 1804, "loss": 0.8806930184364319, "lr": 1.8913455479235754e-06, "epoch": 2.57649667405765, "percentage": 64.41, "elapsed_time": "1:30:43", "remaining_time": "0:50:07"} | |
| {"current_steps": 1164, "total_steps": 1804, "loss": 0.8695907592773438, "lr": 1.8837230680248874e-06, "epoch": 2.5809312638580932, "percentage": 64.52, "elapsed_time": "1:30:53", "remaining_time": "0:49:58"} | |
| {"current_steps": 1166, "total_steps": 1804, "loss": 0.6029998660087585, "lr": 1.8761122428308875e-06, "epoch": 2.5853658536585367, "percentage": 64.63, "elapsed_time": "1:31:00", "remaining_time": "0:49:48"} | |
| {"current_steps": 1168, "total_steps": 1804, "loss": 0.9235411882400513, "lr": 1.8685131747359902e-06, "epoch": 2.58980044345898, "percentage": 64.75, "elapsed_time": "1:31:10", "remaining_time": "0:49:38"} | |
| {"current_steps": 1170, "total_steps": 1804, "loss": 0.8941707611083984, "lr": 1.8609259659764345e-06, "epoch": 2.5942350332594235, "percentage": 64.86, "elapsed_time": "1:31:20", "remaining_time": "0:49:29"} | |
| {"current_steps": 1172, "total_steps": 1804, "loss": 0.3670371174812317, "lr": 1.853350718628904e-06, "epoch": 2.598669623059867, "percentage": 64.97, "elapsed_time": "1:31:26", "remaining_time": "0:49:18"} | |
| {"current_steps": 1174, "total_steps": 1804, "loss": 0.6117712259292603, "lr": 1.845787534609157e-06, "epoch": 2.6031042128603104, "percentage": 65.08, "elapsed_time": "1:31:33", "remaining_time": "0:49:07"} | |
| {"current_steps": 1176, "total_steps": 1804, "loss": 0.6652023196220398, "lr": 1.8382365156706566e-06, "epoch": 2.6075388026607538, "percentage": 65.19, "elapsed_time": "1:31:42", "remaining_time": "0:48:58"} | |
| {"current_steps": 1178, "total_steps": 1804, "loss": 0.5446036458015442, "lr": 1.8306977634031976e-06, "epoch": 2.611973392461197, "percentage": 65.3, "elapsed_time": "1:31:49", "remaining_time": "0:48:48"} | |
| {"current_steps": 1180, "total_steps": 1804, "loss": 0.7938367128372192, "lr": 1.8231713792315403e-06, "epoch": 2.6164079822616406, "percentage": 65.41, "elapsed_time": "1:31:59", "remaining_time": "0:48:38"} | |
| {"current_steps": 1182, "total_steps": 1804, "loss": 0.8983339667320251, "lr": 1.8156574644140495e-06, "epoch": 2.6208425720620845, "percentage": 65.52, "elapsed_time": "1:32:09", "remaining_time": "0:48:29"} | |
| {"current_steps": 1184, "total_steps": 1804, "loss": 0.8714417815208435, "lr": 1.8081561200413295e-06, "epoch": 2.6252771618625275, "percentage": 65.63, "elapsed_time": "1:32:20", "remaining_time": "0:48:21"} | |
| {"current_steps": 1186, "total_steps": 1804, "loss": 0.874497652053833, "lr": 1.800667447034864e-06, "epoch": 2.6297117516629713, "percentage": 65.74, "elapsed_time": "1:32:30", "remaining_time": "0:48:12"} | |
| {"current_steps": 1188, "total_steps": 1804, "loss": 1.0262384414672852, "lr": 1.7931915461456573e-06, "epoch": 2.6341463414634148, "percentage": 65.85, "elapsed_time": "1:32:40", "remaining_time": "0:48:02"} | |
| {"current_steps": 1190, "total_steps": 1804, "loss": 0.5378797054290771, "lr": 1.7857285179528838e-06, "epoch": 2.638580931263858, "percentage": 65.96, "elapsed_time": "1:32:46", "remaining_time": "0:47:52"} | |
| {"current_steps": 1192, "total_steps": 1804, "loss": 0.7439752221107483, "lr": 1.7782784628625305e-06, "epoch": 2.6430155210643016, "percentage": 66.08, "elapsed_time": "1:32:56", "remaining_time": "0:47:43"} | |
| {"current_steps": 1194, "total_steps": 1804, "loss": 0.5809391736984253, "lr": 1.7708414811060437e-06, "epoch": 2.647450110864745, "percentage": 66.19, "elapsed_time": "1:33:06", "remaining_time": "0:47:34"} | |
| {"current_steps": 1196, "total_steps": 1804, "loss": 0.7553034424781799, "lr": 1.763417672738989e-06, "epoch": 2.6518847006651884, "percentage": 66.3, "elapsed_time": "1:33:16", "remaining_time": "0:47:24"} | |
| {"current_steps": 1198, "total_steps": 1804, "loss": 0.25871163606643677, "lr": 1.7560071376396953e-06, "epoch": 2.656319290465632, "percentage": 66.41, "elapsed_time": "1:33:22", "remaining_time": "0:47:14"} | |
| {"current_steps": 1200, "total_steps": 1804, "loss": 0.9024768471717834, "lr": 1.7486099755079197e-06, "epoch": 2.6607538802660753, "percentage": 66.52, "elapsed_time": "1:33:32", "remaining_time": "0:47:05"} | |
| {"current_steps": 1202, "total_steps": 1804, "loss": 0.9686548113822937, "lr": 1.7412262858634987e-06, "epoch": 2.6651884700665187, "percentage": 66.63, "elapsed_time": "1:33:42", "remaining_time": "0:46:56"} | |
| {"current_steps": 1204, "total_steps": 1804, "loss": 0.46400630474090576, "lr": 1.7338561680450171e-06, "epoch": 2.6696230598669626, "percentage": 66.74, "elapsed_time": "1:33:49", "remaining_time": "0:46:45"} | |
| {"current_steps": 1206, "total_steps": 1804, "loss": 0.7697018384933472, "lr": 1.7264997212084616e-06, "epoch": 2.6740576496674056, "percentage": 66.85, "elapsed_time": "1:33:59", "remaining_time": "0:46:36"} | |
| {"current_steps": 1208, "total_steps": 1804, "loss": 1.056341290473938, "lr": 1.7191570443258976e-06, "epoch": 2.6784922394678494, "percentage": 66.96, "elapsed_time": "1:34:06", "remaining_time": "0:46:26"} | |
| {"current_steps": 1210, "total_steps": 1804, "loss": 0.5748533606529236, "lr": 1.711828236184131e-06, "epoch": 2.682926829268293, "percentage": 67.07, "elapsed_time": "1:34:16", "remaining_time": "0:46:16"} | |
| {"current_steps": 1212, "total_steps": 1804, "loss": 0.31726494431495667, "lr": 1.704513395383378e-06, "epoch": 2.6873614190687363, "percentage": 67.18, "elapsed_time": "1:34:25", "remaining_time": "0:46:07"} | |
| {"current_steps": 1214, "total_steps": 1804, "loss": 0.6147481203079224, "lr": 1.6972126203359454e-06, "epoch": 2.6917960088691797, "percentage": 67.29, "elapsed_time": "1:34:32", "remaining_time": "0:45:56"} | |
| {"current_steps": 1216, "total_steps": 1804, "loss": 0.6757416129112244, "lr": 1.6899260092648995e-06, "epoch": 2.696230598669623, "percentage": 67.41, "elapsed_time": "1:34:41", "remaining_time": "0:45:47"} | |
| {"current_steps": 1218, "total_steps": 1804, "loss": 0.6192297339439392, "lr": 1.6826536602027471e-06, "epoch": 2.7006651884700665, "percentage": 67.52, "elapsed_time": "1:34:51", "remaining_time": "0:45:38"} | |
| {"current_steps": 1220, "total_steps": 1804, "loss": 1.0000026226043701, "lr": 1.6753956709901202e-06, "epoch": 2.70509977827051, "percentage": 67.63, "elapsed_time": "1:35:01", "remaining_time": "0:45:29"} | |
| {"current_steps": 1222, "total_steps": 1804, "loss": 0.7289214730262756, "lr": 1.6681521392744515e-06, "epoch": 2.7095343680709534, "percentage": 67.74, "elapsed_time": "1:35:10", "remaining_time": "0:45:19"} | |
| {"current_steps": 1224, "total_steps": 1804, "loss": 0.8310694694519043, "lr": 1.660923162508671e-06, "epoch": 2.713968957871397, "percentage": 67.85, "elapsed_time": "1:35:20", "remaining_time": "0:45:10"} | |
| {"current_steps": 1226, "total_steps": 1804, "loss": 0.47293317317962646, "lr": 1.6537088379498872e-06, "epoch": 2.7184035476718402, "percentage": 67.96, "elapsed_time": "1:35:29", "remaining_time": "0:45:01"} | |
| {"current_steps": 1228, "total_steps": 1804, "loss": 0.983069896697998, "lr": 1.6465092626580787e-06, "epoch": 2.7228381374722836, "percentage": 68.07, "elapsed_time": "1:35:41", "remaining_time": "0:44:52"} | |
| {"current_steps": 1230, "total_steps": 1804, "loss": 0.7170325517654419, "lr": 1.6393245334947942e-06, "epoch": 2.7272727272727275, "percentage": 68.18, "elapsed_time": "1:35:48", "remaining_time": "0:44:42"} | |
| {"current_steps": 1232, "total_steps": 1804, "loss": 0.882874608039856, "lr": 1.6321547471218432e-06, "epoch": 2.7317073170731705, "percentage": 68.29, "elapsed_time": "1:35:58", "remaining_time": "0:44:33"} | |
| {"current_steps": 1234, "total_steps": 1804, "loss": 0.5405250787734985, "lr": 1.6250000000000007e-06, "epoch": 2.7361419068736144, "percentage": 68.4, "elapsed_time": "1:36:08", "remaining_time": "0:44:24"} | |
| {"current_steps": 1236, "total_steps": 1804, "loss": 0.8325910568237305, "lr": 1.6178603883877032e-06, "epoch": 2.740576496674058, "percentage": 68.51, "elapsed_time": "1:36:17", "remaining_time": "0:44:15"} | |
| {"current_steps": 1238, "total_steps": 1804, "loss": 0.5409272313117981, "lr": 1.6107360083397604e-06, "epoch": 2.745011086474501, "percentage": 68.63, "elapsed_time": "1:36:24", "remaining_time": "0:44:04"} | |
| {"current_steps": 1240, "total_steps": 1804, "loss": 0.6126099824905396, "lr": 1.6036269557060594e-06, "epoch": 2.7494456762749446, "percentage": 68.74, "elapsed_time": "1:36:31", "remaining_time": "0:43:54"} | |
| {"current_steps": 1242, "total_steps": 1804, "loss": 0.9320923089981079, "lr": 1.5965333261302735e-06, "epoch": 2.753880266075388, "percentage": 68.85, "elapsed_time": "1:36:41", "remaining_time": "0:43:45"} | |
| {"current_steps": 1244, "total_steps": 1804, "loss": 1.0979969501495361, "lr": 1.5894552150485801e-06, "epoch": 2.7583148558758315, "percentage": 68.96, "elapsed_time": "1:36:51", "remaining_time": "0:43:36"} | |
| {"current_steps": 1246, "total_steps": 1804, "loss": 0.4379834234714508, "lr": 1.5823927176883725e-06, "epoch": 2.762749445676275, "percentage": 69.07, "elapsed_time": "1:36:58", "remaining_time": "0:43:25"} | |
| {"current_steps": 1248, "total_steps": 1804, "loss": 0.6199414134025574, "lr": 1.5753459290669792e-06, "epoch": 2.7671840354767183, "percentage": 69.18, "elapsed_time": "1:37:06", "remaining_time": "0:43:15"} | |
| {"current_steps": 1250, "total_steps": 1804, "loss": 0.6263423562049866, "lr": 1.5683149439903905e-06, "epoch": 2.7716186252771617, "percentage": 69.29, "elapsed_time": "1:37:16", "remaining_time": "0:43:06"} | |
| {"current_steps": 1252, "total_steps": 1804, "loss": 0.9981138706207275, "lr": 1.5612998570519746e-06, "epoch": 2.776053215077605, "percentage": 69.4, "elapsed_time": "1:37:26", "remaining_time": "0:42:57"} | |
| {"current_steps": 1254, "total_steps": 1804, "loss": 1.003664255142212, "lr": 1.5543007626312129e-06, "epoch": 2.7804878048780486, "percentage": 69.51, "elapsed_time": "1:37:35", "remaining_time": "0:42:48"} | |
| {"current_steps": 1256, "total_steps": 1804, "loss": 0.9180096983909607, "lr": 1.5473177548924267e-06, "epoch": 2.7849223946784925, "percentage": 69.62, "elapsed_time": "1:37:45", "remaining_time": "0:42:39"} | |
| {"current_steps": 1258, "total_steps": 1804, "loss": 0.18954633176326752, "lr": 1.5403509277835077e-06, "epoch": 2.7893569844789354, "percentage": 69.73, "elapsed_time": "1:37:49", "remaining_time": "0:42:27"} | |
| {"current_steps": 1260, "total_steps": 1804, "loss": 0.8927637338638306, "lr": 1.5334003750346608e-06, "epoch": 2.7937915742793793, "percentage": 69.84, "elapsed_time": "1:37:59", "remaining_time": "0:42:18"} | |
| {"current_steps": 1262, "total_steps": 1804, "loss": 0.6852344870567322, "lr": 1.5264661901571349e-06, "epoch": 2.7982261640798227, "percentage": 69.96, "elapsed_time": "1:38:08", "remaining_time": "0:42:09"} | |
| {"current_steps": 1264, "total_steps": 1804, "loss": 0.5791198015213013, "lr": 1.5195484664419732e-06, "epoch": 2.802660753880266, "percentage": 70.07, "elapsed_time": "1:38:15", "remaining_time": "0:41:58"} | |
| {"current_steps": 1266, "total_steps": 1804, "loss": 1.0053340196609497, "lr": 1.5126472969587502e-06, "epoch": 2.8070953436807096, "percentage": 70.18, "elapsed_time": "1:38:25", "remaining_time": "0:41:49"} | |
| {"current_steps": 1268, "total_steps": 1804, "loss": 0.9256460666656494, "lr": 1.5057627745543269e-06, "epoch": 2.811529933481153, "percentage": 70.29, "elapsed_time": "1:38:35", "remaining_time": "0:41:40"} | |
| {"current_steps": 1270, "total_steps": 1804, "loss": 0.872265100479126, "lr": 1.4988949918515947e-06, "epoch": 2.8159645232815964, "percentage": 70.4, "elapsed_time": "1:38:45", "remaining_time": "0:41:31"} | |
| {"current_steps": 1272, "total_steps": 1804, "loss": 0.38903388381004333, "lr": 1.4920440412482345e-06, "epoch": 2.82039911308204, "percentage": 70.51, "elapsed_time": "1:38:52", "remaining_time": "0:41:21"} | |
| {"current_steps": 1274, "total_steps": 1804, "loss": 0.5502209663391113, "lr": 1.485210014915473e-06, "epoch": 2.8248337028824833, "percentage": 70.62, "elapsed_time": "1:39:01", "remaining_time": "0:41:11"} | |
| {"current_steps": 1276, "total_steps": 1804, "loss": 0.9950301051139832, "lr": 1.4783930047968388e-06, "epoch": 2.8292682926829267, "percentage": 70.73, "elapsed_time": "1:39:11", "remaining_time": "0:41:02"} | |
| {"current_steps": 1278, "total_steps": 1804, "loss": 0.9035691618919373, "lr": 1.4715931026069273e-06, "epoch": 2.8337028824833705, "percentage": 70.84, "elapsed_time": "1:39:21", "remaining_time": "0:40:53"} | |
| {"current_steps": 1280, "total_steps": 1804, "loss": 0.5569464564323425, "lr": 1.4648103998301716e-06, "epoch": 2.8381374722838135, "percentage": 70.95, "elapsed_time": "1:39:30", "remaining_time": "0:40:44"} | |
| {"current_steps": 1282, "total_steps": 1804, "loss": 0.6820242404937744, "lr": 1.4580449877196035e-06, "epoch": 2.8425720620842574, "percentage": 71.06, "elapsed_time": "1:39:40", "remaining_time": "0:40:35"} | |
| {"current_steps": 1284, "total_steps": 1804, "loss": 0.6583420038223267, "lr": 1.4512969572956328e-06, "epoch": 2.847006651884701, "percentage": 71.18, "elapsed_time": "1:39:50", "remaining_time": "0:40:25"} | |
| {"current_steps": 1286, "total_steps": 1804, "loss": 0.9154214262962341, "lr": 1.4445663993448173e-06, "epoch": 2.8514412416851442, "percentage": 71.29, "elapsed_time": "1:40:00", "remaining_time": "0:40:16"} | |
| {"current_steps": 1288, "total_steps": 1804, "loss": 0.31365761160850525, "lr": 1.437853404418646e-06, "epoch": 2.8558758314855877, "percentage": 71.4, "elapsed_time": "1:40:06", "remaining_time": "0:40:06"} | |
| {"current_steps": 1290, "total_steps": 1804, "loss": 0.9376072883605957, "lr": 1.431158062832318e-06, "epoch": 2.860310421286031, "percentage": 71.51, "elapsed_time": "1:40:16", "remaining_time": "0:39:57"} | |
| {"current_steps": 1292, "total_steps": 1804, "loss": 1.0736974477767944, "lr": 1.4244804646635266e-06, "epoch": 2.8647450110864745, "percentage": 71.62, "elapsed_time": "1:40:26", "remaining_time": "0:39:48"} | |
| {"current_steps": 1294, "total_steps": 1804, "loss": 0.8537707924842834, "lr": 1.4178206997512522e-06, "epoch": 2.869179600886918, "percentage": 71.73, "elapsed_time": "1:40:36", "remaining_time": "0:39:39"} | |
| {"current_steps": 1296, "total_steps": 1804, "loss": 0.9072995781898499, "lr": 1.4111788576945467e-06, "epoch": 2.8736141906873613, "percentage": 71.84, "elapsed_time": "1:40:46", "remaining_time": "0:39:29"} | |
| {"current_steps": 1298, "total_steps": 1804, "loss": 0.5681540966033936, "lr": 1.4045550278513351e-06, "epoch": 2.8780487804878048, "percentage": 71.95, "elapsed_time": "1:40:53", "remaining_time": "0:39:19"} | |
| {"current_steps": 1300, "total_steps": 1804, "loss": 0.9094551205635071, "lr": 1.3979492993372074e-06, "epoch": 2.882483370288248, "percentage": 72.06, "elapsed_time": "1:41:03", "remaining_time": "0:39:10"} | |
| {"current_steps": 1302, "total_steps": 1804, "loss": 0.3109537363052368, "lr": 1.391361761024222e-06, "epoch": 2.8869179600886916, "percentage": 72.17, "elapsed_time": "1:41:09", "remaining_time": "0:39:00"} | |
| {"current_steps": 1304, "total_steps": 1804, "loss": 0.8516042828559875, "lr": 1.3847925015397146e-06, "epoch": 2.8913525498891355, "percentage": 72.28, "elapsed_time": "1:41:18", "remaining_time": "0:38:50"} | |
| {"current_steps": 1306, "total_steps": 1804, "loss": 0.7857693433761597, "lr": 1.3782416092650957e-06, "epoch": 2.8957871396895785, "percentage": 72.39, "elapsed_time": "1:41:28", "remaining_time": "0:38:41"} | |
| {"current_steps": 1308, "total_steps": 1804, "loss": 0.40065449476242065, "lr": 1.3717091723346699e-06, "epoch": 2.9002217294900223, "percentage": 72.51, "elapsed_time": "1:41:35", "remaining_time": "0:38:31"} | |
| {"current_steps": 1310, "total_steps": 1804, "loss": 0.518020749092102, "lr": 1.3651952786344485e-06, "epoch": 2.9046563192904657, "percentage": 72.62, "elapsed_time": "1:41:42", "remaining_time": "0:38:21"} | |
| {"current_steps": 1312, "total_steps": 1804, "loss": 0.8922036290168762, "lr": 1.3587000158009638e-06, "epoch": 2.909090909090909, "percentage": 72.73, "elapsed_time": "1:41:52", "remaining_time": "0:38:12"} | |
| {"current_steps": 1314, "total_steps": 1804, "loss": 0.9531146883964539, "lr": 1.3522234712200954e-06, "epoch": 2.9135254988913526, "percentage": 72.84, "elapsed_time": "1:42:01", "remaining_time": "0:38:02"} | |
| {"current_steps": 1316, "total_steps": 1804, "loss": 0.6479524374008179, "lr": 1.3457657320258878e-06, "epoch": 2.917960088691796, "percentage": 72.95, "elapsed_time": "1:42:06", "remaining_time": "0:37:51"} | |
| {"current_steps": 1318, "total_steps": 1804, "loss": 0.9447596073150635, "lr": 1.3393268850993852e-06, "epoch": 2.9223946784922394, "percentage": 73.06, "elapsed_time": "1:42:16", "remaining_time": "0:37:42"} | |
| {"current_steps": 1320, "total_steps": 1804, "loss": 0.7807310819625854, "lr": 1.332907017067458e-06, "epoch": 2.926829268292683, "percentage": 73.17, "elapsed_time": "1:42:23", "remaining_time": "0:37:32"} | |
| {"current_steps": 1322, "total_steps": 1804, "loss": 0.8912954926490784, "lr": 1.3265062143016378e-06, "epoch": 2.9312638580931263, "percentage": 73.28, "elapsed_time": "1:42:33", "remaining_time": "0:37:23"} | |
| {"current_steps": 1324, "total_steps": 1804, "loss": 0.9309762716293335, "lr": 1.3201245629169574e-06, "epoch": 2.9356984478935697, "percentage": 73.39, "elapsed_time": "1:42:42", "remaining_time": "0:37:14"} | |
| {"current_steps": 1326, "total_steps": 1804, "loss": 0.8276110887527466, "lr": 1.3137621487707902e-06, "epoch": 2.9401330376940136, "percentage": 73.5, "elapsed_time": "1:42:50", "remaining_time": "0:37:04"} | |
| {"current_steps": 1328, "total_steps": 1804, "loss": 0.9218543171882629, "lr": 1.307419057461697e-06, "epoch": 2.9445676274944566, "percentage": 73.61, "elapsed_time": "1:43:00", "remaining_time": "0:36:55"} | |
| {"current_steps": 1330, "total_steps": 1804, "loss": 0.8906182050704956, "lr": 1.3010953743282724e-06, "epoch": 2.9490022172949004, "percentage": 73.73, "elapsed_time": "1:43:10", "remaining_time": "0:36:46"} | |
| {"current_steps": 1332, "total_steps": 1804, "loss": 0.907779335975647, "lr": 1.294791184447996e-06, "epoch": 2.953436807095344, "percentage": 73.84, "elapsed_time": "1:43:21", "remaining_time": "0:36:37"} | |
| {"current_steps": 1334, "total_steps": 1804, "loss": 0.45520275831222534, "lr": 1.2885065726360925e-06, "epoch": 2.9578713968957873, "percentage": 73.95, "elapsed_time": "1:43:27", "remaining_time": "0:36:27"} | |
| {"current_steps": 1336, "total_steps": 1804, "loss": 0.9082697629928589, "lr": 1.282241623444386e-06, "epoch": 2.9623059866962307, "percentage": 74.06, "elapsed_time": "1:43:37", "remaining_time": "0:36:18"} | |
| {"current_steps": 1338, "total_steps": 1804, "loss": 0.8805263042449951, "lr": 1.2759964211601633e-06, "epoch": 2.966740576496674, "percentage": 74.17, "elapsed_time": "1:43:47", "remaining_time": "0:36:09"} | |
| {"current_steps": 1340, "total_steps": 1804, "loss": 0.7469978928565979, "lr": 1.269771049805042e-06, "epoch": 2.9711751662971175, "percentage": 74.28, "elapsed_time": "1:43:57", "remaining_time": "0:35:59"} | |
| {"current_steps": 1342, "total_steps": 1804, "loss": 0.6190311312675476, "lr": 1.2635655931338364e-06, "epoch": 2.975609756097561, "percentage": 74.39, "elapsed_time": "1:44:07", "remaining_time": "0:35:50"} | |
| {"current_steps": 1344, "total_steps": 1804, "loss": 0.19588389992713928, "lr": 1.2573801346334355e-06, "epoch": 2.9800443458980044, "percentage": 74.5, "elapsed_time": "1:44:14", "remaining_time": "0:35:40"} | |
| {"current_steps": 1346, "total_steps": 1804, "loss": 0.6986634731292725, "lr": 1.251214757521675e-06, "epoch": 2.984478935698448, "percentage": 74.61, "elapsed_time": "1:44:21", "remaining_time": "0:35:30"} | |
| {"current_steps": 1348, "total_steps": 1804, "loss": 0.5783390998840332, "lr": 1.2450695447462214e-06, "epoch": 2.988913525498891, "percentage": 74.72, "elapsed_time": "1:44:28", "remaining_time": "0:35:20"} | |
| {"current_steps": 1350, "total_steps": 1804, "loss": 0.6150118112564087, "lr": 1.2389445789834534e-06, "epoch": 2.9933481152993346, "percentage": 74.83, "elapsed_time": "1:44:37", "remaining_time": "0:35:10"} | |
| {"current_steps": 1352, "total_steps": 1804, "loss": 0.6677907109260559, "lr": 1.2328399426373511e-06, "epoch": 2.9977827050997785, "percentage": 74.94, "elapsed_time": "1:44:45", "remaining_time": "0:35:01"} | |
| {"current_steps": 1354, "total_steps": 1804, "loss": 0.8499741554260254, "lr": 1.2267557178383886e-06, "epoch": 3.002217294900222, "percentage": 75.06, "elapsed_time": "1:44:56", "remaining_time": "0:34:52"} | |
| {"current_steps": 1356, "total_steps": 1804, "loss": 0.42442217469215393, "lr": 1.220691986442424e-06, "epoch": 3.0066518847006654, "percentage": 75.17, "elapsed_time": "1:45:03", "remaining_time": "0:34:42"} | |
| {"current_steps": 1358, "total_steps": 1804, "loss": 0.6506487727165222, "lr": 1.2146488300296047e-06, "epoch": 3.011086474501109, "percentage": 75.28, "elapsed_time": "1:45:14", "remaining_time": "0:34:33"} | |
| {"current_steps": 1360, "total_steps": 1804, "loss": 0.8044725656509399, "lr": 1.2086263299032652e-06, "epoch": 3.015521064301552, "percentage": 75.39, "elapsed_time": "1:45:24", "remaining_time": "0:34:24"} | |
| {"current_steps": 1362, "total_steps": 1804, "loss": 0.7103544473648071, "lr": 1.2026245670888343e-06, "epoch": 3.0199556541019956, "percentage": 75.5, "elapsed_time": "1:45:34", "remaining_time": "0:34:15"} | |
| {"current_steps": 1364, "total_steps": 1804, "loss": 0.8767702579498291, "lr": 1.196643622332747e-06, "epoch": 3.024390243902439, "percentage": 75.61, "elapsed_time": "1:45:44", "remaining_time": "0:34:06"} | |
| {"current_steps": 1366, "total_steps": 1804, "loss": 0.30692797899246216, "lr": 1.1906835761013547e-06, "epoch": 3.0288248337028825, "percentage": 75.72, "elapsed_time": "1:45:51", "remaining_time": "0:33:56"} | |
| {"current_steps": 1368, "total_steps": 1804, "loss": 0.46566513180732727, "lr": 1.184744508579846e-06, "epoch": 3.033259423503326, "percentage": 75.83, "elapsed_time": "1:46:01", "remaining_time": "0:33:47"} | |
| {"current_steps": 1370, "total_steps": 1804, "loss": 0.5476133227348328, "lr": 1.178826499671167e-06, "epoch": 3.0376940133037693, "percentage": 75.94, "elapsed_time": "1:46:08", "remaining_time": "0:33:37"} | |
| {"current_steps": 1372, "total_steps": 1804, "loss": 0.7700155377388, "lr": 1.172929628994943e-06, "epoch": 3.0421286031042127, "percentage": 76.05, "elapsed_time": "1:46:18", "remaining_time": "0:33:28"} | |
| {"current_steps": 1374, "total_steps": 1804, "loss": 0.29538294672966003, "lr": 1.167053975886413e-06, "epoch": 3.046563192904656, "percentage": 76.16, "elapsed_time": "1:46:23", "remaining_time": "0:33:17"} | |
| {"current_steps": 1376, "total_steps": 1804, "loss": 0.4983513057231903, "lr": 1.1611996193953569e-06, "epoch": 3.0509977827050996, "percentage": 76.27, "elapsed_time": "1:46:32", "remaining_time": "0:33:08"} | |
| {"current_steps": 1378, "total_steps": 1804, "loss": 0.3278832733631134, "lr": 1.1553666382850366e-06, "epoch": 3.0554323725055434, "percentage": 76.39, "elapsed_time": "1:46:39", "remaining_time": "0:32:58"} | |
| {"current_steps": 1380, "total_steps": 1804, "loss": 0.319016695022583, "lr": 1.1495551110311324e-06, "epoch": 3.059866962305987, "percentage": 76.5, "elapsed_time": "1:46:46", "remaining_time": "0:32:48"} | |
| {"current_steps": 1382, "total_steps": 1804, "loss": 0.2882533073425293, "lr": 1.1437651158206904e-06, "epoch": 3.0643015521064303, "percentage": 76.61, "elapsed_time": "1:46:53", "remaining_time": "0:32:38"} | |
| {"current_steps": 1384, "total_steps": 1804, "loss": 0.5356056690216064, "lr": 1.137996730551069e-06, "epoch": 3.0687361419068737, "percentage": 76.72, "elapsed_time": "1:47:02", "remaining_time": "0:32:28"} | |
| {"current_steps": 1386, "total_steps": 1804, "loss": 0.31114962697029114, "lr": 1.1322500328288897e-06, "epoch": 3.073170731707317, "percentage": 76.83, "elapsed_time": "1:47:09", "remaining_time": "0:32:19"} | |
| {"current_steps": 1388, "total_steps": 1804, "loss": 0.4476943016052246, "lr": 1.1265250999689966e-06, "epoch": 3.0776053215077606, "percentage": 76.94, "elapsed_time": "1:47:16", "remaining_time": "0:32:08"} | |
| {"current_steps": 1390, "total_steps": 1804, "loss": 0.6670686602592468, "lr": 1.1208220089934118e-06, "epoch": 3.082039911308204, "percentage": 77.05, "elapsed_time": "1:47:25", "remaining_time": "0:31:59"} | |
| {"current_steps": 1392, "total_steps": 1804, "loss": 0.31225770711898804, "lr": 1.1151408366303024e-06, "epoch": 3.0864745011086474, "percentage": 77.16, "elapsed_time": "1:47:35", "remaining_time": "0:31:50"} | |
| {"current_steps": 1394, "total_steps": 1804, "loss": 0.513544499874115, "lr": 1.1094816593129475e-06, "epoch": 3.090909090909091, "percentage": 77.27, "elapsed_time": "1:47:44", "remaining_time": "0:31:41"} | |
| {"current_steps": 1396, "total_steps": 1804, "loss": 0.7358729243278503, "lr": 1.1038445531787083e-06, "epoch": 3.0953436807095343, "percentage": 77.38, "elapsed_time": "1:47:54", "remaining_time": "0:31:32"} | |
| {"current_steps": 1398, "total_steps": 1804, "loss": 0.739376425743103, "lr": 1.098229594068007e-06, "epoch": 3.0997782705099777, "percentage": 77.49, "elapsed_time": "1:48:04", "remaining_time": "0:31:23"} | |
| {"current_steps": 1400, "total_steps": 1804, "loss": 0.562745213508606, "lr": 1.0926368575233032e-06, "epoch": 3.104212860310421, "percentage": 77.61, "elapsed_time": "1:48:11", "remaining_time": "0:31:13"} | |
| {"current_steps": 1402, "total_steps": 1804, "loss": 0.2242104858160019, "lr": 1.087066418788078e-06, "epoch": 3.1086474501108645, "percentage": 77.72, "elapsed_time": "1:48:17", "remaining_time": "0:31:03"} | |
| {"current_steps": 1404, "total_steps": 1804, "loss": 0.47392910718917847, "lr": 1.0815183528058248e-06, "epoch": 3.1130820399113084, "percentage": 77.83, "elapsed_time": "1:48:24", "remaining_time": "0:30:53"} | |
| {"current_steps": 1406, "total_steps": 1804, "loss": 0.6808566451072693, "lr": 1.0759927342190362e-06, "epoch": 3.117516629711752, "percentage": 77.94, "elapsed_time": "1:48:34", "remaining_time": "0:30:44"} | |
| {"current_steps": 1408, "total_steps": 1804, "loss": 0.8002766966819763, "lr": 1.0704896373682052e-06, "epoch": 3.1219512195121952, "percentage": 78.05, "elapsed_time": "1:48:44", "remaining_time": "0:30:35"} | |
| {"current_steps": 1410, "total_steps": 1804, "loss": 0.6717104315757751, "lr": 1.0650091362908189e-06, "epoch": 3.1263858093126387, "percentage": 78.16, "elapsed_time": "1:48:54", "remaining_time": "0:30:25"} | |
| {"current_steps": 1412, "total_steps": 1804, "loss": 0.6879040598869324, "lr": 1.0595513047203693e-06, "epoch": 3.130820399113082, "percentage": 78.27, "elapsed_time": "1:49:04", "remaining_time": "0:30:17"} | |
| {"current_steps": 1414, "total_steps": 1804, "loss": 0.4185694754123688, "lr": 1.0541162160853538e-06, "epoch": 3.1352549889135255, "percentage": 78.38, "elapsed_time": "1:49:13", "remaining_time": "0:30:07"} | |
| {"current_steps": 1416, "total_steps": 1804, "loss": 0.057561103254556656, "lr": 1.0487039435082941e-06, "epoch": 3.139689578713969, "percentage": 78.49, "elapsed_time": "1:49:17", "remaining_time": "0:29:56"} | |
| {"current_steps": 1418, "total_steps": 1804, "loss": 0.6490657925605774, "lr": 1.0433145598047495e-06, "epoch": 3.1441241685144123, "percentage": 78.6, "elapsed_time": "1:49:26", "remaining_time": "0:29:47"} | |
| {"current_steps": 1420, "total_steps": 1804, "loss": 0.7569578886032104, "lr": 1.0379481374823358e-06, "epoch": 3.1485587583148558, "percentage": 78.71, "elapsed_time": "1:49:35", "remaining_time": "0:29:38"} | |
| {"current_steps": 1422, "total_steps": 1804, "loss": 0.6902734041213989, "lr": 1.032604748739751e-06, "epoch": 3.152993348115299, "percentage": 78.82, "elapsed_time": "1:49:44", "remaining_time": "0:29:28"} | |
| {"current_steps": 1424, "total_steps": 1804, "loss": 0.12579640746116638, "lr": 1.0272844654658069e-06, "epoch": 3.1574279379157426, "percentage": 78.94, "elapsed_time": "1:49:51", "remaining_time": "0:29:18"} | |
| {"current_steps": 1426, "total_steps": 1804, "loss": 0.6934728026390076, "lr": 1.0219873592384556e-06, "epoch": 3.1618625277161865, "percentage": 79.05, "elapsed_time": "1:50:00", "remaining_time": "0:29:09"} | |
| {"current_steps": 1428, "total_steps": 1804, "loss": 0.7339826822280884, "lr": 1.016713501323834e-06, "epoch": 3.16629711751663, "percentage": 79.16, "elapsed_time": "1:50:10", "remaining_time": "0:29:00"} | |
| {"current_steps": 1430, "total_steps": 1804, "loss": 0.4983486831188202, "lr": 1.0114629626752973e-06, "epoch": 3.1707317073170733, "percentage": 79.27, "elapsed_time": "1:50:18", "remaining_time": "0:28:50"} | |
| {"current_steps": 1432, "total_steps": 1804, "loss": 0.39363744854927063, "lr": 1.0062358139324715e-06, "epoch": 3.1751662971175167, "percentage": 79.38, "elapsed_time": "1:50:25", "remaining_time": "0:28:41"} | |
| {"current_steps": 1434, "total_steps": 1804, "loss": 0.5175668597221375, "lr": 1.0010321254202992e-06, "epoch": 3.17960088691796, "percentage": 79.49, "elapsed_time": "1:50:32", "remaining_time": "0:28:31"} | |
| {"current_steps": 1436, "total_steps": 1804, "loss": 0.17975324392318726, "lr": 9.958519671480919e-07, "epoch": 3.1840354767184036, "percentage": 79.6, "elapsed_time": "1:50:39", "remaining_time": "0:28:21"} | |
| {"current_steps": 1438, "total_steps": 1804, "loss": 0.42570653557777405, "lr": 9.906954088085929e-07, "epoch": 3.188470066518847, "percentage": 79.71, "elapsed_time": "1:50:48", "remaining_time": "0:28:12"} | |
| {"current_steps": 1440, "total_steps": 1804, "loss": 0.585996150970459, "lr": 9.85562519777035e-07, "epoch": 3.1929046563192904, "percentage": 79.82, "elapsed_time": "1:50:57", "remaining_time": "0:28:02"} | |
| {"current_steps": 1442, "total_steps": 1804, "loss": 0.7266984581947327, "lr": 9.804533691102112e-07, "epoch": 3.197339246119734, "percentage": 79.93, "elapsed_time": "1:51:07", "remaining_time": "0:27:53"} | |
| {"current_steps": 1444, "total_steps": 1804, "loss": 0.7630691528320312, "lr": 9.75368025545542e-07, "epoch": 3.2017738359201773, "percentage": 80.04, "elapsed_time": "1:51:18", "remaining_time": "0:27:44"} | |
| {"current_steps": 1446, "total_steps": 1804, "loss": 0.6669173240661621, "lr": 9.703065575001518e-07, "epoch": 3.2062084257206207, "percentage": 80.16, "elapsed_time": "1:51:28", "remaining_time": "0:27:35"} | |
| {"current_steps": 1448, "total_steps": 1804, "loss": 0.6446712613105774, "lr": 9.65269033069952e-07, "epoch": 3.210643015521064, "percentage": 80.27, "elapsed_time": "1:51:39", "remaining_time": "0:27:27"} | |
| {"current_steps": 1450, "total_steps": 1804, "loss": 0.8526778817176819, "lr": 9.602555200287184e-07, "epoch": 3.2150776053215075, "percentage": 80.38, "elapsed_time": "1:51:50", "remaining_time": "0:27:18"} | |
| {"current_steps": 1452, "total_steps": 1804, "loss": 0.4225665330886841, "lr": 9.552660858271835e-07, "epoch": 3.2195121951219514, "percentage": 80.49, "elapsed_time": "1:52:00", "remaining_time": "0:27:09"} | |
| {"current_steps": 1454, "total_steps": 1804, "loss": 0.5034030675888062, "lr": 9.503007975921294e-07, "epoch": 3.223946784922395, "percentage": 80.6, "elapsed_time": "1:52:08", "remaining_time": "0:26:59"} | |
| {"current_steps": 1456, "total_steps": 1804, "loss": 0.7649686336517334, "lr": 9.453597221254821e-07, "epoch": 3.2283813747228383, "percentage": 80.71, "elapsed_time": "1:52:19", "remaining_time": "0:26:50"} | |
| {"current_steps": 1458, "total_steps": 1804, "loss": 0.45995619893074036, "lr": 9.404429259034156e-07, "epoch": 3.2328159645232817, "percentage": 80.82, "elapsed_time": "1:52:29", "remaining_time": "0:26:41"} | |
| {"current_steps": 1460, "total_steps": 1804, "loss": 0.8184725046157837, "lr": 9.355504750754543e-07, "epoch": 3.237250554323725, "percentage": 80.93, "elapsed_time": "1:52:39", "remaining_time": "0:26:32"} | |
| {"current_steps": 1462, "total_steps": 1804, "loss": 0.5478826761245728, "lr": 9.306824354635866e-07, "epoch": 3.2416851441241685, "percentage": 81.04, "elapsed_time": "1:52:46", "remaining_time": "0:26:22"} | |
| {"current_steps": 1464, "total_steps": 1804, "loss": 0.69716876745224, "lr": 9.258388725613776e-07, "epoch": 3.246119733924612, "percentage": 81.15, "elapsed_time": "1:52:56", "remaining_time": "0:26:13"} | |
| {"current_steps": 1466, "total_steps": 1804, "loss": 0.8049119710922241, "lr": 9.21019851533086e-07, "epoch": 3.2505543237250554, "percentage": 81.26, "elapsed_time": "1:53:06", "remaining_time": "0:26:04"} | |
| {"current_steps": 1468, "total_steps": 1804, "loss": 0.12469253689050674, "lr": 9.162254372127921e-07, "epoch": 3.254988913525499, "percentage": 81.37, "elapsed_time": "1:53:12", "remaining_time": "0:25:54"} | |
| {"current_steps": 1470, "total_steps": 1804, "loss": 0.8042660355567932, "lr": 9.114556941035199e-07, "epoch": 3.259423503325942, "percentage": 81.49, "elapsed_time": "1:53:22", "remaining_time": "0:25:45"} | |
| {"current_steps": 1472, "total_steps": 1804, "loss": 0.8942850828170776, "lr": 9.067106863763752e-07, "epoch": 3.2638580931263856, "percentage": 81.6, "elapsed_time": "1:53:31", "remaining_time": "0:25:36"} | |
| {"current_steps": 1474, "total_steps": 1804, "loss": 0.3387027978897095, "lr": 9.01990477869677e-07, "epoch": 3.2682926829268295, "percentage": 81.71, "elapsed_time": "1:53:41", "remaining_time": "0:25:27"} | |
| {"current_steps": 1476, "total_steps": 1804, "loss": 0.5067039728164673, "lr": 8.972951320881014e-07, "epoch": 3.2727272727272725, "percentage": 81.82, "elapsed_time": "1:53:50", "remaining_time": "0:25:17"} | |
| {"current_steps": 1478, "total_steps": 1804, "loss": 0.4391624331474304, "lr": 8.92624712201827e-07, "epoch": 3.2771618625277164, "percentage": 81.93, "elapsed_time": "1:53:56", "remaining_time": "0:25:08"} | |
| {"current_steps": 1480, "total_steps": 1804, "loss": 0.5848779678344727, "lr": 8.879792810456861e-07, "epoch": 3.2815964523281598, "percentage": 82.04, "elapsed_time": "1:54:04", "remaining_time": "0:24:58"} | |
| {"current_steps": 1482, "total_steps": 1804, "loss": 0.6908966898918152, "lr": 8.833589011183147e-07, "epoch": 3.286031042128603, "percentage": 82.15, "elapsed_time": "1:54:13", "remaining_time": "0:24:48"} | |
| {"current_steps": 1484, "total_steps": 1804, "loss": 0.541054904460907, "lr": 8.78763634581318e-07, "epoch": 3.2904656319290466, "percentage": 82.26, "elapsed_time": "1:54:23", "remaining_time": "0:24:39"} | |
| {"current_steps": 1486, "total_steps": 1804, "loss": 0.4378011226654053, "lr": 8.741935432584292e-07, "epoch": 3.29490022172949, "percentage": 82.37, "elapsed_time": "1:54:33", "remaining_time": "0:24:30"} | |
| {"current_steps": 1488, "total_steps": 1804, "loss": 0.4610789716243744, "lr": 8.696486886346805e-07, "epoch": 3.2993348115299335, "percentage": 82.48, "elapsed_time": "1:54:42", "remaining_time": "0:24:21"} | |
| {"current_steps": 1490, "total_steps": 1804, "loss": 0.26280251145362854, "lr": 8.651291318555745e-07, "epoch": 3.303769401330377, "percentage": 82.59, "elapsed_time": "1:54:49", "remaining_time": "0:24:11"} | |
| {"current_steps": 1492, "total_steps": 1804, "loss": 0.7376017570495605, "lr": 8.606349337262623e-07, "epoch": 3.3082039911308203, "percentage": 82.71, "elapsed_time": "1:54:59", "remaining_time": "0:24:02"} | |
| {"current_steps": 1494, "total_steps": 1804, "loss": 0.5012311935424805, "lr": 8.561661547107243e-07, "epoch": 3.3126385809312637, "percentage": 82.82, "elapsed_time": "1:55:08", "remaining_time": "0:23:53"} | |
| {"current_steps": 1496, "total_steps": 1804, "loss": 0.5897710919380188, "lr": 8.517228549309588e-07, "epoch": 3.317073170731707, "percentage": 82.93, "elapsed_time": "1:55:18", "remaining_time": "0:23:44"} | |
| {"current_steps": 1498, "total_steps": 1804, "loss": 0.7489433288574219, "lr": 8.473050941661717e-07, "epoch": 3.3215077605321506, "percentage": 83.04, "elapsed_time": "1:55:28", "remaining_time": "0:23:35"} | |
| {"current_steps": 1500, "total_steps": 1804, "loss": 0.40758612751960754, "lr": 8.429129318519711e-07, "epoch": 3.3259423503325944, "percentage": 83.15, "elapsed_time": "1:55:35", "remaining_time": "0:23:25"} | |
| {"current_steps": 1502, "total_steps": 1804, "loss": 0.749980092048645, "lr": 8.38546427079571e-07, "epoch": 3.330376940133038, "percentage": 83.26, "elapsed_time": "1:55:45", "remaining_time": "0:23:16"} | |
| {"current_steps": 1504, "total_steps": 1804, "loss": 0.7348231673240662, "lr": 8.342056385949929e-07, "epoch": 3.3348115299334813, "percentage": 83.37, "elapsed_time": "1:55:55", "remaining_time": "0:23:07"} | |
| {"current_steps": 1506, "total_steps": 1804, "loss": 0.32258349657058716, "lr": 8.298906247982768e-07, "epoch": 3.3392461197339247, "percentage": 83.48, "elapsed_time": "1:56:01", "remaining_time": "0:22:57"} | |
| {"current_steps": 1508, "total_steps": 1804, "loss": 0.4655570089817047, "lr": 8.25601443742697e-07, "epoch": 3.343680709534368, "percentage": 83.59, "elapsed_time": "1:56:09", "remaining_time": "0:22:47"} | |
| {"current_steps": 1510, "total_steps": 1804, "loss": 0.8381154537200928, "lr": 8.213381531339776e-07, "epoch": 3.3481152993348116, "percentage": 83.7, "elapsed_time": "1:56:18", "remaining_time": "0:22:38"} | |
| {"current_steps": 1512, "total_steps": 1804, "loss": 0.40903714299201965, "lr": 8.1710081032952e-07, "epoch": 3.352549889135255, "percentage": 83.81, "elapsed_time": "1:56:26", "remaining_time": "0:22:29"} | |
| {"current_steps": 1514, "total_steps": 1804, "loss": 0.78590327501297, "lr": 8.128894723376285e-07, "epoch": 3.3569844789356984, "percentage": 83.92, "elapsed_time": "1:56:36", "remaining_time": "0:22:20"} | |
| {"current_steps": 1516, "total_steps": 1804, "loss": 0.5213326215744019, "lr": 8.087041958167438e-07, "epoch": 3.361419068736142, "percentage": 84.04, "elapsed_time": "1:56:45", "remaining_time": "0:22:10"} | |
| {"current_steps": 1518, "total_steps": 1804, "loss": 0.5064011812210083, "lr": 8.04545037074683e-07, "epoch": 3.3658536585365852, "percentage": 84.15, "elapsed_time": "1:56:52", "remaining_time": "0:22:01"} | |
| {"current_steps": 1520, "total_steps": 1804, "loss": 0.7650377154350281, "lr": 8.004120520678768e-07, "epoch": 3.3702882483370287, "percentage": 84.26, "elapsed_time": "1:57:02", "remaining_time": "0:21:52"} | |
| {"current_steps": 1522, "total_steps": 1804, "loss": 0.45021387934684753, "lr": 7.963052964006243e-07, "epoch": 3.374722838137472, "percentage": 84.37, "elapsed_time": "1:57:08", "remaining_time": "0:21:42"} | |
| {"current_steps": 1524, "total_steps": 1804, "loss": 0.6630456447601318, "lr": 7.922248253243367e-07, "epoch": 3.3791574279379155, "percentage": 84.48, "elapsed_time": "1:57:17", "remaining_time": "0:21:32"} | |
| {"current_steps": 1526, "total_steps": 1804, "loss": 0.6750819683074951, "lr": 7.881706937368005e-07, "epoch": 3.3835920177383594, "percentage": 84.59, "elapsed_time": "1:57:26", "remaining_time": "0:21:23"} | |
| {"current_steps": 1528, "total_steps": 1804, "loss": 0.37881627678871155, "lr": 7.84142956181436e-07, "epoch": 3.388026607538803, "percentage": 84.7, "elapsed_time": "1:57:36", "remaining_time": "0:21:14"} | |
| {"current_steps": 1530, "total_steps": 1804, "loss": 0.3390671908855438, "lr": 7.801416668465621e-07, "epoch": 3.3924611973392462, "percentage": 84.81, "elapsed_time": "1:57:42", "remaining_time": "0:21:04"} | |
| {"current_steps": 1532, "total_steps": 1804, "loss": 0.2297561913728714, "lr": 7.76166879564672e-07, "epoch": 3.3968957871396896, "percentage": 84.92, "elapsed_time": "1:57:52", "remaining_time": "0:20:55"} | |
| {"current_steps": 1534, "total_steps": 1804, "loss": 0.6209254860877991, "lr": 7.722186478117031e-07, "epoch": 3.401330376940133, "percentage": 85.03, "elapsed_time": "1:58:01", "remaining_time": "0:20:46"} | |
| {"current_steps": 1536, "total_steps": 1804, "loss": 0.6978744864463806, "lr": 7.682970247063212e-07, "epoch": 3.4057649667405765, "percentage": 85.14, "elapsed_time": "1:58:11", "remaining_time": "0:20:37"} | |
| {"current_steps": 1538, "total_steps": 1804, "loss": 0.6786344647407532, "lr": 7.644020630092066e-07, "epoch": 3.41019955654102, "percentage": 85.25, "elapsed_time": "1:58:21", "remaining_time": "0:20:28"} | |
| {"current_steps": 1540, "total_steps": 1804, "loss": 0.7001453042030334, "lr": 7.605338151223401e-07, "epoch": 3.4146341463414633, "percentage": 85.37, "elapsed_time": "1:58:31", "remaining_time": "0:20:19"} | |
| {"current_steps": 1542, "total_steps": 1804, "loss": 0.5472738742828369, "lr": 7.566923330883029e-07, "epoch": 3.4190687361419068, "percentage": 85.48, "elapsed_time": "1:58:39", "remaining_time": "0:20:09"} | |
| {"current_steps": 1544, "total_steps": 1804, "loss": 0.6272318363189697, "lr": 7.528776685895731e-07, "epoch": 3.42350332594235, "percentage": 85.59, "elapsed_time": "1:58:46", "remaining_time": "0:20:00"} | |
| {"current_steps": 1546, "total_steps": 1804, "loss": 0.25309649109840393, "lr": 7.490898729478312e-07, "epoch": 3.4279379157427936, "percentage": 85.7, "elapsed_time": "1:58:56", "remaining_time": "0:19:50"} | |
| {"current_steps": 1548, "total_steps": 1804, "loss": 0.1823056936264038, "lr": 7.45328997123271e-07, "epoch": 3.4323725055432375, "percentage": 85.81, "elapsed_time": "1:59:00", "remaining_time": "0:19:40"} | |
| {"current_steps": 1550, "total_steps": 1804, "loss": 0.8248109221458435, "lr": 7.415950917139106e-07, "epoch": 3.436807095343681, "percentage": 85.92, "elapsed_time": "1:59:09", "remaining_time": "0:19:31"} | |
| {"current_steps": 1552, "total_steps": 1804, "loss": 0.6940004229545593, "lr": 7.378882069549166e-07, "epoch": 3.4412416851441243, "percentage": 86.03, "elapsed_time": "1:59:18", "remaining_time": "0:19:22"} | |
| {"current_steps": 1554, "total_steps": 1804, "loss": 0.3549342155456543, "lr": 7.342083927179235e-07, "epoch": 3.4456762749445677, "percentage": 86.14, "elapsed_time": "1:59:26", "remaining_time": "0:19:12"} | |
| {"current_steps": 1556, "total_steps": 1804, "loss": 0.808836817741394, "lr": 7.30555698510366e-07, "epoch": 3.450110864745011, "percentage": 86.25, "elapsed_time": "1:59:36", "remaining_time": "0:19:03"} | |
| {"current_steps": 1558, "total_steps": 1804, "loss": 0.7992438077926636, "lr": 7.269301734748107e-07, "epoch": 3.4545454545454546, "percentage": 86.36, "elapsed_time": "1:59:46", "remaining_time": "0:18:54"} | |
| {"current_steps": 1560, "total_steps": 1804, "loss": 0.7546770572662354, "lr": 7.233318663882968e-07, "epoch": 3.458980044345898, "percentage": 86.47, "elapsed_time": "1:59:55", "remaining_time": "0:18:45"} | |
| {"current_steps": 1562, "total_steps": 1804, "loss": 0.2441236525774002, "lr": 7.197608256616792e-07, "epoch": 3.4634146341463414, "percentage": 86.59, "elapsed_time": "2:00:02", "remaining_time": "0:18:35"} | |
| {"current_steps": 1564, "total_steps": 1804, "loss": 0.72310870885849, "lr": 7.162170993389763e-07, "epoch": 3.467849223946785, "percentage": 86.7, "elapsed_time": "2:00:10", "remaining_time": "0:18:26"} | |
| {"current_steps": 1566, "total_steps": 1804, "loss": 0.38693636655807495, "lr": 7.127007350967241e-07, "epoch": 3.4722838137472283, "percentage": 86.81, "elapsed_time": "2:00:17", "remaining_time": "0:18:16"} | |
| {"current_steps": 1568, "total_steps": 1804, "loss": 0.6902230381965637, "lr": 7.092117802433362e-07, "epoch": 3.4767184035476717, "percentage": 86.92, "elapsed_time": "2:00:27", "remaining_time": "0:18:07"} | |
| {"current_steps": 1570, "total_steps": 1804, "loss": 0.4683529734611511, "lr": 7.057502817184648e-07, "epoch": 3.481152993348115, "percentage": 87.03, "elapsed_time": "2:00:37", "remaining_time": "0:17:58"} | |
| {"current_steps": 1572, "total_steps": 1804, "loss": 0.4866012930870056, "lr": 7.023162860923722e-07, "epoch": 3.4855875831485585, "percentage": 87.14, "elapsed_time": "2:00:44", "remaining_time": "0:17:49"} | |
| {"current_steps": 1574, "total_steps": 1804, "loss": 0.878727912902832, "lr": 6.989098395653005e-07, "epoch": 3.4900221729490024, "percentage": 87.25, "elapsed_time": "2:00:55", "remaining_time": "0:17:40"} | |
| {"current_steps": 1576, "total_steps": 1804, "loss": 0.4023507833480835, "lr": 6.955309879668537e-07, "epoch": 3.494456762749446, "percentage": 87.36, "elapsed_time": "2:01:05", "remaining_time": "0:17:31"} | |
| {"current_steps": 1578, "total_steps": 1804, "loss": 0.47066283226013184, "lr": 6.921797767553794e-07, "epoch": 3.4988913525498893, "percentage": 87.47, "elapsed_time": "2:01:14", "remaining_time": "0:17:21"} | |
| {"current_steps": 1580, "total_steps": 1804, "loss": 0.7100802659988403, "lr": 6.88856251017356e-07, "epoch": 3.5033259423503327, "percentage": 87.58, "elapsed_time": "2:01:25", "remaining_time": "0:17:12"} | |
| {"current_steps": 1582, "total_steps": 1804, "loss": 0.8129547834396362, "lr": 6.855604554667897e-07, "epoch": 3.507760532150776, "percentage": 87.69, "elapsed_time": "2:01:36", "remaining_time": "0:17:03"} | |
| {"current_steps": 1584, "total_steps": 1804, "loss": 0.45433831214904785, "lr": 6.822924344446081e-07, "epoch": 3.5121951219512195, "percentage": 87.8, "elapsed_time": "2:01:45", "remaining_time": "0:16:54"} | |
| {"current_steps": 1586, "total_steps": 1804, "loss": 0.38872238993644714, "lr": 6.790522319180687e-07, "epoch": 3.516629711751663, "percentage": 87.92, "elapsed_time": "2:01:54", "remaining_time": "0:16:45"} | |
| {"current_steps": 1588, "total_steps": 1804, "loss": 0.7368452548980713, "lr": 6.758398914801628e-07, "epoch": 3.5210643015521064, "percentage": 88.03, "elapsed_time": "2:02:04", "remaining_time": "0:16:36"} | |
| {"current_steps": 1590, "total_steps": 1804, "loss": 0.48450690507888794, "lr": 6.726554563490321e-07, "epoch": 3.52549889135255, "percentage": 88.14, "elapsed_time": "2:02:11", "remaining_time": "0:16:26"} | |
| {"current_steps": 1592, "total_steps": 1804, "loss": 0.4795666038990021, "lr": 6.694989693673872e-07, "epoch": 3.529933481152993, "percentage": 88.25, "elapsed_time": "2:02:21", "remaining_time": "0:16:17"} | |
| {"current_steps": 1594, "total_steps": 1804, "loss": 0.8343867063522339, "lr": 6.663704730019285e-07, "epoch": 3.5343680709534366, "percentage": 88.36, "elapsed_time": "2:02:31", "remaining_time": "0:16:08"} | |
| {"current_steps": 1596, "total_steps": 1804, "loss": 0.2447872906923294, "lr": 6.632700093427774e-07, "epoch": 3.5388026607538805, "percentage": 88.47, "elapsed_time": "2:02:40", "remaining_time": "0:15:59"} | |
| {"current_steps": 1598, "total_steps": 1804, "loss": 0.4911421537399292, "lr": 6.601976201029095e-07, "epoch": 3.5432372505543235, "percentage": 88.58, "elapsed_time": "2:02:47", "remaining_time": "0:15:49"} | |
| {"current_steps": 1600, "total_steps": 1804, "loss": 0.5422607064247131, "lr": 6.571533466175928e-07, "epoch": 3.5476718403547673, "percentage": 88.69, "elapsed_time": "2:02:57", "remaining_time": "0:15:40"} | |
| {"current_steps": 1602, "total_steps": 1804, "loss": 0.7197349667549133, "lr": 6.541372298438325e-07, "epoch": 3.5521064301552108, "percentage": 88.8, "elapsed_time": "2:03:06", "remaining_time": "0:15:31"} | |
| {"current_steps": 1604, "total_steps": 1804, "loss": 0.8826733231544495, "lr": 6.511493103598184e-07, "epoch": 3.556541019955654, "percentage": 88.91, "elapsed_time": "2:03:17", "remaining_time": "0:15:22"} | |
| {"current_steps": 1606, "total_steps": 1804, "loss": 0.7101774215698242, "lr": 6.481896283643808e-07, "epoch": 3.5609756097560976, "percentage": 89.02, "elapsed_time": "2:03:28", "remaining_time": "0:15:13"} | |
| {"current_steps": 1608, "total_steps": 1804, "loss": 0.0969938412308693, "lr": 6.452582236764495e-07, "epoch": 3.565410199556541, "percentage": 89.14, "elapsed_time": "2:03:36", "remaining_time": "0:15:04"} | |
| {"current_steps": 1610, "total_steps": 1804, "loss": 0.5980420112609863, "lr": 6.423551357345154e-07, "epoch": 3.5698447893569845, "percentage": 89.25, "elapsed_time": "2:03:48", "remaining_time": "0:14:55"} | |
| {"current_steps": 1612, "total_steps": 1804, "loss": 0.15914049744606018, "lr": 6.394804035961038e-07, "epoch": 3.574279379157428, "percentage": 89.36, "elapsed_time": "2:03:58", "remaining_time": "0:14:46"} | |
| {"current_steps": 1614, "total_steps": 1804, "loss": 0.5454177260398865, "lr": 6.366340659372462e-07, "epoch": 3.5787139689578713, "percentage": 89.47, "elapsed_time": "2:04:09", "remaining_time": "0:14:36"} | |
| {"current_steps": 1616, "total_steps": 1804, "loss": 0.7745500206947327, "lr": 6.338161610519618e-07, "epoch": 3.5831485587583147, "percentage": 89.58, "elapsed_time": "2:04:20", "remaining_time": "0:14:27"} | |
| {"current_steps": 1618, "total_steps": 1804, "loss": 0.46495673060417175, "lr": 6.310267268517397e-07, "epoch": 3.587583148558758, "percentage": 89.69, "elapsed_time": "2:04:30", "remaining_time": "0:14:18"} | |
| {"current_steps": 1620, "total_steps": 1804, "loss": 0.6363322138786316, "lr": 6.282658008650318e-07, "epoch": 3.5920177383592016, "percentage": 89.8, "elapsed_time": "2:04:39", "remaining_time": "0:14:09"} | |
| {"current_steps": 1622, "total_steps": 1804, "loss": 0.7026889324188232, "lr": 6.255334202367462e-07, "epoch": 3.5964523281596454, "percentage": 89.91, "elapsed_time": "2:04:49", "remaining_time": "0:14:00"} | |
| {"current_steps": 1624, "total_steps": 1804, "loss": 0.6059472560882568, "lr": 6.228296217277481e-07, "epoch": 3.6008869179600884, "percentage": 90.02, "elapsed_time": "2:04:59", "remaining_time": "0:13:51"} | |
| {"current_steps": 1626, "total_steps": 1804, "loss": 0.23650091886520386, "lr": 6.201544417143641e-07, "epoch": 3.6053215077605323, "percentage": 90.13, "elapsed_time": "2:05:06", "remaining_time": "0:13:41"} | |
| {"current_steps": 1628, "total_steps": 1804, "loss": 0.7636827826499939, "lr": 6.175079161878951e-07, "epoch": 3.6097560975609757, "percentage": 90.24, "elapsed_time": "2:05:16", "remaining_time": "0:13:32"} | |
| {"current_steps": 1630, "total_steps": 1804, "loss": 0.6264622807502747, "lr": 6.148900807541295e-07, "epoch": 3.614190687361419, "percentage": 90.35, "elapsed_time": "2:05:26", "remaining_time": "0:13:23"} | |
| {"current_steps": 1632, "total_steps": 1804, "loss": 0.6594371795654297, "lr": 6.123009706328659e-07, "epoch": 3.6186252771618626, "percentage": 90.47, "elapsed_time": "2:05:36", "remaining_time": "0:13:14"} | |
| {"current_steps": 1634, "total_steps": 1804, "loss": 0.7096071839332581, "lr": 6.097406206574378e-07, "epoch": 3.623059866962306, "percentage": 90.58, "elapsed_time": "2:05:46", "remaining_time": "0:13:05"} | |
| {"current_steps": 1636, "total_steps": 1804, "loss": 0.4414654076099396, "lr": 6.072090652742475e-07, "epoch": 3.6274944567627494, "percentage": 90.69, "elapsed_time": "2:05:55", "remaining_time": "0:12:55"} | |
| {"current_steps": 1638, "total_steps": 1804, "loss": 0.6763387322425842, "lr": 6.047063385422993e-07, "epoch": 3.631929046563193, "percentage": 90.8, "elapsed_time": "2:06:05", "remaining_time": "0:12:46"} | |
| {"current_steps": 1640, "total_steps": 1804, "loss": 0.7263614535331726, "lr": 6.022324741327438e-07, "epoch": 3.6363636363636362, "percentage": 90.91, "elapsed_time": "2:06:15", "remaining_time": "0:12:37"} | |
| {"current_steps": 1642, "total_steps": 1804, "loss": 0.3981097936630249, "lr": 5.997875053284248e-07, "epoch": 3.6407982261640797, "percentage": 91.02, "elapsed_time": "2:06:22", "remaining_time": "0:12:28"} | |
| {"current_steps": 1644, "total_steps": 1804, "loss": 0.5544517636299133, "lr": 5.973714650234287e-07, "epoch": 3.6452328159645235, "percentage": 91.13, "elapsed_time": "2:06:30", "remaining_time": "0:12:18"} | |
| {"current_steps": 1646, "total_steps": 1804, "loss": 0.33355870842933655, "lr": 5.949843857226466e-07, "epoch": 3.6496674057649665, "percentage": 91.24, "elapsed_time": "2:06:36", "remaining_time": "0:12:09"} | |
| {"current_steps": 1648, "total_steps": 1804, "loss": 0.3144383728504181, "lr": 5.926262995413329e-07, "epoch": 3.6541019955654104, "percentage": 91.35, "elapsed_time": "2:06:42", "remaining_time": "0:11:59"} | |
| {"current_steps": 1650, "total_steps": 1804, "loss": 0.47315874695777893, "lr": 5.902972382046742e-07, "epoch": 3.658536585365854, "percentage": 91.46, "elapsed_time": "2:06:49", "remaining_time": "0:11:50"} | |
| {"current_steps": 1652, "total_steps": 1804, "loss": 0.6661707758903503, "lr": 5.879972330473651e-07, "epoch": 3.662971175166297, "percentage": 91.57, "elapsed_time": "2:06:58", "remaining_time": "0:11:40"} | |
| {"current_steps": 1654, "total_steps": 1804, "loss": 0.3526417315006256, "lr": 5.857263150131825e-07, "epoch": 3.6674057649667406, "percentage": 91.69, "elapsed_time": "2:07:05", "remaining_time": "0:11:31"} | |
| {"current_steps": 1656, "total_steps": 1804, "loss": 0.787284791469574, "lr": 5.834845146545726e-07, "epoch": 3.671840354767184, "percentage": 91.8, "elapsed_time": "2:07:14", "remaining_time": "0:11:22"} | |
| {"current_steps": 1658, "total_steps": 1804, "loss": 0.5197221636772156, "lr": 5.812718621322386e-07, "epoch": 3.6762749445676275, "percentage": 91.91, "elapsed_time": "2:07:24", "remaining_time": "0:11:13"} | |
| {"current_steps": 1660, "total_steps": 1804, "loss": 0.4882799983024597, "lr": 5.790883872147341e-07, "epoch": 3.680709534368071, "percentage": 92.02, "elapsed_time": "2:07:31", "remaining_time": "0:11:03"} | |
| {"current_steps": 1662, "total_steps": 1804, "loss": 0.38302525877952576, "lr": 5.769341192780643e-07, "epoch": 3.6851441241685143, "percentage": 92.13, "elapsed_time": "2:07:38", "remaining_time": "0:10:54"} | |
| {"current_steps": 1664, "total_steps": 1804, "loss": 0.3155737817287445, "lr": 5.748090873052892e-07, "epoch": 3.6895787139689578, "percentage": 92.24, "elapsed_time": "2:07:47", "remaining_time": "0:10:45"} | |
| {"current_steps": 1666, "total_steps": 1804, "loss": 0.6590555906295776, "lr": 5.727133198861353e-07, "epoch": 3.694013303769401, "percentage": 92.35, "elapsed_time": "2:07:57", "remaining_time": "0:10:35"} | |
| {"current_steps": 1668, "total_steps": 1804, "loss": 0.6262103915214539, "lr": 5.706468452166091e-07, "epoch": 3.6984478935698446, "percentage": 92.46, "elapsed_time": "2:08:07", "remaining_time": "0:10:26"} | |
| {"current_steps": 1670, "total_steps": 1804, "loss": 0.6349048018455505, "lr": 5.686096910986189e-07, "epoch": 3.7028824833702885, "percentage": 92.57, "elapsed_time": "2:08:17", "remaining_time": "0:10:17"} | |
| {"current_steps": 1672, "total_steps": 1804, "loss": 0.6529886722564697, "lr": 5.666018849396016e-07, "epoch": 3.7073170731707314, "percentage": 92.68, "elapsed_time": "2:08:26", "remaining_time": "0:10:08"} | |
| {"current_steps": 1674, "total_steps": 1804, "loss": 0.7602441310882568, "lr": 5.646234537521513e-07, "epoch": 3.7117516629711753, "percentage": 92.79, "elapsed_time": "2:08:36", "remaining_time": "0:09:59"} | |
| {"current_steps": 1676, "total_steps": 1804, "loss": 0.7868402004241943, "lr": 5.626744241536589e-07, "epoch": 3.7161862527716187, "percentage": 92.9, "elapsed_time": "2:08:46", "remaining_time": "0:09:50"} | |
| {"current_steps": 1678, "total_steps": 1804, "loss": 0.7444002628326416, "lr": 5.607548223659519e-07, "epoch": 3.720620842572062, "percentage": 93.02, "elapsed_time": "2:08:57", "remaining_time": "0:09:40"} | |
| {"current_steps": 1680, "total_steps": 1804, "loss": 0.3951123058795929, "lr": 5.58864674214942e-07, "epoch": 3.7250554323725056, "percentage": 93.13, "elapsed_time": "2:09:06", "remaining_time": "0:09:31"} | |
| {"current_steps": 1682, "total_steps": 1804, "loss": 0.5796975493431091, "lr": 5.57004005130279e-07, "epoch": 3.729490022172949, "percentage": 93.24, "elapsed_time": "2:09:17", "remaining_time": "0:09:22"} | |
| {"current_steps": 1684, "total_steps": 1804, "loss": 0.43335816264152527, "lr": 5.551728401450067e-07, "epoch": 3.7339246119733924, "percentage": 93.35, "elapsed_time": "2:09:26", "remaining_time": "0:09:13"} | |
| {"current_steps": 1686, "total_steps": 1804, "loss": 0.6772918105125427, "lr": 5.533712038952278e-07, "epoch": 3.738359201773836, "percentage": 93.46, "elapsed_time": "2:09:36", "remaining_time": "0:09:04"} | |
| {"current_steps": 1688, "total_steps": 1804, "loss": 0.2403266280889511, "lr": 5.51599120619771e-07, "epoch": 3.7427937915742793, "percentage": 93.57, "elapsed_time": "2:09:44", "remaining_time": "0:08:54"} | |
| {"current_steps": 1690, "total_steps": 1804, "loss": 0.7278658747673035, "lr": 5.498566141598662e-07, "epoch": 3.7472283813747227, "percentage": 93.68, "elapsed_time": "2:09:54", "remaining_time": "0:08:45"} | |
| {"current_steps": 1692, "total_steps": 1804, "loss": 0.6658391952514648, "lr": 5.481437079588227e-07, "epoch": 3.7516629711751666, "percentage": 93.79, "elapsed_time": "2:10:04", "remaining_time": "0:08:36"} | |
| {"current_steps": 1694, "total_steps": 1804, "loss": 0.8541386127471924, "lr": 5.464604250617143e-07, "epoch": 3.7560975609756095, "percentage": 93.9, "elapsed_time": "2:10:14", "remaining_time": "0:08:27"} | |
| {"current_steps": 1696, "total_steps": 1804, "loss": 0.8131424784660339, "lr": 5.448067881150697e-07, "epoch": 3.7605321507760534, "percentage": 94.01, "elapsed_time": "2:10:24", "remaining_time": "0:08:18"} | |
| {"current_steps": 1698, "total_steps": 1804, "loss": 0.4288075268268585, "lr": 5.431828193665664e-07, "epoch": 3.764966740576497, "percentage": 94.12, "elapsed_time": "2:10:34", "remaining_time": "0:08:09"} | |
| {"current_steps": 1700, "total_steps": 1804, "loss": 0.5178676843643188, "lr": 5.415885406647334e-07, "epoch": 3.7694013303769403, "percentage": 94.24, "elapsed_time": "2:10:44", "remaining_time": "0:07:59"} | |
| {"current_steps": 1702, "total_steps": 1804, "loss": 0.912034273147583, "lr": 5.400239734586551e-07, "epoch": 3.7738359201773837, "percentage": 94.35, "elapsed_time": "2:10:55", "remaining_time": "0:07:50"} | |
| {"current_steps": 1704, "total_steps": 1804, "loss": 0.11222898960113525, "lr": 5.384891387976845e-07, "epoch": 3.778270509977827, "percentage": 94.46, "elapsed_time": "2:10:58", "remaining_time": "0:07:41"} | |
| {"current_steps": 1706, "total_steps": 1804, "loss": 0.4198029041290283, "lr": 5.369840573311593e-07, "epoch": 3.7827050997782705, "percentage": 94.57, "elapsed_time": "2:11:06", "remaining_time": "0:07:31"} | |
| {"current_steps": 1708, "total_steps": 1804, "loss": 0.5717388391494751, "lr": 5.355087493081236e-07, "epoch": 3.787139689578714, "percentage": 94.68, "elapsed_time": "2:11:16", "remaining_time": "0:07:22"} | |
| {"current_steps": 1710, "total_steps": 1804, "loss": 0.8026604652404785, "lr": 5.340632345770564e-07, "epoch": 3.7915742793791574, "percentage": 94.79, "elapsed_time": "2:11:26", "remaining_time": "0:07:13"} | |
| {"current_steps": 1712, "total_steps": 1804, "loss": 0.5300987958908081, "lr": 5.326475325856036e-07, "epoch": 3.796008869179601, "percentage": 94.9, "elapsed_time": "2:11:36", "remaining_time": "0:07:04"} | |
| {"current_steps": 1714, "total_steps": 1804, "loss": 0.5400223731994629, "lr": 5.312616623803174e-07, "epoch": 3.800443458980044, "percentage": 95.01, "elapsed_time": "2:11:46", "remaining_time": "0:06:55"} | |
| {"current_steps": 1716, "total_steps": 1804, "loss": 0.676804780960083, "lr": 5.299056426063995e-07, "epoch": 3.8048780487804876, "percentage": 95.12, "elapsed_time": "2:11:56", "remaining_time": "0:06:45"} | |
| {"current_steps": 1718, "total_steps": 1804, "loss": 0.5867292881011963, "lr": 5.2857949150745e-07, "epoch": 3.8093126385809315, "percentage": 95.23, "elapsed_time": "2:12:04", "remaining_time": "0:06:36"} | |
| {"current_steps": 1720, "total_steps": 1804, "loss": 0.5383524894714355, "lr": 5.27283226925222e-07, "epoch": 3.8137472283813745, "percentage": 95.34, "elapsed_time": "2:12:16", "remaining_time": "0:06:27"} | |
| {"current_steps": 1722, "total_steps": 1804, "loss": 0.7805699110031128, "lr": 5.260168662993824e-07, "epoch": 3.8181818181818183, "percentage": 95.45, "elapsed_time": "2:12:27", "remaining_time": "0:06:18"} | |
| {"current_steps": 1724, "total_steps": 1804, "loss": 0.6948915719985962, "lr": 5.247804266672765e-07, "epoch": 3.8226164079822618, "percentage": 95.57, "elapsed_time": "2:12:37", "remaining_time": "0:06:09"} | |
| {"current_steps": 1726, "total_steps": 1804, "loss": 0.6090778708457947, "lr": 5.235739246636988e-07, "epoch": 3.827050997782705, "percentage": 95.68, "elapsed_time": "2:12:44", "remaining_time": "0:05:59"} | |
| {"current_steps": 1728, "total_steps": 1804, "loss": 0.38666534423828125, "lr": 5.223973765206694e-07, "epoch": 3.8314855875831486, "percentage": 95.79, "elapsed_time": "2:12:54", "remaining_time": "0:05:50"} | |
| {"current_steps": 1730, "total_steps": 1804, "loss": 0.5477333664894104, "lr": 5.212507980672155e-07, "epoch": 3.835920177383592, "percentage": 95.9, "elapsed_time": "2:13:46", "remaining_time": "0:05:43"} | |
| {"current_steps": 1732, "total_steps": 1804, "loss": 0.7528340816497803, "lr": 5.201342047291587e-07, "epoch": 3.8403547671840355, "percentage": 96.01, "elapsed_time": "2:14:44", "remaining_time": "0:05:36"} | |
| {"current_steps": 1734, "total_steps": 1804, "loss": 0.7895328402519226, "lr": 5.190476115289063e-07, "epoch": 3.844789356984479, "percentage": 96.12, "elapsed_time": "2:17:39", "remaining_time": "0:05:33"} | |
| {"current_steps": 1736, "total_steps": 1804, "loss": 0.8134070634841919, "lr": 5.179910330852521e-07, "epoch": 3.8492239467849223, "percentage": 96.23, "elapsed_time": "2:19:33", "remaining_time": "0:05:27"} | |
| {"current_steps": 1738, "total_steps": 1804, "loss": 0.8293890953063965, "lr": 5.169644836131759e-07, "epoch": 3.8536585365853657, "percentage": 96.34, "elapsed_time": "2:21:24", "remaining_time": "0:05:22"} | |
| {"current_steps": 1740, "total_steps": 1804, "loss": 0.4353466331958771, "lr": 5.159679769236553e-07, "epoch": 3.858093126385809, "percentage": 96.45, "elapsed_time": "2:22:21", "remaining_time": "0:05:14"} | |
| {"current_steps": 1742, "total_steps": 1804, "loss": 0.42465826869010925, "lr": 5.150015264234782e-07, "epoch": 3.8625277161862526, "percentage": 96.56, "elapsed_time": "2:26:11", "remaining_time": "0:05:12"} | |
| {"current_steps": 1744, "total_steps": 1804, "loss": 0.7305734157562256, "lr": 5.140651451150627e-07, "epoch": 3.8669623059866964, "percentage": 96.67, "elapsed_time": "2:28:50", "remaining_time": "0:05:07"} | |
| {"current_steps": 1746, "total_steps": 1804, "loss": 0.6675954461097717, "lr": 5.131588455962835e-07, "epoch": 3.8713968957871394, "percentage": 96.78, "elapsed_time": "2:29:01", "remaining_time": "0:04:57"} | |
| {"current_steps": 1748, "total_steps": 1804, "loss": 0.43095389008522034, "lr": 5.122826400602999e-07, "epoch": 3.8758314855875833, "percentage": 96.9, "elapsed_time": "2:29:10", "remaining_time": "0:04:46"} | |
| {"current_steps": 1750, "total_steps": 1804, "loss": 0.5411372184753418, "lr": 5.114365402953946e-07, "epoch": 3.8802660753880267, "percentage": 97.01, "elapsed_time": "2:29:19", "remaining_time": "0:04:36"} | |
| {"current_steps": 1752, "total_steps": 1804, "loss": 0.6075332760810852, "lr": 5.106205576848123e-07, "epoch": 3.88470066518847, "percentage": 97.12, "elapsed_time": "2:29:29", "remaining_time": "0:04:26"} | |
| {"current_steps": 1754, "total_steps": 1804, "loss": 0.7073659896850586, "lr": 5.09834703206609e-07, "epoch": 3.8891352549889135, "percentage": 97.23, "elapsed_time": "2:29:39", "remaining_time": "0:04:15"} | |
| {"current_steps": 1756, "total_steps": 1804, "loss": 0.5417055487632751, "lr": 5.090789874335027e-07, "epoch": 3.893569844789357, "percentage": 97.34, "elapsed_time": "2:29:49", "remaining_time": "0:04:05"} | |
| {"current_steps": 1758, "total_steps": 1804, "loss": 0.796424150466919, "lr": 5.083534205327321e-07, "epoch": 3.8980044345898004, "percentage": 97.45, "elapsed_time": "2:29:59", "remaining_time": "0:03:55"} | |
| {"current_steps": 1760, "total_steps": 1804, "loss": 0.006255284883081913, "lr": 5.076580122659192e-07, "epoch": 3.902439024390244, "percentage": 97.56, "elapsed_time": "2:30:02", "remaining_time": "0:03:45"} | |
| {"current_steps": 1762, "total_steps": 1804, "loss": 0.36486220359802246, "lr": 5.069927719889383e-07, "epoch": 3.9068736141906872, "percentage": 97.67, "elapsed_time": "2:30:12", "remaining_time": "0:03:34"} | |
| {"current_steps": 1764, "total_steps": 1804, "loss": 0.3559632897377014, "lr": 5.063577086517894e-07, "epoch": 3.9113082039911307, "percentage": 97.78, "elapsed_time": "2:30:18", "remaining_time": "0:03:24"} | |
| {"current_steps": 1766, "total_steps": 1804, "loss": 0.538555920124054, "lr": 5.057528307984792e-07, "epoch": 3.9157427937915745, "percentage": 97.89, "elapsed_time": "2:30:25", "remaining_time": "0:03:14"} | |
| {"current_steps": 1768, "total_steps": 1804, "loss": 0.7477619647979736, "lr": 5.051781465669053e-07, "epoch": 3.9201773835920175, "percentage": 98.0, "elapsed_time": "2:30:35", "remaining_time": "0:03:03"} | |
| {"current_steps": 1770, "total_steps": 1804, "loss": 0.7172592878341675, "lr": 5.04633663688746e-07, "epoch": 3.9246119733924614, "percentage": 98.12, "elapsed_time": "2:30:45", "remaining_time": "0:02:53"} | |
| {"current_steps": 1772, "total_steps": 1804, "loss": 0.27976056933403015, "lr": 5.04119389489358e-07, "epoch": 3.929046563192905, "percentage": 98.23, "elapsed_time": "2:30:55", "remaining_time": "0:02:43"} | |
| {"current_steps": 1774, "total_steps": 1804, "loss": 0.6978716254234314, "lr": 5.036353308876764e-07, "epoch": 3.933481152993348, "percentage": 98.34, "elapsed_time": "2:31:05", "remaining_time": "0:02:33"} | |
| {"current_steps": 1776, "total_steps": 1804, "loss": 0.8046401143074036, "lr": 5.031814943961221e-07, "epoch": 3.9379157427937916, "percentage": 98.45, "elapsed_time": "2:31:14", "remaining_time": "0:02:23"} | |
| {"current_steps": 1778, "total_steps": 1804, "loss": 0.13979971408843994, "lr": 5.027578861205139e-07, "epoch": 3.942350332594235, "percentage": 98.56, "elapsed_time": "2:31:21", "remaining_time": "0:02:12"} | |
| {"current_steps": 1780, "total_steps": 1804, "loss": 0.48992088437080383, "lr": 5.023645117599877e-07, "epoch": 3.9467849223946785, "percentage": 98.67, "elapsed_time": "2:31:31", "remaining_time": "0:02:02"} | |
| {"current_steps": 1782, "total_steps": 1804, "loss": 0.4223584532737732, "lr": 5.020013766069176e-07, "epoch": 3.951219512195122, "percentage": 98.78, "elapsed_time": "2:31:37", "remaining_time": "0:01:52"} | |
| {"current_steps": 1784, "total_steps": 1804, "loss": 0.477926105260849, "lr": 5.016684855468464e-07, "epoch": 3.9556541019955653, "percentage": 98.89, "elapsed_time": "2:31:44", "remaining_time": "0:01:42"} | |
| {"current_steps": 1786, "total_steps": 1804, "loss": 0.7137855887413025, "lr": 5.013658430584194e-07, "epoch": 3.9600886917960088, "percentage": 99.0, "elapsed_time": "2:31:54", "remaining_time": "0:01:31"} | |
| {"current_steps": 1788, "total_steps": 1804, "loss": 0.4540127217769623, "lr": 5.010934532133236e-07, "epoch": 3.964523281596452, "percentage": 99.11, "elapsed_time": "2:32:01", "remaining_time": "0:01:21"} | |
| {"current_steps": 1790, "total_steps": 1804, "loss": 0.5620056390762329, "lr": 5.008513196762342e-07, "epoch": 3.9689578713968956, "percentage": 99.22, "elapsed_time": "2:32:09", "remaining_time": "0:01:11"} | |
| {"current_steps": 1792, "total_steps": 1804, "loss": 0.6748104095458984, "lr": 5.006394457047638e-07, "epoch": 3.9733924611973395, "percentage": 99.33, "elapsed_time": "2:32:19", "remaining_time": "0:01:01"} | |
| {"current_steps": 1794, "total_steps": 1804, "loss": 0.38833579421043396, "lr": 5.004578341494197e-07, "epoch": 3.9778270509977824, "percentage": 99.45, "elapsed_time": "2:32:25", "remaining_time": "0:00:50"} | |
| {"current_steps": 1796, "total_steps": 1804, "loss": 0.680509626865387, "lr": 5.003064874535649e-07, "epoch": 3.9822616407982263, "percentage": 99.56, "elapsed_time": "2:32:34", "remaining_time": "0:00:40"} | |
| {"current_steps": 1798, "total_steps": 1804, "loss": 0.7808913588523865, "lr": 5.00185407653385e-07, "epoch": 3.9866962305986697, "percentage": 99.67, "elapsed_time": "2:32:44", "remaining_time": "0:00:30"} | |
| {"current_steps": 1800, "total_steps": 1804, "loss": 0.6849638223648071, "lr": 5.000945963778627e-07, "epoch": 3.991130820399113, "percentage": 99.78, "elapsed_time": "2:32:54", "remaining_time": "0:00:20"} | |
| {"current_steps": 1802, "total_steps": 1804, "loss": 0.7774142622947693, "lr": 5.000340548487528e-07, "epoch": 3.9955654101995566, "percentage": 99.89, "elapsed_time": "2:33:04", "remaining_time": "0:00:10"} | |
| {"current_steps": 1804, "total_steps": 1804, "loss": 0.4334436058998108, "lr": 5.000037838805682e-07, "epoch": 4.0, "percentage": 100.0, "elapsed_time": "2:33:12", "remaining_time": "0:00:00"} | |
| {"current_steps": 1804, "total_steps": 1804, "epoch": 4.0, "percentage": 100.0, "elapsed_time": "2:33:12", "remaining_time": "0:00:00"} | |