Instructions to use furproxy/27b-5-lora with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use furproxy/27b-5-lora with PEFT:

    from peft import PeftModel
    from transformers import AutoModelForCausalLM

    base_model = AutoModelForCausalLM.from_pretrained("/workspace/models/Qwen3.6-27B")
    model = PeftModel.from_pretrained(base_model, "furproxy/27b-5-lora")

- Transformers
How to use furproxy/27b-5-lora with Transformers:

    # Use a pipeline as a high-level helper
    from transformers import pipeline

    pipe = pipeline("text-generation", model="furproxy/27b-5-lora")
    messages = [
        {"role": "user", "content": "Who are you?"},
    ]
    pipe(messages)

    # Load model directly
    from transformers import AutoModel

    model = AutoModel.from_pretrained("furproxy/27b-5-lora", dtype="auto")

- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use furproxy/27b-5-lora with vLLM:
Install from pip and serve model

    # Install vLLM from pip:
    pip install vllm

    # Start the vLLM server:
    vllm serve "furproxy/27b-5-lora"

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:8000/v1/chat/completions" \
        -H "Content-Type: application/json" \
        --data '{
            "model": "furproxy/27b-5-lora",
            "messages": [
                {
                    "role": "user",
                    "content": "What is the capital of France?"
                }
            ]
        }'

Use Docker

    docker model run hf.co/furproxy/27b-5-lora

- SGLang
How to use furproxy/27b-5-lora with SGLang:
Install from pip and serve model

    # Install SGLang from pip:
    pip install sglang

    # Start the SGLang server:
    python3 -m sglang.launch_server \
        --model-path "furproxy/27b-5-lora" \
        --host 0.0.0.0 \
        --port 30000

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/chat/completions" \
        -H "Content-Type: application/json" \
        --data '{
            "model": "furproxy/27b-5-lora",
            "messages": [
                {
                    "role": "user",
                    "content": "What is the capital of France?"
                }
            ]
        }'

Use Docker images

    docker run --gpus all \
        --shm-size 32g \
        -p 30000:30000 \
        -v ~/.cache/huggingface:/root/.cache/huggingface \
        --env "HF_TOKEN=<secret>" \
        --ipc=host \
        lmsysorg/sglang:latest \
        python3 -m sglang.launch_server \
            --model-path "furproxy/27b-5-lora" \
            --host 0.0.0.0 \
            --port 30000

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/chat/completions" \
        -H "Content-Type: application/json" \
        --data '{
            "model": "furproxy/27b-5-lora",
            "messages": [
                {
                    "role": "user",
                    "content": "What is the capital of France?"
                }
            ]
        }'

- Docker Model Runner
How to use furproxy/27b-5-lora with Docker Model Runner:

    docker model run hf.co/furproxy/27b-5-lora

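The curl calls in the vLLM and SGLang sections send the same JSON body to an OpenAI-compatible `/v1/chat/completions` endpoint. As a minimal sketch, the same request can be built in Python with only the standard library. The helper names `build_chat_request` and `send_chat_request` are hypothetical (they are not part of vLLM or SGLang), and the response shape is assumed to follow the OpenAI chat completions format.

```python
import json
import urllib.request


def build_chat_request(prompt, model="furproxy/27b-5-lora",
                       base_url="http://localhost:8000"):
    """Build (but do not send) a POST request for /v1/chat/completions.

    The default base_url matches the vLLM example (port 8000); pass
    "http://localhost:30000" to target the SGLang examples instead.
    """
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send_chat_request(request, timeout=60):
    """Send the request to a running server and return the parsed JSON reply."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With a server running, `send_chat_request(build_chat_request("What is the capital of France?"))` should return a dict; under the assumed OpenAI-style response format, the generated text would sit at `reply["choices"][0]["message"]["content"]`. Keeping request construction separate from sending makes the payload easy to inspect without a live server.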
| {"current_steps": 2, "total_steps": 1638, "loss": 2.681140899658203, "lr": 4.0000000000000003e-07, "epoch": 0.003663003663003663, "percentage": 0.12, "elapsed_time": "0:01:15", "remaining_time": "17:10:07"} | |
| {"current_steps": 4, "total_steps": 1638, "loss": 1.6674047708511353, "lr": 1.2000000000000002e-06, "epoch": 0.007326007326007326, "percentage": 0.24, "elapsed_time": "0:02:14", "remaining_time": "15:19:06"} | |
| {"current_steps": 6, "total_steps": 1638, "loss": 1.8801467418670654, "lr": 2.0000000000000003e-06, "epoch": 0.01098901098901099, "percentage": 0.37, "elapsed_time": "0:03:18", "remaining_time": "15:00:46"} | |
| {"current_steps": 8, "total_steps": 1638, "loss": 2.0659124851226807, "lr": 2.8000000000000003e-06, "epoch": 0.014652014652014652, "percentage": 0.49, "elapsed_time": "0:04:27", "remaining_time": "15:08:10"} | |
| {"current_steps": 10, "total_steps": 1638, "loss": 2.2201435565948486, "lr": 3.6000000000000003e-06, "epoch": 0.018315018315018316, "percentage": 0.61, "elapsed_time": "0:05:42", "remaining_time": "15:29:59"} | |
| {"current_steps": 12, "total_steps": 1638, "loss": 2.0232832431793213, "lr": 4.4e-06, "epoch": 0.02197802197802198, "percentage": 0.73, "elapsed_time": "0:06:49", "remaining_time": "15:25:26"} | |
| {"current_steps": 14, "total_steps": 1638, "loss": 1.7574424743652344, "lr": 5.2e-06, "epoch": 0.02564102564102564, "percentage": 0.85, "elapsed_time": "0:07:57", "remaining_time": "15:22:46"} | |
| {"current_steps": 16, "total_steps": 1638, "loss": 1.8244725465774536, "lr": 6e-06, "epoch": 0.029304029304029304, "percentage": 0.98, "elapsed_time": "0:09:03", "remaining_time": "15:18:55"} | |
| {"current_steps": 18, "total_steps": 1638, "loss": 1.7521573305130005, "lr": 6.800000000000001e-06, "epoch": 0.03296703296703297, "percentage": 1.1, "elapsed_time": "0:10:21", "remaining_time": "15:32:04"} | |
| {"current_steps": 20, "total_steps": 1638, "loss": 1.7442874908447266, "lr": 7.600000000000001e-06, "epoch": 0.03663003663003663, "percentage": 1.22, "elapsed_time": "0:11:33", "remaining_time": "15:34:57"} | |
| {"current_steps": 22, "total_steps": 1638, "loss": 1.137043833732605, "lr": 8.400000000000001e-06, "epoch": 0.040293040293040296, "percentage": 1.34, "elapsed_time": "0:12:27", "remaining_time": "15:14:47"} | |
| {"current_steps": 24, "total_steps": 1638, "loss": 1.3199552297592163, "lr": 9.200000000000002e-06, "epoch": 0.04395604395604396, "percentage": 1.47, "elapsed_time": "0:13:30", "remaining_time": "15:08:51"} | |
| {"current_steps": 26, "total_steps": 1638, "loss": 1.4519306421279907, "lr": 1e-05, "epoch": 0.047619047619047616, "percentage": 1.59, "elapsed_time": "0:14:49", "remaining_time": "15:19:21"} | |
| {"current_steps": 28, "total_steps": 1638, "loss": 1.4781732559204102, "lr": 1.0800000000000002e-05, "epoch": 0.05128205128205128, "percentage": 1.71, "elapsed_time": "0:15:43", "remaining_time": "15:03:44"} | |
| {"current_steps": 30, "total_steps": 1638, "loss": 1.1731195449829102, "lr": 1.16e-05, "epoch": 0.054945054945054944, "percentage": 1.83, "elapsed_time": "0:16:48", "remaining_time": "15:01:08"} | |
| {"current_steps": 32, "total_steps": 1638, "loss": 1.10336172580719, "lr": 1.2400000000000002e-05, "epoch": 0.05860805860805861, "percentage": 1.95, "elapsed_time": "0:17:57", "remaining_time": "15:01:33"} | |
| {"current_steps": 34, "total_steps": 1638, "loss": 1.2399317026138306, "lr": 1.3200000000000002e-05, "epoch": 0.06227106227106227, "percentage": 2.08, "elapsed_time": "0:19:06", "remaining_time": "15:01:34"} | |
| {"current_steps": 36, "total_steps": 1638, "loss": 1.61336088180542, "lr": 1.4e-05, "epoch": 0.06593406593406594, "percentage": 2.2, "elapsed_time": "0:20:18", "remaining_time": "15:03:58"} | |
| {"current_steps": 38, "total_steps": 1638, "loss": 1.372746229171753, "lr": 1.48e-05, "epoch": 0.0695970695970696, "percentage": 2.32, "elapsed_time": "0:21:26", "remaining_time": "15:02:56"} | |
| {"current_steps": 40, "total_steps": 1638, "loss": 1.4253513813018799, "lr": 1.5600000000000003e-05, "epoch": 0.07326007326007326, "percentage": 2.44, "elapsed_time": "0:22:33", "remaining_time": "15:01:06"} | |
| {"current_steps": 42, "total_steps": 1638, "loss": 1.136276364326477, "lr": 1.64e-05, "epoch": 0.07692307692307693, "percentage": 2.56, "elapsed_time": "0:23:47", "remaining_time": "15:04:23"} | |
| {"current_steps": 44, "total_steps": 1638, "loss": 1.5315269231796265, "lr": 1.72e-05, "epoch": 0.08058608058608059, "percentage": 2.69, "elapsed_time": "0:25:01", "remaining_time": "15:06:43"} | |
| {"current_steps": 46, "total_steps": 1638, "loss": 1.6208034753799438, "lr": 1.8e-05, "epoch": 0.08424908424908426, "percentage": 2.81, "elapsed_time": "0:26:09", "remaining_time": "15:05:26"} | |
| {"current_steps": 48, "total_steps": 1638, "loss": 1.05559241771698, "lr": 1.88e-05, "epoch": 0.08791208791208792, "percentage": 2.93, "elapsed_time": "0:27:17", "remaining_time": "15:04:10"} | |
| {"current_steps": 50, "total_steps": 1638, "loss": 1.5109608173370361, "lr": 1.9600000000000002e-05, "epoch": 0.09157509157509157, "percentage": 3.05, "elapsed_time": "0:28:34", "remaining_time": "15:07:36"} | |
| {"current_steps": 52, "total_steps": 1638, "loss": 0.7463083863258362, "lr": 1.999998238790087e-05, "epoch": 0.09523809523809523, "percentage": 3.17, "elapsed_time": "0:29:27", "remaining_time": "14:58:41"} | |
| {"current_steps": 54, "total_steps": 1638, "loss": 0.9765978455543518, "lr": 1.999984149152137e-05, "epoch": 0.0989010989010989, "percentage": 3.3, "elapsed_time": "0:30:34", "remaining_time": "14:57:01"} | |
| {"current_steps": 56, "total_steps": 1638, "loss": 1.3462445735931396, "lr": 1.999955970096814e-05, "epoch": 0.10256410256410256, "percentage": 3.42, "elapsed_time": "0:31:43", "remaining_time": "14:56:14"} | |
| {"current_steps": 58, "total_steps": 1638, "loss": 1.197383999824524, "lr": 1.9999137020652663e-05, "epoch": 0.10622710622710622, "percentage": 3.54, "elapsed_time": "0:32:54", "remaining_time": "14:56:37"} | |
| {"current_steps": 60, "total_steps": 1638, "loss": 1.4108028411865234, "lr": 1.999857345719207e-05, "epoch": 0.10989010989010989, "percentage": 3.66, "elapsed_time": "0:34:01", "remaining_time": "14:55:01"} | |
| {"current_steps": 62, "total_steps": 1638, "loss": 1.4261771440505981, "lr": 1.9997869019409047e-05, "epoch": 0.11355311355311355, "percentage": 3.79, "elapsed_time": "0:35:09", "remaining_time": "14:53:34"} | |
| {"current_steps": 64, "total_steps": 1638, "loss": 1.3881282806396484, "lr": 1.9997023718331707e-05, "epoch": 0.11721611721611722, "percentage": 3.91, "elapsed_time": "0:36:20", "remaining_time": "14:53:45"} | |
| {"current_steps": 66, "total_steps": 1638, "loss": 1.3539735078811646, "lr": 1.9996037567193388e-05, "epoch": 0.12087912087912088, "percentage": 4.03, "elapsed_time": "0:37:42", "remaining_time": "14:58:10"} | |
| {"current_steps": 68, "total_steps": 1638, "loss": 1.3212106227874756, "lr": 1.9994910581432466e-05, "epoch": 0.12454212454212454, "percentage": 4.15, "elapsed_time": "0:38:56", "remaining_time": "14:58:57"} | |
| {"current_steps": 70, "total_steps": 1638, "loss": 1.0624397993087769, "lr": 1.9993642778692116e-05, "epoch": 0.1282051282051282, "percentage": 4.27, "elapsed_time": "0:39:55", "remaining_time": "14:54:22"} | |
| {"current_steps": 72, "total_steps": 1638, "loss": 1.4300200939178467, "lr": 1.999223417882002e-05, "epoch": 0.13186813186813187, "percentage": 4.4, "elapsed_time": "0:41:07", "remaining_time": "14:54:25"} | |
| {"current_steps": 74, "total_steps": 1638, "loss": 1.5944573879241943, "lr": 1.9990684803868068e-05, "epoch": 0.13553113553113552, "percentage": 4.52, "elapsed_time": "0:42:23", "remaining_time": "14:55:48"} | |
| {"current_steps": 76, "total_steps": 1638, "loss": 1.0820951461791992, "lr": 1.9988994678092007e-05, "epoch": 0.1391941391941392, "percentage": 4.64, "elapsed_time": "0:43:18", "remaining_time": "14:50:07"} | |
| {"current_steps": 78, "total_steps": 1638, "loss": 1.4329181909561157, "lr": 1.9987163827951077e-05, "epoch": 0.14285714285714285, "percentage": 4.76, "elapsed_time": "0:44:25", "remaining_time": "14:48:38"} | |
| {"current_steps": 80, "total_steps": 1638, "loss": 1.5802102088928223, "lr": 1.998519228210756e-05, "epoch": 0.14652014652014653, "percentage": 4.88, "elapsed_time": "0:45:35", "remaining_time": "14:47:52"} | |
| {"current_steps": 82, "total_steps": 1638, "loss": 1.1970324516296387, "lr": 1.998308007142638e-05, "epoch": 0.15018315018315018, "percentage": 5.01, "elapsed_time": "0:46:42", "remaining_time": "14:46:19"} | |
| {"current_steps": 84, "total_steps": 1638, "loss": 1.3608276844024658, "lr": 1.9980827228974575e-05, "epoch": 0.15384615384615385, "percentage": 5.13, "elapsed_time": "0:47:53", "remaining_time": "14:46:01"} | |
| {"current_steps": 86, "total_steps": 1638, "loss": 1.4945706129074097, "lr": 1.997843379002081e-05, "epoch": 0.1575091575091575, "percentage": 5.25, "elapsed_time": "0:49:00", "remaining_time": "14:44:30"} | |
| {"current_steps": 88, "total_steps": 1638, "loss": 0.700541615486145, "lr": 1.9975899792034824e-05, "epoch": 0.16117216117216118, "percentage": 5.37, "elapsed_time": "0:49:50", "remaining_time": "14:37:56"} | |
| {"current_steps": 90, "total_steps": 1638, "loss": 0.8554237484931946, "lr": 1.9973225274686804e-05, "epoch": 0.16483516483516483, "percentage": 5.49, "elapsed_time": "0:50:53", "remaining_time": "14:35:27"} | |
| {"current_steps": 92, "total_steps": 1638, "loss": 1.31403648853302, "lr": 1.9970410279846816e-05, "epoch": 0.1684981684981685, "percentage": 5.62, "elapsed_time": "0:52:07", "remaining_time": "14:35:58"} | |
| {"current_steps": 94, "total_steps": 1638, "loss": 1.3383275270462036, "lr": 1.9967454851584132e-05, "epoch": 0.17216117216117216, "percentage": 5.74, "elapsed_time": "0:53:18", "remaining_time": "14:35:32"} | |
| {"current_steps": 96, "total_steps": 1638, "loss": 1.2345792055130005, "lr": 1.996435903616651e-05, "epoch": 0.17582417582417584, "percentage": 5.86, "elapsed_time": "0:54:17", "remaining_time": "14:32:00"} | |
| {"current_steps": 98, "total_steps": 1638, "loss": 1.3272985219955444, "lr": 1.9961122882059523e-05, "epoch": 0.1794871794871795, "percentage": 5.98, "elapsed_time": "0:55:37", "remaining_time": "14:34:11"} | |
| {"current_steps": 100, "total_steps": 1638, "loss": 1.1760129928588867, "lr": 1.9957746439925748e-05, "epoch": 0.18315018315018314, "percentage": 6.11, "elapsed_time": "0:56:46", "remaining_time": "14:33:15"} | |
| {"current_steps": 102, "total_steps": 1638, "loss": 1.2455718517303467, "lr": 1.9954229762624016e-05, "epoch": 0.18681318681318682, "percentage": 6.23, "elapsed_time": "0:57:50", "remaining_time": "14:30:56"} | |
| {"current_steps": 104, "total_steps": 1638, "loss": 0.897014319896698, "lr": 1.995057290520855e-05, "epoch": 0.19047619047619047, "percentage": 6.35, "elapsed_time": "0:58:57", "remaining_time": "14:29:41"} | |
| {"current_steps": 106, "total_steps": 1638, "loss": 1.0838041305541992, "lr": 1.9946775924928132e-05, "epoch": 0.19413919413919414, "percentage": 6.47, "elapsed_time": "0:59:58", "remaining_time": "14:26:43"} | |
| {"current_steps": 108, "total_steps": 1638, "loss": 1.3144299983978271, "lr": 1.9942838881225183e-05, "epoch": 0.1978021978021978, "percentage": 6.59, "elapsed_time": "1:01:05", "remaining_time": "14:25:22"} | |
| {"current_steps": 110, "total_steps": 1638, "loss": 1.1635433435440063, "lr": 1.9938761835734842e-05, "epoch": 0.20146520146520147, "percentage": 6.72, "elapsed_time": "1:02:06", "remaining_time": "14:22:45"} | |
| {"current_steps": 112, "total_steps": 1638, "loss": 1.2244360446929932, "lr": 1.9934544852284013e-05, "epoch": 0.20512820512820512, "percentage": 6.84, "elapsed_time": "1:03:21", "remaining_time": "14:23:10"} | |
| {"current_steps": 114, "total_steps": 1638, "loss": 0.6363462209701538, "lr": 1.9930187996890347e-05, "epoch": 0.2087912087912088, "percentage": 6.96, "elapsed_time": "1:04:31", "remaining_time": "14:22:31"} | |
| {"current_steps": 116, "total_steps": 1638, "loss": 1.300977349281311, "lr": 1.992569133776121e-05, "epoch": 0.21245421245421245, "percentage": 7.08, "elapsed_time": "1:05:42", "remaining_time": "14:22:04"} | |
| {"current_steps": 118, "total_steps": 1638, "loss": 1.2940763235092163, "lr": 1.992105494529264e-05, "epoch": 0.21611721611721613, "percentage": 7.2, "elapsed_time": "1:06:49", "remaining_time": "14:20:53"} | |
| {"current_steps": 120, "total_steps": 1638, "loss": 1.3638042211532593, "lr": 1.99162788920682e-05, "epoch": 0.21978021978021978, "percentage": 7.33, "elapsed_time": "1:07:43", "remaining_time": "14:16:44"} | |
| {"current_steps": 122, "total_steps": 1638, "loss": 1.2911320924758911, "lr": 1.9911363252857887e-05, "epoch": 0.22344322344322345, "percentage": 7.45, "elapsed_time": "1:09:03", "remaining_time": "14:18:10"} | |
| {"current_steps": 124, "total_steps": 1638, "loss": 1.022411823272705, "lr": 1.990630810461694e-05, "epoch": 0.2271062271062271, "percentage": 7.57, "elapsed_time": "1:10:12", "remaining_time": "14:17:10"} | |
| {"current_steps": 126, "total_steps": 1638, "loss": 0.8959170579910278, "lr": 1.990111352648463e-05, "epoch": 0.23076923076923078, "percentage": 7.69, "elapsed_time": "1:11:21", "remaining_time": "14:16:13"} | |
| {"current_steps": 128, "total_steps": 1638, "loss": 1.1906862258911133, "lr": 1.9895779599783033e-05, "epoch": 0.23443223443223443, "percentage": 7.81, "elapsed_time": "1:12:32", "remaining_time": "14:15:44"} | |
| {"current_steps": 130, "total_steps": 1638, "loss": 1.2996340990066528, "lr": 1.989030640801576e-05, "epoch": 0.23809523809523808, "percentage": 7.94, "elapsed_time": "1:13:54", "remaining_time": "14:17:24"} | |
| {"current_steps": 132, "total_steps": 1638, "loss": 1.391095757484436, "lr": 1.9884694036866624e-05, "epoch": 0.24175824175824176, "percentage": 8.06, "elapsed_time": "1:14:55", "remaining_time": "14:14:43"} | |
| {"current_steps": 134, "total_steps": 1638, "loss": 1.289358377456665, "lr": 1.9878942574198334e-05, "epoch": 0.2454212454212454, "percentage": 8.18, "elapsed_time": "1:16:05", "remaining_time": "14:13:58"} | |
| {"current_steps": 136, "total_steps": 1638, "loss": 1.273111343383789, "lr": 1.9873052110051094e-05, "epoch": 0.2490842490842491, "percentage": 8.3, "elapsed_time": "1:17:19", "remaining_time": "14:13:54"} | |
| {"current_steps": 138, "total_steps": 1638, "loss": 1.089441180229187, "lr": 1.9867022736641205e-05, "epoch": 0.25274725274725274, "percentage": 8.42, "elapsed_time": "1:18:39", "remaining_time": "14:14:58"} | |
| {"current_steps": 140, "total_steps": 1638, "loss": 1.2736470699310303, "lr": 1.9860854548359615e-05, "epoch": 0.2564102564102564, "percentage": 8.55, "elapsed_time": "1:19:50", "remaining_time": "14:14:17"} | |
| {"current_steps": 142, "total_steps": 1638, "loss": 1.2917908430099487, "lr": 1.9854547641770446e-05, "epoch": 0.2600732600732601, "percentage": 8.67, "elapsed_time": "1:20:54", "remaining_time": "14:12:18"} | |
| {"current_steps": 144, "total_steps": 1638, "loss": 1.2552529573440552, "lr": 1.9848102115609483e-05, "epoch": 0.26373626373626374, "percentage": 8.79, "elapsed_time": "1:22:05", "remaining_time": "14:11:40"} | |
| {"current_steps": 146, "total_steps": 1638, "loss": 1.4075181484222412, "lr": 1.9841518070782615e-05, "epoch": 0.2673992673992674, "percentage": 8.91, "elapsed_time": "1:23:23", "remaining_time": "14:12:10"} | |
| {"current_steps": 148, "total_steps": 1638, "loss": 1.3139081001281738, "lr": 1.983479561036429e-05, "epoch": 0.27106227106227104, "percentage": 9.04, "elapsed_time": "1:24:30", "remaining_time": "14:10:48"} | |
| {"current_steps": 150, "total_steps": 1638, "loss": 0.9256948828697205, "lr": 1.982793483959585e-05, "epoch": 0.27472527472527475, "percentage": 9.16, "elapsed_time": "1:25:32", "remaining_time": "14:08:34"} | |
| {"current_steps": 152, "total_steps": 1638, "loss": 0.635033369064331, "lr": 1.9820935865883924e-05, "epoch": 0.2783882783882784, "percentage": 9.28, "elapsed_time": "1:26:19", "remaining_time": "14:04:01"} | |
| {"current_steps": 154, "total_steps": 1638, "loss": 1.1024783849716187, "lr": 1.981379879879874e-05, "epoch": 0.28205128205128205, "percentage": 9.4, "elapsed_time": "1:27:12", "remaining_time": "14:00:19"} | |
| {"current_steps": 156, "total_steps": 1638, "loss": 1.3396060466766357, "lr": 1.9806523750072385e-05, "epoch": 0.2857142857142857, "percentage": 9.52, "elapsed_time": "1:28:22", "remaining_time": "13:59:35"} | |
| {"current_steps": 158, "total_steps": 1638, "loss": 1.2755292654037476, "lr": 1.9799110833597093e-05, "epoch": 0.2893772893772894, "percentage": 9.65, "elapsed_time": "1:29:35", "remaining_time": "13:59:09"} | |
| {"current_steps": 160, "total_steps": 1638, "loss": 0.9428300857543945, "lr": 1.9791560165423433e-05, "epoch": 0.29304029304029305, "percentage": 9.77, "elapsed_time": "1:30:41", "remaining_time": "13:57:43"} | |
| {"current_steps": 162, "total_steps": 1638, "loss": 1.5332800149917603, "lr": 1.9783871863758503e-05, "epoch": 0.2967032967032967, "percentage": 9.89, "elapsed_time": "1:31:58", "remaining_time": "13:58:01"} | |
| {"current_steps": 164, "total_steps": 1638, "loss": 1.0453159809112549, "lr": 1.9776046048964082e-05, "epoch": 0.30036630036630035, "percentage": 10.01, "elapsed_time": "1:33:05", "remaining_time": "13:56:42"} | |
| {"current_steps": 166, "total_steps": 1638, "loss": 1.389228105545044, "lr": 1.9768082843554737e-05, "epoch": 0.304029304029304, "percentage": 10.13, "elapsed_time": "1:34:16", "remaining_time": "13:55:58"} | |
| {"current_steps": 168, "total_steps": 1638, "loss": 1.129130244255066, "lr": 1.9759982372195918e-05, "epoch": 0.3076923076923077, "percentage": 10.26, "elapsed_time": "1:35:23", "remaining_time": "13:54:40"} | |
| {"current_steps": 170, "total_steps": 1638, "loss": 1.2560561895370483, "lr": 1.9751744761701984e-05, "epoch": 0.31135531135531136, "percentage": 10.38, "elapsed_time": "1:36:36", "remaining_time": "13:54:10"} | |
| {"current_steps": 172, "total_steps": 1638, "loss": 1.0026013851165771, "lr": 1.9743370141034248e-05, "epoch": 0.315018315018315, "percentage": 10.5, "elapsed_time": "1:37:43", "remaining_time": "13:52:59"} | |
| {"current_steps": 174, "total_steps": 1638, "loss": 0.8527880311012268, "lr": 1.973485864129894e-05, "epoch": 0.31868131868131866, "percentage": 10.62, "elapsed_time": "1:38:42", "remaining_time": "13:50:29"} | |
| {"current_steps": 176, "total_steps": 1638, "loss": 1.3922659158706665, "lr": 1.9726210395745148e-05, "epoch": 0.32234432234432236, "percentage": 10.74, "elapsed_time": "1:39:56", "remaining_time": "13:50:14"} | |
| {"current_steps": 178, "total_steps": 1638, "loss": 0.921902596950531, "lr": 1.971742553976275e-05, "epoch": 0.326007326007326, "percentage": 10.87, "elapsed_time": "1:41:06", "remaining_time": "13:49:21"} | |
| {"current_steps": 180, "total_steps": 1638, "loss": 1.4865270853042603, "lr": 1.9708504210880284e-05, "epoch": 0.32967032967032966, "percentage": 10.99, "elapsed_time": "1:42:13", "remaining_time": "13:47:58"} | |
| {"current_steps": 182, "total_steps": 1638, "loss": 0.973817765712738, "lr": 1.969944654876279e-05, "epoch": 0.3333333333333333, "percentage": 11.11, "elapsed_time": "1:43:19", "remaining_time": "13:46:34"} | |
| {"current_steps": 184, "total_steps": 1638, "loss": 1.2389814853668213, "lr": 1.9690252695209636e-05, "epoch": 0.336996336996337, "percentage": 11.23, "elapsed_time": "1:44:31", "remaining_time": "13:46:00"} | |
| {"current_steps": 186, "total_steps": 1638, "loss": 1.3093231916427612, "lr": 1.9680922794152294e-05, "epoch": 0.34065934065934067, "percentage": 11.36, "elapsed_time": "1:45:34", "remaining_time": "13:44:10"} | |
| {"current_steps": 188, "total_steps": 1638, "loss": 1.1712009906768799, "lr": 1.9671456991652072e-05, "epoch": 0.3443223443223443, "percentage": 11.48, "elapsed_time": "1:46:49", "remaining_time": "13:43:52"} | |
| {"current_steps": 190, "total_steps": 1638, "loss": 1.2694875001907349, "lr": 1.9661855435897858e-05, "epoch": 0.34798534798534797, "percentage": 11.6, "elapsed_time": "1:48:01", "remaining_time": "13:43:14"} | |
| {"current_steps": 192, "total_steps": 1638, "loss": 1.0975149869918823, "lr": 1.9652118277203767e-05, "epoch": 0.3516483516483517, "percentage": 11.72, "elapsed_time": "1:49:04", "remaining_time": "13:41:29"} | |
| {"current_steps": 194, "total_steps": 1638, "loss": 1.2505216598510742, "lr": 1.9642245668006814e-05, "epoch": 0.3553113553113553, "percentage": 11.84, "elapsed_time": "1:50:20", "remaining_time": "13:41:20"} | |
| {"current_steps": 196, "total_steps": 1638, "loss": 1.252524495124817, "lr": 1.963223776286451e-05, "epoch": 0.358974358974359, "percentage": 11.97, "elapsed_time": "1:51:28", "remaining_time": "13:40:08"} | |
| {"current_steps": 198, "total_steps": 1638, "loss": 0.8584736585617065, "lr": 1.9622094718452448e-05, "epoch": 0.3626373626373626, "percentage": 12.09, "elapsed_time": "1:52:31", "remaining_time": "13:38:18"} | |
| {"current_steps": 200, "total_steps": 1638, "loss": 1.0111479759216309, "lr": 1.9611816693561858e-05, "epoch": 0.3663003663003663, "percentage": 12.21, "elapsed_time": "1:53:37", "remaining_time": "13:37:01"} | |
| {"current_steps": 202, "total_steps": 1638, "loss": 1.4314929246902466, "lr": 1.96014038490971e-05, "epoch": 0.36996336996337, "percentage": 12.33, "elapsed_time": "1:54:54", "remaining_time": "13:36:54"} | |
| {"current_steps": 204, "total_steps": 1638, "loss": 1.205211877822876, "lr": 1.9590856348073182e-05, "epoch": 0.37362637362637363, "percentage": 12.45, "elapsed_time": "1:55:59", "remaining_time": "13:35:22"} | |
| {"current_steps": 206, "total_steps": 1638, "loss": 0.7619340419769287, "lr": 1.9580174355613168e-05, "epoch": 0.3772893772893773, "percentage": 12.58, "elapsed_time": "1:56:52", "remaining_time": "13:32:26"} | |
| {"current_steps": 208, "total_steps": 1638, "loss": 1.1318811178207397, "lr": 1.9569358038945617e-05, "epoch": 0.38095238095238093, "percentage": 12.7, "elapsed_time": "1:57:42", "remaining_time": "13:29:13"} | |
| {"current_steps": 210, "total_steps": 1638, "loss": 1.407196283340454, "lr": 1.9558407567401945e-05, "epoch": 0.38461538461538464, "percentage": 12.82, "elapsed_time": "1:58:54", "remaining_time": "13:28:32"} | |
| {"current_steps": 212, "total_steps": 1638, "loss": 1.0677672624588013, "lr": 1.9547323112413806e-05, "epoch": 0.3882783882783883, "percentage": 12.94, "elapsed_time": "2:00:03", "remaining_time": "13:27:36"} | |
| {"current_steps": 214, "total_steps": 1638, "loss": 1.1320164203643799, "lr": 1.9536104847510384e-05, "epoch": 0.39194139194139194, "percentage": 13.06, "elapsed_time": "2:01:07", "remaining_time": "13:25:58"} | |
| {"current_steps": 216, "total_steps": 1638, "loss": 1.2214279174804688, "lr": 1.9524752948315677e-05, "epoch": 0.3956043956043956, "percentage": 13.19, "elapsed_time": "2:02:15", "remaining_time": "13:24:51"} | |
| {"current_steps": 218, "total_steps": 1638, "loss": 1.2568475008010864, "lr": 1.9513267592545752e-05, "epoch": 0.3992673992673993, "percentage": 13.31, "elapsed_time": "2:03:35", "remaining_time": "13:25:02"} | |
| {"current_steps": 220, "total_steps": 1638, "loss": 0.6154600381851196, "lr": 1.9501648960005964e-05, "epoch": 0.40293040293040294, "percentage": 13.43, "elapsed_time": "2:04:38", "remaining_time": "13:23:20"} | |
| {"current_steps": 222, "total_steps": 1638, "loss": 1.3387889862060547, "lr": 1.948989723258815e-05, "epoch": 0.4065934065934066, "percentage": 13.55, "elapsed_time": "2:05:46", "remaining_time": "13:22:13"} | |
| {"current_steps": 224, "total_steps": 1638, "loss": 1.0697791576385498, "lr": 1.9478012594267757e-05, "epoch": 0.41025641025641024, "percentage": 13.68, "elapsed_time": "2:06:46", "remaining_time": "13:20:19"} | |
| {"current_steps": 226, "total_steps": 1638, "loss": 1.2391932010650635, "lr": 1.946599523110099e-05, "epoch": 0.4139194139194139, "percentage": 13.8, "elapsed_time": "2:07:59", "remaining_time": "13:19:37"} | |
| {"current_steps": 228, "total_steps": 1638, "loss": 1.3020524978637695, "lr": 1.945384533122187e-05, "epoch": 0.4175824175824176, "percentage": 13.92, "elapsed_time": "2:09:08", "remaining_time": "13:18:35"} | |
| {"current_steps": 230, "total_steps": 1638, "loss": 1.2441799640655518, "lr": 1.9441563084839324e-05, "epoch": 0.42124542124542125, "percentage": 14.04, "elapsed_time": "2:10:07", "remaining_time": "13:16:37"} | |
| {"current_steps": 232, "total_steps": 1638, "loss": 0.9889364838600159, "lr": 1.942914868423417e-05, "epoch": 0.4249084249084249, "percentage": 14.16, "elapsed_time": "2:11:03", "remaining_time": "13:14:16"} | |
| {"current_steps": 234, "total_steps": 1638, "loss": 1.4886717796325684, "lr": 1.941660232375614e-05, "epoch": 0.42857142857142855, "percentage": 14.29, "elapsed_time": "2:12:19", "remaining_time": "13:13:54"} | |
| {"current_steps": 236, "total_steps": 1638, "loss": 1.016167163848877, "lr": 1.9403924199820813e-05, "epoch": 0.43223443223443225, "percentage": 14.41, "elapsed_time": "2:13:21", "remaining_time": "13:12:11"} | |
| {"current_steps": 238, "total_steps": 1638, "loss": 1.0665429830551147, "lr": 1.9391114510906546e-05, "epoch": 0.4358974358974359, "percentage": 14.53, "elapsed_time": "2:14:25", "remaining_time": "13:10:41"} | |
| {"current_steps": 240, "total_steps": 1638, "loss": 0.8994011878967285, "lr": 1.937817345755138e-05, "epoch": 0.43956043956043955, "percentage": 14.65, "elapsed_time": "2:15:28", "remaining_time": "13:09:06"} | |
| {"current_steps": 242, "total_steps": 1638, "loss": 0.8775147795677185, "lr": 1.9365101242349883e-05, "epoch": 0.4432234432234432, "percentage": 14.77, "elapsed_time": "2:16:39", "remaining_time": "13:08:21"} | |
| {"current_steps": 244, "total_steps": 1638, "loss": 0.5708340406417847, "lr": 1.9351898069949985e-05, "epoch": 0.4468864468864469, "percentage": 14.9, "elapsed_time": "2:17:37", "remaining_time": "13:06:16"} | |
| {"current_steps": 246, "total_steps": 1638, "loss": 1.2500817775726318, "lr": 1.9338564147049785e-05, "epoch": 0.45054945054945056, "percentage": 15.02, "elapsed_time": "2:18:43", "remaining_time": "13:04:58"} | |
| {"current_steps": 248, "total_steps": 1638, "loss": 0.8762341141700745, "lr": 1.9325099682394296e-05, "epoch": 0.4542124542124542, "percentage": 15.14, "elapsed_time": "2:19:44", "remaining_time": "13:03:14"} | |
| {"current_steps": 250, "total_steps": 1638, "loss": 1.2730751037597656, "lr": 1.9311504886772183e-05, "epoch": 0.45787545787545786, "percentage": 15.26, "elapsed_time": "2:20:46", "remaining_time": "13:01:37"} | |
| {"current_steps": 252, "total_steps": 1638, "loss": 1.1749809980392456, "lr": 1.929777997301248e-05, "epoch": 0.46153846153846156, "percentage": 15.38, "elapsed_time": "2:21:54", "remaining_time": "13:00:28"} | |
| {"current_steps": 254, "total_steps": 1638, "loss": 0.9623442888259888, "lr": 1.9283925155981228e-05, "epoch": 0.4652014652014652, "percentage": 15.51, "elapsed_time": "2:23:01", "remaining_time": "12:59:18"} | |
| {"current_steps": 256, "total_steps": 1638, "loss": 1.2541102170944214, "lr": 1.9269940652578143e-05, "epoch": 0.46886446886446886, "percentage": 15.63, "elapsed_time": "2:24:11", "remaining_time": "12:58:27"} | |
| {"current_steps": 258, "total_steps": 1638, "loss": 1.2813372611999512, "lr": 1.9255826681733194e-05, "epoch": 0.4725274725274725, "percentage": 15.75, "elapsed_time": "2:25:31", "remaining_time": "12:58:23"} | |
| {"current_steps": 260, "total_steps": 1638, "loss": 0.7289823293685913, "lr": 1.924158346440319e-05, "epoch": 0.47619047619047616, "percentage": 15.87, "elapsed_time": "2:26:26", "remaining_time": "12:56:07"} | |
| {"current_steps": 262, "total_steps": 1638, "loss": 1.1414886713027954, "lr": 1.9227211223568317e-05, "epoch": 0.47985347985347987, "percentage": 16.0, "elapsed_time": "2:27:27", "remaining_time": "12:54:26"} | |
| {"current_steps": 264, "total_steps": 1638, "loss": 1.2244765758514404, "lr": 1.9212710184228654e-05, "epoch": 0.4835164835164835, "percentage": 16.12, "elapsed_time": "2:28:37", "remaining_time": "12:53:30"} | |
| {"current_steps": 266, "total_steps": 1638, "loss": 1.4965099096298218, "lr": 1.9198080573400634e-05, "epoch": 0.48717948717948717, "percentage": 16.24, "elapsed_time": "2:29:53", "remaining_time": "12:53:09"} | |
| {"current_steps": 268, "total_steps": 1638, "loss": 0.7830735445022583, "lr": 1.9183322620113505e-05, "epoch": 0.4908424908424908, "percentage": 16.36, "elapsed_time": "2:30:56", "remaining_time": "12:51:35"} | |
| {"current_steps": 270, "total_steps": 1638, "loss": 1.1967202425003052, "lr": 1.916843655540574e-05, "epoch": 0.4945054945054945, "percentage": 16.48, "elapsed_time": "2:32:09", "remaining_time": "12:50:55"} | |
| {"current_steps": 272, "total_steps": 1638, "loss": 0.8744536638259888, "lr": 1.915342261232142e-05, "epoch": 0.4981684981684982, "percentage": 16.61, "elapsed_time": "2:33:13", "remaining_time": "12:49:32"} | |
| {"current_steps": 274, "total_steps": 1638, "loss": 1.248273253440857, "lr": 1.913828102590659e-05, "epoch": 0.5018315018315018, "percentage": 16.73, "elapsed_time": "2:34:25", "remaining_time": "12:48:42"} | |
| {"current_steps": 276, "total_steps": 1638, "loss": 0.8064572215080261, "lr": 1.9123012033205564e-05, "epoch": 0.5054945054945055, "percentage": 16.85, "elapsed_time": "2:35:21", "remaining_time": "12:46:40"} | |
| {"current_steps": 278, "total_steps": 1638, "loss": 0.8765072226524353, "lr": 1.9107615873257234e-05, "epoch": 0.5091575091575091, "percentage": 16.97, "elapsed_time": "2:36:17", "remaining_time": "12:44:33"} | |
| {"current_steps": 280, "total_steps": 1638, "loss": 1.2487154006958008, "lr": 1.909209278709131e-05, "epoch": 0.5128205128205128, "percentage": 17.09, "elapsed_time": "2:37:28", "remaining_time": "12:43:43"} | |
| {"current_steps": 282, "total_steps": 1638, "loss": 1.2448886632919312, "lr": 1.9076443017724568e-05, "epoch": 0.5164835164835165, "percentage": 17.22, "elapsed_time": "2:38:44", "remaining_time": "12:43:17"} | |
| {"current_steps": 284, "total_steps": 1638, "loss": 1.2436648607254028, "lr": 1.9060666810157025e-05, "epoch": 0.5201465201465202, "percentage": 17.34, "elapsed_time": "2:39:54", "remaining_time": "12:42:21"} | |
| {"current_steps": 286, "total_steps": 1638, "loss": 1.0280476808547974, "lr": 1.9044764411368106e-05, "epoch": 0.5238095238095238, "percentage": 17.46, "elapsed_time": "2:41:01", "remaining_time": "12:41:14"} | |
| {"current_steps": 288, "total_steps": 1638, "loss": 1.2490639686584473, "lr": 1.9028736070312796e-05, "epoch": 0.5274725274725275, "percentage": 17.58, "elapsed_time": "2:42:12", "remaining_time": "12:40:20"} | |
| {"current_steps": 290, "total_steps": 1638, "loss": 1.2165021896362305, "lr": 1.9012582037917713e-05, "epoch": 0.5311355311355311, "percentage": 17.7, "elapsed_time": "2:43:32", "remaining_time": "12:40:10"} | |
| {"current_steps": 292, "total_steps": 1638, "loss": 0.7315054535865784, "lr": 1.8996302567077217e-05, "epoch": 0.5347985347985348, "percentage": 17.83, "elapsed_time": "2:44:28", "remaining_time": "12:38:09"} | |
| {"current_steps": 294, "total_steps": 1638, "loss": 0.9443866610527039, "lr": 1.897989791264941e-05, "epoch": 0.5384615384615384, "percentage": 17.95, "elapsed_time": "2:45:22", "remaining_time": "12:36:02"} | |
| {"current_steps": 296, "total_steps": 1638, "loss": 1.0225800275802612, "lr": 1.8963368331452172e-05, "epoch": 0.5421245421245421, "percentage": 18.07, "elapsed_time": "2:46:20", "remaining_time": "12:34:08"} | |
| {"current_steps": 298, "total_steps": 1638, "loss": 1.2971231937408447, "lr": 1.8946714082259145e-05, "epoch": 0.5457875457875457, "percentage": 18.19, "elapsed_time": "2:47:35", "remaining_time": "12:33:35"} | |
| {"current_steps": 300, "total_steps": 1638, "loss": 1.1916959285736084, "lr": 1.8929935425795655e-05, "epoch": 0.5494505494505495, "percentage": 18.32, "elapsed_time": "2:48:35", "remaining_time": "12:31:53"} | |
| {"current_steps": 302, "total_steps": 1638, "loss": 1.1871459484100342, "lr": 1.8913032624734657e-05, "epoch": 0.5531135531135531, "percentage": 18.44, "elapsed_time": "2:49:45", "remaining_time": "12:30:58"} | |
| {"current_steps": 304, "total_steps": 1638, "loss": 0.9745575189590454, "lr": 1.8896005943692614e-05, "epoch": 0.5567765567765568, "percentage": 18.56, "elapsed_time": "2:50:45", "remaining_time": "12:29:18"} | |
| {"current_steps": 306, "total_steps": 1638, "loss": 0.9455310106277466, "lr": 1.8878855649225346e-05, "epoch": 0.5604395604395604, "percentage": 18.68, "elapsed_time": "2:51:50", "remaining_time": "12:28:02"} | |
| {"current_steps": 308, "total_steps": 1638, "loss": 1.4047762155532837, "lr": 1.8861582009823868e-05, "epoch": 0.5641025641025641, "percentage": 18.8, "elapsed_time": "2:52:52", "remaining_time": "12:26:29"} | |
| {"current_steps": 310, "total_steps": 1638, "loss": 0.9867649078369141, "lr": 1.884418529591018e-05, "epoch": 0.5677655677655677, "percentage": 18.93, "elapsed_time": "2:54:00", "remaining_time": "12:25:25"} | |
| {"current_steps": 312, "total_steps": 1638, "loss": 1.2163362503051758, "lr": 1.882666577983304e-05, "epoch": 0.5714285714285714, "percentage": 19.05, "elapsed_time": "2:54:50", "remaining_time": "12:23:03"} | |
| {"current_steps": 314, "total_steps": 1638, "loss": 1.1416099071502686, "lr": 1.8809023735863693e-05, "epoch": 0.575091575091575, "percentage": 19.17, "elapsed_time": "2:56:08", "remaining_time": "12:22:42"} | |
| {"current_steps": 316, "total_steps": 1638, "loss": 1.2828210592269897, "lr": 1.879125944019158e-05, "epoch": 0.5787545787545788, "percentage": 19.29, "elapsed_time": "2:57:06", "remaining_time": "12:20:56"} | |
| {"current_steps": 318, "total_steps": 1638, "loss": 1.1197882890701294, "lr": 1.8773373170920022e-05, "epoch": 0.5824175824175825, "percentage": 19.41, "elapsed_time": "2:58:11", "remaining_time": "12:19:40"} | |
| {"current_steps": 320, "total_steps": 1638, "loss": 1.3375335931777954, "lr": 1.875536520806185e-05, "epoch": 0.5860805860805861, "percentage": 19.54, "elapsed_time": "2:59:16", "remaining_time": "12:18:25"} | |
| {"current_steps": 322, "total_steps": 1638, "loss": 1.5252546072006226, "lr": 1.8737235833535033e-05, "epoch": 0.5897435897435898, "percentage": 19.66, "elapsed_time": "3:00:32", "remaining_time": "12:17:51"} | |
| {"current_steps": 324, "total_steps": 1638, "loss": 1.2672544717788696, "lr": 1.871898533115827e-05, "epoch": 0.5934065934065934, "percentage": 19.78, "elapsed_time": "3:01:44", "remaining_time": "12:17:01"} | |
| {"current_steps": 326, "total_steps": 1638, "loss": 1.359837293624878, "lr": 1.870061398664653e-05, "epoch": 0.5970695970695971, "percentage": 19.9, "elapsed_time": "3:02:45", "remaining_time": "12:15:32"} | |
| {"current_steps": 328, "total_steps": 1638, "loss": 1.2261296510696411, "lr": 1.868212208760658e-05, "epoch": 0.6007326007326007, "percentage": 20.02, "elapsed_time": "3:03:56", "remaining_time": "12:14:38"} | |
| {"current_steps": 330, "total_steps": 1638, "loss": 1.1154756546020508, "lr": 1.8663509923532514e-05, "epoch": 0.6043956043956044, "percentage": 20.15, "elapsed_time": "3:05:10", "remaining_time": "12:13:59"} | |
| {"current_steps": 332, "total_steps": 1638, "loss": 1.1825737953186035, "lr": 1.8644777785801175e-05, "epoch": 0.608058608058608, "percentage": 20.27, "elapsed_time": "3:06:23", "remaining_time": "12:13:13"} | |
| {"current_steps": 334, "total_steps": 1638, "loss": 1.2822954654693604, "lr": 1.862592596766763e-05, "epoch": 0.6117216117216118, "percentage": 20.39, "elapsed_time": "3:07:33", "remaining_time": "12:12:14"} | |
| {"current_steps": 336, "total_steps": 1638, "loss": 0.8941524028778076, "lr": 1.8606954764260556e-05, "epoch": 0.6153846153846154, "percentage": 20.51, "elapsed_time": "3:08:26", "remaining_time": "12:10:13"} | |
| {"current_steps": 338, "total_steps": 1638, "loss": 1.2352339029312134, "lr": 1.8587864472577632e-05, "epoch": 0.6190476190476191, "percentage": 20.63, "elapsed_time": "3:09:42", "remaining_time": "12:09:38"} | |
| {"current_steps": 340, "total_steps": 1638, "loss": 1.2283233404159546, "lr": 1.8568655391480882e-05, "epoch": 0.6227106227106227, "percentage": 20.76, "elapsed_time": "3:10:52", "remaining_time": "12:08:43"} | |
| {"current_steps": 342, "total_steps": 1638, "loss": 0.5744314193725586, "lr": 1.8549327821692008e-05, "epoch": 0.6263736263736264, "percentage": 20.88, "elapsed_time": "3:11:54", "remaining_time": "12:07:14"} | |
| {"current_steps": 344, "total_steps": 1638, "loss": 1.4381451606750488, "lr": 1.852988206578767e-05, "epoch": 0.63003663003663, "percentage": 21.0, "elapsed_time": "3:13:04", "remaining_time": "12:06:16"} | |
| {"current_steps": 346, "total_steps": 1638, "loss": 0.6940987706184387, "lr": 1.851031842819475e-05, "epoch": 0.6336996336996337, "percentage": 21.12, "elapsed_time": "3:14:14", "remaining_time": "12:05:20"} | |
| {"current_steps": 348, "total_steps": 1638, "loss": 1.1522477865219116, "lr": 1.849063721518559e-05, "epoch": 0.6373626373626373, "percentage": 21.25, "elapsed_time": "3:15:14", "remaining_time": "12:03:44"} | |
| {"current_steps": 350, "total_steps": 1638, "loss": 0.8789457082748413, "lr": 1.8470838734873205e-05, "epoch": 0.6410256410256411, "percentage": 21.37, "elapsed_time": "3:16:20", "remaining_time": "12:02:32"} | |
| {"current_steps": 352, "total_steps": 1638, "loss": 0.921578049659729, "lr": 1.8450923297206446e-05, "epoch": 0.6446886446886447, "percentage": 21.49, "elapsed_time": "3:17:22", "remaining_time": "12:01:07"} | |
| {"current_steps": 354, "total_steps": 1638, "loss": 0.9506340026855469, "lr": 1.8430891213965146e-05, "epoch": 0.6483516483516484, "percentage": 21.61, "elapsed_time": "3:18:36", "remaining_time": "12:00:22"} | |
| {"current_steps": 356, "total_steps": 1638, "loss": 1.1715575456619263, "lr": 1.8410742798755255e-05, "epoch": 0.652014652014652, "percentage": 21.73, "elapsed_time": "3:19:38", "remaining_time": "11:58:55"} | |
| {"current_steps": 358, "total_steps": 1638, "loss": 1.151785135269165, "lr": 1.8390478367003922e-05, "epoch": 0.6556776556776557, "percentage": 21.86, "elapsed_time": "3:20:42", "remaining_time": "11:57:35"} | |
| {"current_steps": 360, "total_steps": 1638, "loss": 0.6752058267593384, "lr": 1.8370098235954553e-05, "epoch": 0.6593406593406593, "percentage": 21.98, "elapsed_time": "3:21:46", "remaining_time": "11:56:16"} | |
| {"current_steps": 362, "total_steps": 1638, "loss": 0.9451608657836914, "lr": 1.834960272466184e-05, "epoch": 0.663003663003663, "percentage": 22.1, "elapsed_time": "3:23:00", "remaining_time": "11:55:35"} | |
| {"current_steps": 364, "total_steps": 1638, "loss": 0.9306972026824951, "lr": 1.832899215398679e-05, "epoch": 0.6666666666666666, "percentage": 22.22, "elapsed_time": "3:23:58", "remaining_time": "11:53:56"} | |
| {"current_steps": 366, "total_steps": 1638, "loss": 1.1896872520446777, "lr": 1.8308266846591673e-05, "epoch": 0.6703296703296703, "percentage": 22.34, "elapsed_time": "3:25:09", "remaining_time": "11:52:59"} | |
| {"current_steps": 368, "total_steps": 1638, "loss": 1.0567468404769897, "lr": 1.828742712693499e-05, "epoch": 0.673992673992674, "percentage": 22.47, "elapsed_time": "3:26:18", "remaining_time": "11:52:00"} | |
| {"current_steps": 370, "total_steps": 1638, "loss": 1.0929261445999146, "lr": 1.8266473321266385e-05, "epoch": 0.6776556776556777, "percentage": 22.59, "elapsed_time": "3:27:27", "remaining_time": "11:50:59"} | |
| {"current_steps": 372, "total_steps": 1638, "loss": 1.1846730709075928, "lr": 1.824540575762154e-05, "epoch": 0.6813186813186813, "percentage": 22.71, "elapsed_time": "3:28:22", "remaining_time": "11:49:09"} | |
| {"current_steps": 374, "total_steps": 1638, "loss": 1.2002733945846558, "lr": 1.8224224765817033e-05, "epoch": 0.684981684981685, "percentage": 22.83, "elapsed_time": "3:29:33", "remaining_time": "11:48:13"} | |
| {"current_steps": 376, "total_steps": 1638, "loss": 0.8656985759735107, "lr": 1.820293067744519e-05, "epoch": 0.6886446886446886, "percentage": 22.95, "elapsed_time": "3:30:31", "remaining_time": "11:46:37"} | |
| {"current_steps": 378, "total_steps": 1638, "loss": 0.8376519680023193, "lr": 1.8181523825868882e-05, "epoch": 0.6923076923076923, "percentage": 23.08, "elapsed_time": "3:31:42", "remaining_time": "11:45:42"} | |
| {"current_steps": 380, "total_steps": 1638, "loss": 1.0472257137298584, "lr": 1.816000454621631e-05, "epoch": 0.6959706959706959, "percentage": 23.2, "elapsed_time": "3:32:48", "remaining_time": "11:44:31"} | |
| {"current_steps": 382, "total_steps": 1638, "loss": 0.9799332022666931, "lr": 1.8138373175375744e-05, "epoch": 0.6996336996336996, "percentage": 23.32, "elapsed_time": "3:33:56", "remaining_time": "11:43:24"} | |
| {"current_steps": 384, "total_steps": 1638, "loss": 1.1842981576919556, "lr": 1.8116630051990283e-05, "epoch": 0.7032967032967034, "percentage": 23.44, "elapsed_time": "3:35:03", "remaining_time": "11:42:17"} | |
| {"current_steps": 386, "total_steps": 1638, "loss": 1.0900828838348389, "lr": 1.8094775516452522e-05, "epoch": 0.706959706959707, "percentage": 23.57, "elapsed_time": "3:36:15", "remaining_time": "11:41:26"} | |
| {"current_steps": 388, "total_steps": 1638, "loss": 0.8862177729606628, "lr": 1.807280991089923e-05, "epoch": 0.7106227106227107, "percentage": 23.69, "elapsed_time": "3:37:22", "remaining_time": "11:40:19"} | |
| {"current_steps": 390, "total_steps": 1638, "loss": 1.094993233680725, "lr": 1.8050733579206005e-05, "epoch": 0.7142857142857143, "percentage": 23.81, "elapsed_time": "3:38:30", "remaining_time": "11:39:13"} | |
| {"current_steps": 392, "total_steps": 1638, "loss": 1.1782118082046509, "lr": 1.8028546866981875e-05, "epoch": 0.717948717948718, "percentage": 23.93, "elapsed_time": "3:39:36", "remaining_time": "11:38:00"} | |
| {"current_steps": 394, "total_steps": 1638, "loss": 1.118064284324646, "lr": 1.8006250121563903e-05, "epoch": 0.7216117216117216, "percentage": 24.05, "elapsed_time": "3:40:48", "remaining_time": "11:37:09"} | |
| {"current_steps": 396, "total_steps": 1638, "loss": 1.2445368766784668, "lr": 1.798384369201174e-05, "epoch": 0.7252747252747253, "percentage": 24.18, "elapsed_time": "3:42:00", "remaining_time": "11:36:16"} | |
| {"current_steps": 398, "total_steps": 1638, "loss": 0.921625554561615, "lr": 1.796132792910216e-05, "epoch": 0.7289377289377289, "percentage": 24.3, "elapsed_time": "3:42:58", "remaining_time": "11:34:40"} | |
| {"current_steps": 400, "total_steps": 1638, "loss": 0.8475565910339355, "lr": 1.7938703185323575e-05, "epoch": 0.7326007326007326, "percentage": 24.42, "elapsed_time": "3:44:04", "remaining_time": "11:33:31"} | |
| {"current_steps": 402, "total_steps": 1638, "loss": 1.2503498792648315, "lr": 1.7915969814870508e-05, "epoch": 0.7362637362637363, "percentage": 24.54, "elapsed_time": "3:45:19", "remaining_time": "11:32:46"} | |
| {"current_steps": 404, "total_steps": 1638, "loss": 0.8367084264755249, "lr": 1.789312817363805e-05, "epoch": 0.73992673992674, "percentage": 24.66, "elapsed_time": "3:46:20", "remaining_time": "11:31:20"} | |
| {"current_steps": 406, "total_steps": 1638, "loss": 1.018811821937561, "lr": 1.7870178619216304e-05, "epoch": 0.7435897435897436, "percentage": 24.79, "elapsed_time": "3:47:23", "remaining_time": "11:30:02"} | |
| {"current_steps": 408, "total_steps": 1638, "loss": 1.0193127393722534, "lr": 1.784712151088476e-05, "epoch": 0.7472527472527473, "percentage": 24.91, "elapsed_time": "3:48:26", "remaining_time": "11:28:42"} | |
| {"current_steps": 410, "total_steps": 1638, "loss": 0.8782365322113037, "lr": 1.782395720960669e-05, "epoch": 0.7509157509157509, "percentage": 25.03, "elapsed_time": "3:49:17", "remaining_time": "11:26:44"} | |
| {"current_steps": 412, "total_steps": 1638, "loss": 1.1719920635223389, "lr": 1.780068607802349e-05, "epoch": 0.7545787545787546, "percentage": 25.15, "elapsed_time": "3:50:26", "remaining_time": "11:25:43"} | |
| {"current_steps": 414, "total_steps": 1638, "loss": 0.9983189105987549, "lr": 1.7777308480449006e-05, "epoch": 0.7582417582417582, "percentage": 25.27, "elapsed_time": "3:51:30", "remaining_time": "11:24:26"} | |
| {"current_steps": 416, "total_steps": 1638, "loss": 1.2759360074996948, "lr": 1.7753824782863827e-05, "epoch": 0.7619047619047619, "percentage": 25.4, "elapsed_time": "3:52:39", "remaining_time": "11:23:25"} | |
| {"current_steps": 418, "total_steps": 1638, "loss": 0.6787292957305908, "lr": 1.773023535290956e-05, "epoch": 0.7655677655677655, "percentage": 25.52, "elapsed_time": "3:53:37", "remaining_time": "11:21:52"} | |
| {"current_steps": 420, "total_steps": 1638, "loss": 1.2402327060699463, "lr": 1.7706540559883066e-05, "epoch": 0.7692307692307693, "percentage": 25.64, "elapsed_time": "3:54:48", "remaining_time": "11:20:55"} | |
| {"current_steps": 422, "total_steps": 1638, "loss": 0.9981698989868164, "lr": 1.7682740774730688e-05, "epoch": 0.7728937728937729, "percentage": 25.76, "elapsed_time": "3:55:52", "remaining_time": "11:19:40"} | |
| {"current_steps": 424, "total_steps": 1638, "loss": 0.4972750246524811, "lr": 1.7658836370042443e-05, "epoch": 0.7765567765567766, "percentage": 25.89, "elapsed_time": "3:56:53", "remaining_time": "11:18:17"} | |
| {"current_steps": 426, "total_steps": 1638, "loss": 0.7953031063079834, "lr": 1.7634827720046178e-05, "epoch": 0.7802197802197802, "percentage": 26.01, "elapsed_time": "3:57:56", "remaining_time": "11:16:56"} | |
| {"current_steps": 428, "total_steps": 1638, "loss": 1.0705211162567139, "lr": 1.7610715200601727e-05, "epoch": 0.7838827838827839, "percentage": 26.13, "elapsed_time": "3:58:50", "remaining_time": "11:15:13"} | |
| {"current_steps": 430, "total_steps": 1638, "loss": 1.2132048606872559, "lr": 1.7586499189195016e-05, "epoch": 0.7875457875457875, "percentage": 26.25, "elapsed_time": "4:00:00", "remaining_time": "11:14:16"} | |
| {"current_steps": 432, "total_steps": 1638, "loss": 1.2941319942474365, "lr": 1.7562180064932158e-05, "epoch": 0.7912087912087912, "percentage": 26.37, "elapsed_time": "4:01:07", "remaining_time": "11:13:09"} | |
| {"current_steps": 434, "total_steps": 1638, "loss": 0.8893874883651733, "lr": 1.7537758208533516e-05, "epoch": 0.7948717948717948, "percentage": 26.5, "elapsed_time": "4:02:21", "remaining_time": "11:12:19"} | |
| {"current_steps": 436, "total_steps": 1638, "loss": 0.963141918182373, "lr": 1.7513234002327738e-05, "epoch": 0.7985347985347986, "percentage": 26.62, "elapsed_time": "4:03:23", "remaining_time": "11:10:59"} | |
| {"current_steps": 438, "total_steps": 1638, "loss": 0.8931108117103577, "lr": 1.748860783024579e-05, "epoch": 0.8021978021978022, "percentage": 26.74, "elapsed_time": "4:04:28", "remaining_time": "11:09:48"} | |
| {"current_steps": 440, "total_steps": 1638, "loss": 1.3087202310562134, "lr": 1.746388007781492e-05, "epoch": 0.8058608058608059, "percentage": 26.86, "elapsed_time": "4:05:29", "remaining_time": "11:08:24"} | |
| {"current_steps": 442, "total_steps": 1638, "loss": 1.202932596206665, "lr": 1.7439051132152644e-05, "epoch": 0.8095238095238095, "percentage": 26.98, "elapsed_time": "4:06:45", "remaining_time": "11:07:42"} | |
| {"current_steps": 444, "total_steps": 1638, "loss": 1.2049145698547363, "lr": 1.741412138196067e-05, "epoch": 0.8131868131868132, "percentage": 27.11, "elapsed_time": "4:07:56", "remaining_time": "11:06:46"} | |
| {"current_steps": 446, "total_steps": 1638, "loss": 1.2221276760101318, "lr": 1.738909121751882e-05, "epoch": 0.8168498168498168, "percentage": 27.23, "elapsed_time": "4:09:02", "remaining_time": "11:05:36"} | |
| {"current_steps": 448, "total_steps": 1638, "loss": 1.230087161064148, "lr": 1.736396103067893e-05, "epoch": 0.8205128205128205, "percentage": 27.35, "elapsed_time": "4:10:20", "remaining_time": "11:04:58"} | |
| {"current_steps": 450, "total_steps": 1638, "loss": 1.3565971851348877, "lr": 1.7338731214858688e-05, "epoch": 0.8241758241758241, "percentage": 27.47, "elapsed_time": "4:11:35", "remaining_time": "11:04:11"} | |
| {"current_steps": 452, "total_steps": 1638, "loss": 0.9984432458877563, "lr": 1.7313402165035504e-05, "epoch": 0.8278388278388278, "percentage": 27.59, "elapsed_time": "4:12:33", "remaining_time": "11:02:41"} | |
| {"current_steps": 454, "total_steps": 1638, "loss": 0.4852646291255951, "lr": 1.728797427774031e-05, "epoch": 0.8315018315018315, "percentage": 27.72, "elapsed_time": "4:13:17", "remaining_time": "11:00:33"} | |
| {"current_steps": 456, "total_steps": 1638, "loss": 0.8963858485221863, "lr": 1.7262447951051366e-05, "epoch": 0.8351648351648352, "percentage": 27.84, "elapsed_time": "4:14:11", "remaining_time": "10:58:54"} | |
| {"current_steps": 458, "total_steps": 1638, "loss": 0.8434333801269531, "lr": 1.7236823584587995e-05, "epoch": 0.8388278388278388, "percentage": 27.96, "elapsed_time": "4:15:12", "remaining_time": "10:57:31"} | |
| {"current_steps": 460, "total_steps": 1638, "loss": 1.029900312423706, "lr": 1.7211101579504382e-05, "epoch": 0.8424908424908425, "percentage": 28.08, "elapsed_time": "4:16:05", "remaining_time": "10:55:49"} | |
| {"current_steps": 462, "total_steps": 1638, "loss": 1.2301326990127563, "lr": 1.7185282338483243e-05, "epoch": 0.8461538461538461, "percentage": 28.21, "elapsed_time": "4:17:17", "remaining_time": "10:54:56"} | |
| {"current_steps": 464, "total_steps": 1638, "loss": 1.1807194948196411, "lr": 1.7159366265729537e-05, "epoch": 0.8498168498168498, "percentage": 28.33, "elapsed_time": "4:18:30", "remaining_time": "10:54:05"} | |
| {"current_steps": 466, "total_steps": 1638, "loss": 1.2045433521270752, "lr": 1.713335376696416e-05, "epoch": 0.8534798534798534, "percentage": 28.45, "elapsed_time": "4:19:37", "remaining_time": "10:52:58"} | |
| {"current_steps": 468, "total_steps": 1638, "loss": 0.8860416412353516, "lr": 1.7107245249417556e-05, "epoch": 0.8571428571428571, "percentage": 28.57, "elapsed_time": "4:20:40", "remaining_time": "10:51:41"} | |
| {"current_steps": 470, "total_steps": 1638, "loss": 0.9149615168571472, "lr": 1.7081041121823375e-05, "epoch": 0.8608058608058609, "percentage": 28.69, "elapsed_time": "4:21:47", "remaining_time": "10:50:33"} | |
| {"current_steps": 472, "total_steps": 1638, "loss": 1.1724745035171509, "lr": 1.705474179441205e-05, "epoch": 0.8644688644688645, "percentage": 28.82, "elapsed_time": "4:22:54", "remaining_time": "10:49:28"} | |
| {"current_steps": 474, "total_steps": 1638, "loss": 0.863320529460907, "lr": 1.7028347678904388e-05, "epoch": 0.8681318681318682, "percentage": 28.94, "elapsed_time": "4:23:53", "remaining_time": "10:48:02"} | |
| {"current_steps": 476, "total_steps": 1638, "loss": 1.096718192100525, "lr": 1.700185918850512e-05, "epoch": 0.8717948717948718, "percentage": 29.06, "elapsed_time": "4:24:57", "remaining_time": "10:46:48"} | |
| {"current_steps": 478, "total_steps": 1638, "loss": 1.0467816591262817, "lr": 1.6975276737896443e-05, "epoch": 0.8754578754578755, "percentage": 29.18, "elapsed_time": "4:26:00", "remaining_time": "10:45:33"} | |
| {"current_steps": 480, "total_steps": 1638, "loss": 1.0700321197509766, "lr": 1.69486007432315e-05, "epoch": 0.8791208791208791, "percentage": 29.3, "elapsed_time": "4:27:07", "remaining_time": "10:44:26"} | |
| {"current_steps": 482, "total_steps": 1638, "loss": 1.1908187866210938, "lr": 1.6921831622127905e-05, "epoch": 0.8827838827838828, "percentage": 29.43, "elapsed_time": "4:28:27", "remaining_time": "10:43:51"} | |
| {"current_steps": 484, "total_steps": 1638, "loss": 1.2682039737701416, "lr": 1.6894969793661163e-05, "epoch": 0.8864468864468864, "percentage": 29.55, "elapsed_time": "4:29:38", "remaining_time": "10:42:53"} | |
| {"current_steps": 486, "total_steps": 1638, "loss": 0.9106331467628479, "lr": 1.686801567835814e-05, "epoch": 0.8901098901098901, "percentage": 29.67, "elapsed_time": "4:30:34", "remaining_time": "10:41:22"} | |
| {"current_steps": 488, "total_steps": 1638, "loss": 1.1676743030548096, "lr": 1.6840969698190467e-05, "epoch": 0.8937728937728938, "percentage": 29.79, "elapsed_time": "4:31:42", "remaining_time": "10:40:18"} | |
| {"current_steps": 490, "total_steps": 1638, "loss": 1.1185270547866821, "lr": 1.6813832276567942e-05, "epoch": 0.8974358974358975, "percentage": 29.91, "elapsed_time": "4:32:55", "remaining_time": "10:39:24"} | |
| {"current_steps": 492, "total_steps": 1638, "loss": 1.0551954507827759, "lr": 1.6786603838331894e-05, "epoch": 0.9010989010989011, "percentage": 30.04, "elapsed_time": "4:33:53", "remaining_time": "10:37:57"} | |
| {"current_steps": 494, "total_steps": 1638, "loss": 0.5789248943328857, "lr": 1.6759284809748522e-05, "epoch": 0.9047619047619048, "percentage": 30.16, "elapsed_time": "4:34:47", "remaining_time": "10:36:21"} | |
| {"current_steps": 496, "total_steps": 1638, "loss": 1.2827361822128296, "lr": 1.673187561850225e-05, "epoch": 0.9084249084249084, "percentage": 30.28, "elapsed_time": "4:35:56", "remaining_time": "10:35:19"} | |
| {"current_steps": 498, "total_steps": 1638, "loss": 1.1320176124572754, "lr": 1.6704376693689003e-05, "epoch": 0.9120879120879121, "percentage": 30.4, "elapsed_time": "4:37:03", "remaining_time": "10:34:14"} | |
| {"current_steps": 500, "total_steps": 1638, "loss": 0.8153626322746277, "lr": 1.6676788465809506e-05, "epoch": 0.9157509157509157, "percentage": 30.53, "elapsed_time": "4:38:00", "remaining_time": "10:32:43"} | |
| {"current_steps": 502, "total_steps": 1638, "loss": 0.8643592596054077, "lr": 1.6649111366762552e-05, "epoch": 0.9194139194139194, "percentage": 30.65, "elapsed_time": "4:38:56", "remaining_time": "10:31:14"} | |
| {"current_steps": 504, "total_steps": 1638, "loss": 0.9420300126075745, "lr": 1.66213458298382e-05, "epoch": 0.9230769230769231, "percentage": 30.77, "elapsed_time": "4:40:01", "remaining_time": "10:30:04"} | |
| {"current_steps": 506, "total_steps": 1638, "loss": 0.8538585305213928, "lr": 1.659349228971105e-05, "epoch": 0.9267399267399268, "percentage": 30.89, "elapsed_time": "4:41:26", "remaining_time": "10:29:37"} | |
| {"current_steps": 508, "total_steps": 1638, "loss": 1.1675981283187866, "lr": 1.6565551182433382e-05, "epoch": 0.9304029304029304, "percentage": 31.01, "elapsed_time": "4:42:37", "remaining_time": "10:28:41"} | |
| {"current_steps": 510, "total_steps": 1638, "loss": 1.2252295017242432, "lr": 1.6537522945428386e-05, "epoch": 0.9340659340659341, "percentage": 31.14, "elapsed_time": "4:43:48", "remaining_time": "10:27:43"} | |
| {"current_steps": 512, "total_steps": 1638, "loss": 1.1778167486190796, "lr": 1.6509408017483258e-05, "epoch": 0.9377289377289377, "percentage": 31.26, "elapsed_time": "4:44:58", "remaining_time": "10:26:43"} | |
| {"current_steps": 514, "total_steps": 1638, "loss": 0.9604276418685913, "lr": 1.6481206838742362e-05, "epoch": 0.9413919413919414, "percentage": 31.38, "elapsed_time": "4:46:18", "remaining_time": "10:26:06"} | |
| {"current_steps": 516, "total_steps": 1638, "loss": 1.1854896545410156, "lr": 1.645291985070034e-05, "epoch": 0.945054945054945, "percentage": 31.5, "elapsed_time": "4:47:30", "remaining_time": "10:25:09"} | |
| {"current_steps": 518, "total_steps": 1638, "loss": 1.2195581197738647, "lr": 1.64245474961952e-05, "epoch": 0.9487179487179487, "percentage": 31.62, "elapsed_time": "4:48:37", "remaining_time": "10:24:04"} | |
| {"current_steps": 520, "total_steps": 1638, "loss": 1.218988299369812, "lr": 1.639609021940136e-05, "epoch": 0.9523809523809523, "percentage": 31.75, "elapsed_time": "4:49:44", "remaining_time": "10:22:57"} | |
| {"current_steps": 522, "total_steps": 1638, "loss": 0.8905650973320007, "lr": 1.6367548465822723e-05, "epoch": 0.9560439560439561, "percentage": 31.87, "elapsed_time": "4:50:57", "remaining_time": "10:22:02"} | |
| {"current_steps": 524, "total_steps": 1638, "loss": 1.0242419242858887, "lr": 1.6338922682285697e-05, "epoch": 0.9597069597069597, "percentage": 31.99, "elapsed_time": "4:52:01", "remaining_time": "10:20:50"} | |
| {"current_steps": 526, "total_steps": 1638, "loss": 0.9667062759399414, "lr": 1.6310213316932187e-05, "epoch": 0.9633699633699634, "percentage": 32.11, "elapsed_time": "4:53:08", "remaining_time": "10:19:43"} | |
| {"current_steps": 528, "total_steps": 1638, "loss": 0.6576095819473267, "lr": 1.6281420819212578e-05, "epoch": 0.967032967032967, "percentage": 32.23, "elapsed_time": "4:54:05", "remaining_time": "10:18:15"} | |
| {"current_steps": 530, "total_steps": 1638, "loss": 0.907448947429657, "lr": 1.6252545639878728e-05, "epoch": 0.9706959706959707, "percentage": 32.36, "elapsed_time": "4:55:04", "remaining_time": "10:16:52"} | |
| {"current_steps": 532, "total_steps": 1638, "loss": 1.3604565858840942, "lr": 1.6223588230976874e-05, "epoch": 0.9743589743589743, "percentage": 32.48, "elapsed_time": "4:56:06", "remaining_time": "10:15:35"} | |
| {"current_steps": 534, "total_steps": 1638, "loss": 0.604587733745575, "lr": 1.6194549045840582e-05, "epoch": 0.978021978021978, "percentage": 32.6, "elapsed_time": "4:57:08", "remaining_time": "10:14:18"} | |
| {"current_steps": 536, "total_steps": 1638, "loss": 0.8585751056671143, "lr": 1.616542853908363e-05, "epoch": 0.9816849816849816, "percentage": 32.72, "elapsed_time": "4:58:07", "remaining_time": "10:12:56"} | |
| {"current_steps": 538, "total_steps": 1638, "loss": 0.8037823438644409, "lr": 1.6136227166592912e-05, "epoch": 0.9853479853479854, "percentage": 32.84, "elapsed_time": "4:59:18", "remaining_time": "10:11:58"} | |
| {"current_steps": 540, "total_steps": 1638, "loss": 1.1241040229797363, "lr": 1.6106945385521286e-05, "epoch": 0.989010989010989, "percentage": 32.97, "elapsed_time": "5:00:25", "remaining_time": "10:10:52"} | |
| {"current_steps": 542, "total_steps": 1638, "loss": 1.1745156049728394, "lr": 1.6077583654280416e-05, "epoch": 0.9926739926739927, "percentage": 33.09, "elapsed_time": "5:01:36", "remaining_time": "10:09:53"} | |
| {"current_steps": 544, "total_steps": 1638, "loss": 1.186415195465088, "lr": 1.60481424325336e-05, "epoch": 0.9963369963369964, "percentage": 33.21, "elapsed_time": "5:02:41", "remaining_time": "10:08:43"} | |
| {"current_steps": 546, "total_steps": 1638, "loss": 1.3618619441986084, "lr": 1.6018622181188594e-05, "epoch": 1.0, "percentage": 33.33, "elapsed_time": "5:03:48", "remaining_time": "10:07:37"} | |
| {"current_steps": 548, "total_steps": 1638, "loss": 0.9441794157028198, "lr": 1.598902336239035e-05, "epoch": 1.0036630036630036, "percentage": 33.46, "elapsed_time": "5:05:00", "remaining_time": "10:06:40"} | |
| {"current_steps": 550, "total_steps": 1638, "loss": 1.1801525354385376, "lr": 1.595934643951382e-05, "epoch": 1.0073260073260073, "percentage": 33.58, "elapsed_time": "5:06:11", "remaining_time": "10:05:41"} | |
| {"current_steps": 552, "total_steps": 1638, "loss": 0.6706070899963379, "lr": 1.5929591877156694e-05, "epoch": 1.010989010989011, "percentage": 33.7, "elapsed_time": "5:07:05", "remaining_time": "10:04:10"} | |
| {"current_steps": 554, "total_steps": 1638, "loss": 1.1351317167282104, "lr": 1.5899760141132115e-05, "epoch": 1.0146520146520146, "percentage": 33.82, "elapsed_time": "5:08:19", "remaining_time": "10:03:17"} | |
| {"current_steps": 556, "total_steps": 1638, "loss": 1.0271389484405518, "lr": 1.58698516984614e-05, "epoch": 1.0183150183150182, "percentage": 33.94, "elapsed_time": "5:09:21", "remaining_time": "10:02:02"} | |
| {"current_steps": 558, "total_steps": 1638, "loss": 1.2197285890579224, "lr": 1.583986701736672e-05, "epoch": 1.021978021978022, "percentage": 34.07, "elapsed_time": "5:10:28", "remaining_time": "10:00:54"} | |
| {"current_steps": 560, "total_steps": 1638, "loss": 0.9212762713432312, "lr": 1.5809806567263767e-05, "epoch": 1.0256410256410255, "percentage": 34.19, "elapsed_time": "5:11:34", "remaining_time": "9:59:46"} | |
| {"current_steps": 562, "total_steps": 1638, "loss": 1.1707442998886108, "lr": 1.577967081875442e-05, "epoch": 1.0293040293040292, "percentage": 34.31, "elapsed_time": "5:12:55", "remaining_time": "9:59:07"} | |
| {"current_steps": 564, "total_steps": 1638, "loss": 1.1547435522079468, "lr": 1.574946024361936e-05, "epoch": 1.032967032967033, "percentage": 34.43, "elapsed_time": "5:14:07", "remaining_time": "9:58:09"} | |
| {"current_steps": 566, "total_steps": 1638, "loss": 1.044006109237671, "lr": 1.5719175314810706e-05, "epoch": 1.0366300366300367, "percentage": 34.55, "elapsed_time": "5:15:11", "remaining_time": "9:56:58"} | |
| {"current_steps": 568, "total_steps": 1638, "loss": 1.0475645065307617, "lr": 1.568881650644458e-05, "epoch": 1.0402930402930404, "percentage": 34.68, "elapsed_time": "5:16:18", "remaining_time": "9:55:51"} | |
| {"current_steps": 570, "total_steps": 1638, "loss": 1.1118239164352417, "lr": 1.565838429379371e-05, "epoch": 1.043956043956044, "percentage": 34.8, "elapsed_time": "5:17:34", "remaining_time": "9:55:02"} | |
| {"current_steps": 572, "total_steps": 1638, "loss": 1.2503812313079834, "lr": 1.5627879153279986e-05, "epoch": 1.0476190476190477, "percentage": 34.92, "elapsed_time": "5:18:44", "remaining_time": "9:54:00"} | |
| {"current_steps": 574, "total_steps": 1638, "loss": 1.0755858421325684, "lr": 1.559730156246699e-05, "epoch": 1.0512820512820513, "percentage": 35.04, "elapsed_time": "5:19:47", "remaining_time": "9:52:46"} | |
| {"current_steps": 576, "total_steps": 1638, "loss": 1.2320541143417358, "lr": 1.5566652000052533e-05, "epoch": 1.054945054945055, "percentage": 35.16, "elapsed_time": "5:20:43", "remaining_time": "9:51:20"} | |
| {"current_steps": 578, "total_steps": 1638, "loss": 1.2240521907806396, "lr": 1.553593094586115e-05, "epoch": 1.0586080586080586, "percentage": 35.29, "elapsed_time": "5:22:05", "remaining_time": "9:50:41"} | |
| {"current_steps": 580, "total_steps": 1638, "loss": 1.2212425470352173, "lr": 1.5505138880836595e-05, "epoch": 1.0622710622710623, "percentage": 35.41, "elapsed_time": "5:23:16", "remaining_time": "9:49:42"} | |
| {"current_steps": 582, "total_steps": 1638, "loss": 0.9912468194961548, "lr": 1.5474276287034305e-05, "epoch": 1.065934065934066, "percentage": 35.53, "elapsed_time": "5:24:24", "remaining_time": "9:48:37"} | |
| {"current_steps": 584, "total_steps": 1638, "loss": 1.147226333618164, "lr": 1.544334364761387e-05, "epoch": 1.0695970695970696, "percentage": 35.65, "elapsed_time": "5:25:29", "remaining_time": "9:47:27"} | |
| {"current_steps": 586, "total_steps": 1638, "loss": 1.0802392959594727, "lr": 1.541234144683144e-05, "epoch": 1.0732600732600732, "percentage": 35.78, "elapsed_time": "5:26:23", "remaining_time": "9:45:57"} | |
| {"current_steps": 588, "total_steps": 1638, "loss": 0.821092426776886, "lr": 1.5381270170032173e-05, "epoch": 1.0769230769230769, "percentage": 35.9, "elapsed_time": "5:27:28", "remaining_time": "9:44:46"} | |
| {"current_steps": 590, "total_steps": 1638, "loss": 1.2033530473709106, "lr": 1.5350130303642625e-05, "epoch": 1.0805860805860805, "percentage": 36.02, "elapsed_time": "5:28:39", "remaining_time": "9:43:47"} | |
| {"current_steps": 592, "total_steps": 1638, "loss": 1.0244792699813843, "lr": 1.5318922335163128e-05, "epoch": 1.0842490842490842, "percentage": 36.14, "elapsed_time": "5:29:35", "remaining_time": "9:42:20"} | |
| {"current_steps": 594, "total_steps": 1638, "loss": 0.9856408834457397, "lr": 1.5287646753160174e-05, "epoch": 1.0879120879120878, "percentage": 36.26, "elapsed_time": "5:30:51", "remaining_time": "9:41:30"} | |
| {"current_steps": 596, "total_steps": 1638, "loss": 1.0107301473617554, "lr": 1.5256304047258739e-05, "epoch": 1.0915750915750915, "percentage": 36.39, "elapsed_time": "5:32:03", "remaining_time": "9:40:32"} | |
| {"current_steps": 598, "total_steps": 1638, "loss": 1.0993225574493408, "lr": 1.522489470813466e-05, "epoch": 1.0952380952380953, "percentage": 36.51, "elapsed_time": "5:33:11", "remaining_time": "9:39:28"} | |
| {"current_steps": 600, "total_steps": 1638, "loss": 1.1531182527542114, "lr": 1.5193419227506913e-05, "epoch": 1.098901098901099, "percentage": 36.63, "elapsed_time": "5:34:18", "remaining_time": "9:38:21"} | |
| {"current_steps": 602, "total_steps": 1638, "loss": 0.8158029317855835, "lr": 1.5161878098129937e-05, "epoch": 1.1025641025641026, "percentage": 36.75, "elapsed_time": "5:35:28", "remaining_time": "9:37:20"} | |
| {"current_steps": 604, "total_steps": 1638, "loss": 0.9586283564567566, "lr": 1.5130271813785908e-05, "epoch": 1.1062271062271063, "percentage": 36.87, "elapsed_time": "5:36:37", "remaining_time": "9:36:15"} | |
| {"current_steps": 606, "total_steps": 1638, "loss": 0.8959888815879822, "lr": 1.509860086927703e-05, "epoch": 1.10989010989011, "percentage": 37.0, "elapsed_time": "5:37:39", "remaining_time": "9:35:01"} | |
| {"current_steps": 608, "total_steps": 1638, "loss": 1.2862759828567505, "lr": 1.5066865760417757e-05, "epoch": 1.1135531135531136, "percentage": 37.12, "elapsed_time": "5:38:35", "remaining_time": "9:33:35"} | |
| {"current_steps": 610, "total_steps": 1638, "loss": 0.7218859195709229, "lr": 1.5035066984027053e-05, "epoch": 1.1172161172161172, "percentage": 37.24, "elapsed_time": "5:39:38", "remaining_time": "9:32:23"} | |
| {"current_steps": 612, "total_steps": 1638, "loss": 1.1658059358596802, "lr": 1.5003205037920616e-05, "epoch": 1.120879120879121, "percentage": 37.36, "elapsed_time": "5:40:47", "remaining_time": "9:31:19"} | |
| {"current_steps": 614, "total_steps": 1638, "loss": 0.9905625581741333, "lr": 1.497128042090307e-05, "epoch": 1.1245421245421245, "percentage": 37.48, "elapsed_time": "5:41:43", "remaining_time": "9:29:55"} | |
| {"current_steps": 616, "total_steps": 1638, "loss": 1.1560765504837036, "lr": 1.493929363276017e-05, "epoch": 1.1282051282051282, "percentage": 37.61, "elapsed_time": "5:42:51", "remaining_time": "9:28:50"} | |
| {"current_steps": 618, "total_steps": 1638, "loss": 0.41150620579719543, "lr": 1.4907245174250957e-05, "epoch": 1.1318681318681318, "percentage": 37.73, "elapsed_time": "5:43:50", "remaining_time": "9:27:30"} | |
| {"current_steps": 620, "total_steps": 1638, "loss": 1.0893880128860474, "lr": 1.4875135547099953e-05, "epoch": 1.1355311355311355, "percentage": 37.85, "elapsed_time": "5:44:57", "remaining_time": "9:26:23"} | |
| {"current_steps": 622, "total_steps": 1638, "loss": 0.7574386596679688, "lr": 1.484296525398927e-05, "epoch": 1.1391941391941391, "percentage": 37.97, "elapsed_time": "5:45:53", "remaining_time": "9:24:59"} | |
| {"current_steps": 624, "total_steps": 1638, "loss": 1.0913819074630737, "lr": 1.4810734798550769e-05, "epoch": 1.1428571428571428, "percentage": 38.1, "elapsed_time": "5:46:48", "remaining_time": "9:23:34"} | |
| {"current_steps": 626, "total_steps": 1638, "loss": 1.3770023584365845, "lr": 1.4778444685358147e-05, "epoch": 1.1465201465201464, "percentage": 38.22, "elapsed_time": "5:48:04", "remaining_time": "9:22:42"} | |
| {"current_steps": 628, "total_steps": 1638, "loss": 0.4543880820274353, "lr": 1.4746095419919075e-05, "epoch": 1.15018315018315, "percentage": 38.34, "elapsed_time": "5:49:01", "remaining_time": "9:21:19"} | |
| {"current_steps": 630, "total_steps": 1638, "loss": 1.114593744277954, "lr": 1.4713687508667251e-05, "epoch": 1.1538461538461537, "percentage": 38.46, "elapsed_time": "5:49:58", "remaining_time": "9:19:57"} | |
| {"current_steps": 632, "total_steps": 1638, "loss": 1.0868229866027832, "lr": 1.4681221458954484e-05, "epoch": 1.1575091575091574, "percentage": 38.58, "elapsed_time": "5:51:04", "remaining_time": "9:18:49"} | |
| {"current_steps": 634, "total_steps": 1638, "loss": 0.8624401092529297, "lr": 1.4648697779042754e-05, "epoch": 1.1611721611721613, "percentage": 38.71, "elapsed_time": "5:52:19", "remaining_time": "9:17:56"} | |
| {"current_steps": 636, "total_steps": 1638, "loss": 0.9895141124725342, "lr": 1.461611697809625e-05, "epoch": 1.164835164835165, "percentage": 38.83, "elapsed_time": "5:53:21", "remaining_time": "9:16:43"} | |
| {"current_steps": 638, "total_steps": 1638, "loss": 1.1844947338104248, "lr": 1.4583479566173401e-05, "epoch": 1.1684981684981686, "percentage": 38.95, "elapsed_time": "5:54:32", "remaining_time": "9:15:42"} | |
| {"current_steps": 640, "total_steps": 1638, "loss": 0.7458541989326477, "lr": 1.4550786054218902e-05, "epoch": 1.1721611721611722, "percentage": 39.07, "elapsed_time": "5:55:33", "remaining_time": "9:14:26"} | |
| {"current_steps": 642, "total_steps": 1638, "loss": 1.1217985153198242, "lr": 1.4518036954055685e-05, "epoch": 1.1758241758241759, "percentage": 39.19, "elapsed_time": "5:56:53", "remaining_time": "9:13:41"} | |
| {"current_steps": 644, "total_steps": 1638, "loss": 1.041925311088562, "lr": 1.4485232778376945e-05, "epoch": 1.1794871794871795, "percentage": 39.32, "elapsed_time": "5:57:57", "remaining_time": "9:12:30"} | |
| {"current_steps": 646, "total_steps": 1638, "loss": 0.7592092156410217, "lr": 1.4452374040738078e-05, "epoch": 1.1831501831501832, "percentage": 39.44, "elapsed_time": "5:58:52", "remaining_time": "9:11:05"} | |
| {"current_steps": 648, "total_steps": 1638, "loss": 0.8962647914886475, "lr": 1.4419461255548666e-05, "epoch": 1.1868131868131868, "percentage": 39.56, "elapsed_time": "5:59:57", "remaining_time": "9:09:56"} | |
| {"current_steps": 650, "total_steps": 1638, "loss": 1.141674280166626, "lr": 1.4386494938064417e-05, "epoch": 1.1904761904761905, "percentage": 39.68, "elapsed_time": "6:01:17", "remaining_time": "9:09:10"} | |
| {"current_steps": 652, "total_steps": 1638, "loss": 0.6671714186668396, "lr": 1.4353475604379093e-05, "epoch": 1.1941391941391941, "percentage": 39.8, "elapsed_time": "6:02:16", "remaining_time": "9:07:50"} | |
| {"current_steps": 654, "total_steps": 1638, "loss": 1.2914996147155762, "lr": 1.4320403771416438e-05, "epoch": 1.1978021978021978, "percentage": 39.93, "elapsed_time": "6:03:24", "remaining_time": "9:06:46"} | |
| {"current_steps": 656, "total_steps": 1638, "loss": 0.8283839225769043, "lr": 1.4287279956922076e-05, "epoch": 1.2014652014652014, "percentage": 40.05, "elapsed_time": "6:04:31", "remaining_time": "9:05:40"} | |
| {"current_steps": 658, "total_steps": 1638, "loss": 0.808253824710846, "lr": 1.4254104679455416e-05, "epoch": 1.205128205128205, "percentage": 40.17, "elapsed_time": "6:05:40", "remaining_time": "9:04:36"} | |
| {"current_steps": 660, "total_steps": 1638, "loss": 1.1553109884262085, "lr": 1.4220878458381523e-05, "epoch": 1.2087912087912087, "percentage": 40.29, "elapsed_time": "6:06:53", "remaining_time": "9:03:39"} | |
| {"current_steps": 662, "total_steps": 1638, "loss": 1.0250654220581055, "lr": 1.418760181386301e-05, "epoch": 1.2124542124542124, "percentage": 40.42, "elapsed_time": "6:07:53", "remaining_time": "9:02:23"} | |
| {"current_steps": 664, "total_steps": 1638, "loss": 0.8853683471679688, "lr": 1.4154275266851856e-05, "epoch": 1.2161172161172162, "percentage": 40.54, "elapsed_time": "6:08:56", "remaining_time": "9:01:11"} | |
| {"current_steps": 666, "total_steps": 1638, "loss": 0.8249969482421875, "lr": 1.4120899339081291e-05, "epoch": 1.2197802197802199, "percentage": 40.66, "elapsed_time": "6:10:05", "remaining_time": "9:00:08"} | |
| {"current_steps": 668, "total_steps": 1638, "loss": 0.9055181741714478, "lr": 1.4087474553057599e-05, "epoch": 1.2234432234432235, "percentage": 40.78, "elapsed_time": "6:11:19", "remaining_time": "8:59:11"} | |
| {"current_steps": 670, "total_steps": 1638, "loss": 0.6745082139968872, "lr": 1.405400143205195e-05, "epoch": 1.2271062271062272, "percentage": 40.9, "elapsed_time": "6:12:21", "remaining_time": "8:57:59"} | |
| {"current_steps": 672, "total_steps": 1638, "loss": 1.1828240156173706, "lr": 1.4020480500092217e-05, "epoch": 1.2307692307692308, "percentage": 41.03, "elapsed_time": "6:13:40", "remaining_time": "8:57:10"} | |
| {"current_steps": 674, "total_steps": 1638, "loss": 1.1596636772155762, "lr": 1.3986912281954745e-05, "epoch": 1.2344322344322345, "percentage": 41.15, "elapsed_time": "6:15:00", "remaining_time": "8:56:22"} | |
| {"current_steps": 676, "total_steps": 1638, "loss": 1.1791561841964722, "lr": 1.3953297303156174e-05, "epoch": 1.2380952380952381, "percentage": 41.27, "elapsed_time": "6:16:11", "remaining_time": "8:55:21"} | |
| {"current_steps": 678, "total_steps": 1638, "loss": 0.7706201076507568, "lr": 1.391963608994517e-05, "epoch": 1.2417582417582418, "percentage": 41.39, "elapsed_time": "6:17:07", "remaining_time": "8:53:59"} | |
| {"current_steps": 680, "total_steps": 1638, "loss": 0.8264885544776917, "lr": 1.3885929169294218e-05, "epoch": 1.2454212454212454, "percentage": 41.51, "elapsed_time": "6:18:16", "remaining_time": "8:52:55"} | |
| {"current_steps": 682, "total_steps": 1638, "loss": 1.192352294921875, "lr": 1.3852177068891364e-05, "epoch": 1.249084249084249, "percentage": 41.64, "elapsed_time": "6:19:27", "remaining_time": "8:51:54"} | |
| {"current_steps": 684, "total_steps": 1638, "loss": 1.1608870029449463, "lr": 1.3818380317131946e-05, "epoch": 1.2527472527472527, "percentage": 41.76, "elapsed_time": "6:20:39", "remaining_time": "8:50:55"} | |
| {"current_steps": 686, "total_steps": 1638, "loss": 0.8176043629646301, "lr": 1.3784539443110323e-05, "epoch": 1.2564102564102564, "percentage": 41.88, "elapsed_time": "6:21:46", "remaining_time": "8:49:48"} | |
| {"current_steps": 688, "total_steps": 1638, "loss": 1.1090242862701416, "lr": 1.375065497661161e-05, "epoch": 1.26007326007326, "percentage": 42.0, "elapsed_time": "6:22:57", "remaining_time": "8:48:47"} | |
| {"current_steps": 690, "total_steps": 1638, "loss": 1.1419543027877808, "lr": 1.3716727448103356e-05, "epoch": 1.2637362637362637, "percentage": 42.12, "elapsed_time": "6:24:14", "remaining_time": "8:47:55"} | |
| {"current_steps": 692, "total_steps": 1638, "loss": 1.1804542541503906, "lr": 1.3682757388727261e-05, "epoch": 1.2673992673992673, "percentage": 42.25, "elapsed_time": "6:25:25", "remaining_time": "8:46:54"} | |
| {"current_steps": 694, "total_steps": 1638, "loss": 0.9813081622123718, "lr": 1.3648745330290848e-05, "epoch": 1.271062271062271, "percentage": 42.37, "elapsed_time": "6:26:32", "remaining_time": "8:45:47"} | |
| {"current_steps": 696, "total_steps": 1638, "loss": 1.0020716190338135, "lr": 1.361469180525916e-05, "epoch": 1.2747252747252746, "percentage": 42.49, "elapsed_time": "6:27:39", "remaining_time": "8:44:41"} | |
| {"current_steps": 698, "total_steps": 1638, "loss": 0.9081999063491821, "lr": 1.358059734674638e-05, "epoch": 1.2783882783882783, "percentage": 42.61, "elapsed_time": "6:28:53", "remaining_time": "8:43:43"} | |
| {"current_steps": 700, "total_steps": 1638, "loss": 0.6512075662612915, "lr": 1.3546462488507532e-05, "epoch": 1.282051282051282, "percentage": 42.74, "elapsed_time": "6:29:51", "remaining_time": "8:42:24"} | |
| {"current_steps": 702, "total_steps": 1638, "loss": 0.5245524644851685, "lr": 1.3512287764930102e-05, "epoch": 1.2857142857142856, "percentage": 42.86, "elapsed_time": "6:30:54", "remaining_time": "8:41:12"} | |
| {"current_steps": 704, "total_steps": 1638, "loss": 1.2959914207458496, "lr": 1.347807371102567e-05, "epoch": 1.2893772893772895, "percentage": 42.98, "elapsed_time": "6:31:56", "remaining_time": "8:40:00"} | |
| {"current_steps": 706, "total_steps": 1638, "loss": 0.9994240403175354, "lr": 1.3443820862421542e-05, "epoch": 1.293040293040293, "percentage": 43.1, "elapsed_time": "6:33:13", "remaining_time": "8:39:05"} | |
| {"current_steps": 708, "total_steps": 1638, "loss": 0.955507755279541, "lr": 1.3409529755352361e-05, "epoch": 1.2967032967032968, "percentage": 43.22, "elapsed_time": "6:34:10", "remaining_time": "8:37:46"} | |
| {"current_steps": 710, "total_steps": 1638, "loss": 0.5620253682136536, "lr": 1.3375200926651719e-05, "epoch": 1.3003663003663004, "percentage": 43.35, "elapsed_time": "6:35:00", "remaining_time": "8:36:17"} | |
| {"current_steps": 712, "total_steps": 1638, "loss": 0.8808104991912842, "lr": 1.3340834913743742e-05, "epoch": 1.304029304029304, "percentage": 43.47, "elapsed_time": "6:36:00", "remaining_time": "8:35:02"} | |
| {"current_steps": 714, "total_steps": 1638, "loss": 1.2082892656326294, "lr": 1.3306432254634676e-05, "epoch": 1.3076923076923077, "percentage": 43.59, "elapsed_time": "6:37:05", "remaining_time": "8:33:52"} | |
| {"current_steps": 716, "total_steps": 1638, "loss": 1.1394985914230347, "lr": 1.3271993487904485e-05, "epoch": 1.3113553113553114, "percentage": 43.71, "elapsed_time": "6:38:09", "remaining_time": "8:32:42"} | |
| {"current_steps": 718, "total_steps": 1638, "loss": 0.9257374405860901, "lr": 1.3237519152698392e-05, "epoch": 1.315018315018315, "percentage": 43.83, "elapsed_time": "6:39:07", "remaining_time": "8:31:24"} | |
| {"current_steps": 720, "total_steps": 1638, "loss": 0.92364901304245, "lr": 1.3203009788718454e-05, "epoch": 1.3186813186813187, "percentage": 43.96, "elapsed_time": "6:40:15", "remaining_time": "8:30:19"} | |
| {"current_steps": 722, "total_steps": 1638, "loss": 0.9131177067756653, "lr": 1.3168465936215114e-05, "epoch": 1.3223443223443223, "percentage": 44.08, "elapsed_time": "6:41:19", "remaining_time": "8:29:09"} | |
| {"current_steps": 724, "total_steps": 1638, "loss": 1.2074042558670044, "lr": 1.3133888135978733e-05, "epoch": 1.326007326007326, "percentage": 44.2, "elapsed_time": "6:42:26", "remaining_time": "8:28:03"} | |
| {"current_steps": 726, "total_steps": 1638, "loss": 1.0659313201904297, "lr": 1.3099276929331132e-05, "epoch": 1.3296703296703296, "percentage": 44.32, "elapsed_time": "6:43:29", "remaining_time": "8:26:52"} | |
| {"current_steps": 728, "total_steps": 1638, "loss": 1.1416211128234863, "lr": 1.3064632858117123e-05, "epoch": 1.3333333333333333, "percentage": 44.44, "elapsed_time": "6:44:41", "remaining_time": "8:25:51"} | |
| {"current_steps": 730, "total_steps": 1638, "loss": 0.7388544082641602, "lr": 1.3029956464696006e-05, "epoch": 1.3369963369963371, "percentage": 44.57, "elapsed_time": "6:45:44", "remaining_time": "8:24:40"} | |
| {"current_steps": 732, "total_steps": 1638, "loss": 0.8170838356018066, "lr": 1.2995248291933099e-05, "epoch": 1.3406593406593408, "percentage": 44.69, "elapsed_time": "6:46:47", "remaining_time": "8:23:29"} | |
| {"current_steps": 734, "total_steps": 1638, "loss": 0.6172389984130859, "lr": 1.296050888319123e-05, "epoch": 1.3443223443223444, "percentage": 44.81, "elapsed_time": "6:47:42", "remaining_time": "8:22:08"} | |
| {"current_steps": 736, "total_steps": 1638, "loss": 1.132319450378418, "lr": 1.2925738782322232e-05, "epoch": 1.347985347985348, "percentage": 44.93, "elapsed_time": "6:48:45", "remaining_time": "8:20:57"} | |
| {"current_steps": 738, "total_steps": 1638, "loss": 0.7506582736968994, "lr": 1.2890938533658429e-05, "epoch": 1.3516483516483517, "percentage": 45.05, "elapsed_time": "6:50:01", "remaining_time": "8:20:01"} | |
| {"current_steps": 740, "total_steps": 1638, "loss": 1.0084264278411865, "lr": 1.2856108682004116e-05, "epoch": 1.3553113553113554, "percentage": 45.18, "elapsed_time": "6:51:02", "remaining_time": "8:18:48"} | |
| {"current_steps": 742, "total_steps": 1638, "loss": 0.6014983057975769, "lr": 1.282124977262702e-05, "epoch": 1.358974358974359, "percentage": 45.3, "elapsed_time": "6:51:55", "remaining_time": "8:17:25"} | |
| {"current_steps": 744, "total_steps": 1638, "loss": 1.23367178440094, "lr": 1.2786362351249785e-05, "epoch": 1.3626373626373627, "percentage": 45.42, "elapsed_time": "6:53:00", "remaining_time": "8:16:16"} | |
| {"current_steps": 746, "total_steps": 1638, "loss": 1.0585216283798218, "lr": 1.2751446964041405e-05, "epoch": 1.3663003663003663, "percentage": 45.54, "elapsed_time": "6:54:02", "remaining_time": "8:15:04"} | |
| {"current_steps": 748, "total_steps": 1638, "loss": 1.06695556640625, "lr": 1.2716504157608693e-05, "epoch": 1.36996336996337, "percentage": 45.67, "elapsed_time": "6:54:59", "remaining_time": "8:13:46"} | |
| {"current_steps": 750, "total_steps": 1638, "loss": 0.7815660834312439, "lr": 1.2681534478987703e-05, "epoch": 1.3736263736263736, "percentage": 45.79, "elapsed_time": "6:56:02", "remaining_time": "8:12:35"} | |
| {"current_steps": 752, "total_steps": 1638, "loss": 1.120620608329773, "lr": 1.264653847563519e-05, "epoch": 1.3772893772893773, "percentage": 45.91, "elapsed_time": "6:57:01", "remaining_time": "8:11:19"} | |
| {"current_steps": 754, "total_steps": 1638, "loss": 0.9709092974662781, "lr": 1.2611516695420023e-05, "epoch": 1.380952380952381, "percentage": 46.03, "elapsed_time": "6:58:16", "remaining_time": "8:10:23"} | |
| {"current_steps": 756, "total_steps": 1638, "loss": 1.3016420602798462, "lr": 1.2576469686614608e-05, "epoch": 1.3846153846153846, "percentage": 46.15, "elapsed_time": "6:59:26", "remaining_time": "8:09:20"} | |
| {"current_steps": 758, "total_steps": 1638, "loss": 1.2032549381256104, "lr": 1.2541397997886317e-05, "epoch": 1.3882783882783882, "percentage": 46.28, "elapsed_time": "7:00:36", "remaining_time": "8:08:18"} | |
| {"current_steps": 760, "total_steps": 1638, "loss": 1.1462368965148926, "lr": 1.2506302178288887e-05, "epoch": 1.3919413919413919, "percentage": 46.4, "elapsed_time": "7:01:45", "remaining_time": "8:07:13"} | |
| {"current_steps": 762, "total_steps": 1638, "loss": 1.1458882093429565, "lr": 1.2471182777253832e-05, "epoch": 1.3956043956043955, "percentage": 46.52, "elapsed_time": "7:03:03", "remaining_time": "8:06:20"} | |
| {"current_steps": 764, "total_steps": 1638, "loss": 0.6942178606987, "lr": 1.2436040344581824e-05, "epoch": 1.3992673992673992, "percentage": 46.64, "elapsed_time": "7:04:06", "remaining_time": "8:05:09"} | |
| {"current_steps": 766, "total_steps": 1638, "loss": 0.8875712752342224, "lr": 1.2400875430434119e-05, "epoch": 1.4029304029304028, "percentage": 46.76, "elapsed_time": "7:05:15", "remaining_time": "8:04:05"} | |
| {"current_steps": 768, "total_steps": 1638, "loss": 0.8964008688926697, "lr": 1.236568858532391e-05, "epoch": 1.4065934065934065, "percentage": 46.89, "elapsed_time": "7:06:21", "remaining_time": "8:02:58"} | |
| {"current_steps": 770, "total_steps": 1638, "loss": 1.1805744171142578, "lr": 1.2330480360107728e-05, "epoch": 1.4102564102564101, "percentage": 47.01, "elapsed_time": "7:07:39", "remaining_time": "8:02:05"} | |
| {"current_steps": 772, "total_steps": 1638, "loss": 1.2107068300247192, "lr": 1.2295251305976818e-05, "epoch": 1.4139194139194138, "percentage": 47.13, "elapsed_time": "7:08:47", "remaining_time": "8:00:59"} | |
| {"current_steps": 774, "total_steps": 1638, "loss": 1.019040584564209, "lr": 1.2260001974448504e-05, "epoch": 1.4175824175824177, "percentage": 47.25, "elapsed_time": "7:09:57", "remaining_time": "7:59:57"} | |
| {"current_steps": 776, "total_steps": 1638, "loss": 1.1538594961166382, "lr": 1.222473291735754e-05, "epoch": 1.4212454212454213, "percentage": 47.37, "elapsed_time": "7:11:02", "remaining_time": "7:58:48"} | |
| {"current_steps": 778, "total_steps": 1638, "loss": 0.8257187604904175, "lr": 1.218944468684752e-05, "epoch": 1.424908424908425, "percentage": 47.5, "elapsed_time": "7:12:13", "remaining_time": "7:57:46"} | |
| {"current_steps": 780, "total_steps": 1638, "loss": 1.3644243478775024, "lr": 1.215413783536217e-05, "epoch": 1.4285714285714286, "percentage": 47.62, "elapsed_time": "7:13:22", "remaining_time": "7:56:42"} | |
| {"current_steps": 782, "total_steps": 1638, "loss": 1.2287310361862183, "lr": 1.2118812915636744e-05, "epoch": 1.4322344322344323, "percentage": 47.74, "elapsed_time": "7:14:33", "remaining_time": "7:55:40"} | |
| {"current_steps": 784, "total_steps": 1638, "loss": 1.1567542552947998, "lr": 1.2083470480689363e-05, "epoch": 1.435897435897436, "percentage": 47.86, "elapsed_time": "7:15:36", "remaining_time": "7:54:30"} | |
| {"current_steps": 786, "total_steps": 1638, "loss": 0.9774308800697327, "lr": 1.2048111083812342e-05, "epoch": 1.4395604395604396, "percentage": 47.99, "elapsed_time": "7:16:48", "remaining_time": "7:53:29"} | |
| {"current_steps": 788, "total_steps": 1638, "loss": 1.1295884847640991, "lr": 1.2012735278563546e-05, "epoch": 1.4432234432234432, "percentage": 48.11, "elapsed_time": "7:17:55", "remaining_time": "7:52:22"} | |
| {"current_steps": 790, "total_steps": 1638, "loss": 0.7207637429237366, "lr": 1.1977343618757702e-05, "epoch": 1.4468864468864469, "percentage": 48.23, "elapsed_time": "7:18:52", "remaining_time": "7:51:05"} | |
| {"current_steps": 792, "total_steps": 1638, "loss": 0.9219919443130493, "lr": 1.1941936658457769e-05, "epoch": 1.4505494505494505, "percentage": 48.35, "elapsed_time": "7:19:58", "remaining_time": "7:49:58"} | |
| {"current_steps": 794, "total_steps": 1638, "loss": 0.8157789707183838, "lr": 1.1906514951966208e-05, "epoch": 1.4542124542124542, "percentage": 48.47, "elapsed_time": "7:21:11", "remaining_time": "7:48:58"} | |
| {"current_steps": 796, "total_steps": 1638, "loss": 1.1426329612731934, "lr": 1.1871079053816357e-05, "epoch": 1.4578754578754578, "percentage": 48.6, "elapsed_time": "7:22:19", "remaining_time": "7:47:53"} | |
| {"current_steps": 798, "total_steps": 1638, "loss": 0.7938690781593323, "lr": 1.1835629518763714e-05, "epoch": 1.4615384615384617, "percentage": 48.72, "elapsed_time": "7:23:26", "remaining_time": "7:46:46"} | |
| {"current_steps": 800, "total_steps": 1638, "loss": 1.024507999420166, "lr": 1.1800166901777272e-05, "epoch": 1.4652014652014653, "percentage": 48.84, "elapsed_time": "7:24:25", "remaining_time": "7:45:32"} | |
| {"current_steps": 802, "total_steps": 1638, "loss": 1.5597076416015625, "lr": 1.1764691758030825e-05, "epoch": 1.468864468864469, "percentage": 48.96, "elapsed_time": "7:25:37", "remaining_time": "7:44:31"} | |
| {"current_steps": 804, "total_steps": 1638, "loss": 1.0233888626098633, "lr": 1.1729204642894265e-05, "epoch": 1.4725274725274726, "percentage": 49.08, "elapsed_time": "7:26:44", "remaining_time": "7:43:25"} | |
| {"current_steps": 806, "total_steps": 1638, "loss": 1.1873747110366821, "lr": 1.1693706111924912e-05, "epoch": 1.4761904761904763, "percentage": 49.21, "elapsed_time": "7:27:54", "remaining_time": "7:42:21"} | |
| {"current_steps": 808, "total_steps": 1638, "loss": 1.1727930307388306, "lr": 1.1658196720858794e-05, "epoch": 1.47985347985348, "percentage": 49.33, "elapsed_time": "7:29:06", "remaining_time": "7:41:20"} | |
| {"current_steps": 810, "total_steps": 1638, "loss": 1.0092246532440186, "lr": 1.1622677025601966e-05, "epoch": 1.4835164835164836, "percentage": 49.45, "elapsed_time": "7:30:19", "remaining_time": "7:40:19"} | |
| {"current_steps": 812, "total_steps": 1638, "loss": 0.8401330709457397, "lr": 1.1587147582221776e-05, "epoch": 1.4871794871794872, "percentage": 49.57, "elapsed_time": "7:31:25", "remaining_time": "7:39:12"} | |
| {"current_steps": 814, "total_steps": 1638, "loss": 1.2012218236923218, "lr": 1.1551608946938208e-05, "epoch": 1.4908424908424909, "percentage": 49.69, "elapsed_time": "7:32:34", "remaining_time": "7:38:07"} | |
| {"current_steps": 816, "total_steps": 1638, "loss": 1.137012004852295, "lr": 1.1516061676115124e-05, "epoch": 1.4945054945054945, "percentage": 49.82, "elapsed_time": "7:33:43", "remaining_time": "7:37:04"} | |
| {"current_steps": 818, "total_steps": 1638, "loss": 0.4151400327682495, "lr": 1.1480506326251595e-05, "epoch": 1.4981684981684982, "percentage": 49.94, "elapsed_time": "7:34:44", "remaining_time": "7:35:50"} | |
| {"current_steps": 820, "total_steps": 1638, "loss": 1.1498603820800781, "lr": 1.1444943453973155e-05, "epoch": 1.5018315018315018, "percentage": 50.06, "elapsed_time": "7:35:50", "remaining_time": "7:34:43"} | |
| {"current_steps": 822, "total_steps": 1638, "loss": 0.9069132804870605, "lr": 1.1409373616023111e-05, "epoch": 1.5054945054945055, "percentage": 50.18, "elapsed_time": "7:36:58", "remaining_time": "7:33:37"} | |
| {"current_steps": 824, "total_steps": 1638, "loss": 0.828985869884491, "lr": 1.1373797369253818e-05, "epoch": 1.5091575091575091, "percentage": 50.31, "elapsed_time": "7:38:04", "remaining_time": "7:32:31"} | |
| {"current_steps": 826, "total_steps": 1638, "loss": 1.0592471361160278, "lr": 1.1338215270617967e-05, "epoch": 1.5128205128205128, "percentage": 50.43, "elapsed_time": "7:39:11", "remaining_time": "7:31:24"} | |
| {"current_steps": 828, "total_steps": 1638, "loss": 1.0117770433425903, "lr": 1.130262787715985e-05, "epoch": 1.5164835164835164, "percentage": 50.55, "elapsed_time": "7:40:09", "remaining_time": "7:30:09"} | |
| {"current_steps": 830, "total_steps": 1638, "loss": 1.012010931968689, "lr": 1.1267035746006658e-05, "epoch": 1.52014652014652, "percentage": 50.67, "elapsed_time": "7:41:17", "remaining_time": "7:29:03"} | |
| {"current_steps": 832, "total_steps": 1638, "loss": 1.2559067010879517, "lr": 1.1231439434359755e-05, "epoch": 1.5238095238095237, "percentage": 50.79, "elapsed_time": "7:42:23", "remaining_time": "7:27:56"} | |
| {"current_steps": 834, "total_steps": 1638, "loss": 0.8118237853050232, "lr": 1.119583949948594e-05, "epoch": 1.5274725274725274, "percentage": 50.92, "elapsed_time": "7:43:37", "remaining_time": "7:26:57"} | |
| {"current_steps": 836, "total_steps": 1638, "loss": 1.156209111213684, "lr": 1.1160236498708742e-05, "epoch": 1.531135531135531, "percentage": 51.04, "elapsed_time": "7:44:49", "remaining_time": "7:25:55"} | |
| {"current_steps": 838, "total_steps": 1638, "loss": 1.207381248474121, "lr": 1.112463098939969e-05, "epoch": 1.5347985347985347, "percentage": 51.16, "elapsed_time": "7:45:53", "remaining_time": "7:24:45"} | |
| {"current_steps": 840, "total_steps": 1638, "loss": 1.0491501092910767, "lr": 1.1089023528969576e-05, "epoch": 1.5384615384615383, "percentage": 51.28, "elapsed_time": "7:46:58", "remaining_time": "7:23:37"} | |
| {"current_steps": 842, "total_steps": 1638, "loss": 0.872413694858551, "lr": 1.1053414674859741e-05, "epoch": 1.542124542124542, "percentage": 51.4, "elapsed_time": "7:47:59", "remaining_time": "7:22:25"} | |
| {"current_steps": 844, "total_steps": 1638, "loss": 1.038373589515686, "lr": 1.1017804984533351e-05, "epoch": 1.5457875457875456, "percentage": 51.53, "elapsed_time": "7:49:07", "remaining_time": "7:21:20"} | |
| {"current_steps": 846, "total_steps": 1638, "loss": 1.182910680770874, "lr": 1.0982195015466652e-05, "epoch": 1.5494505494505495, "percentage": 51.65, "elapsed_time": "7:50:15", "remaining_time": "7:20:14"} | |
| {"current_steps": 848, "total_steps": 1638, "loss": 0.5585577487945557, "lr": 1.0946585325140261e-05, "epoch": 1.5531135531135531, "percentage": 51.77, "elapsed_time": "7:51:17", "remaining_time": "7:19:03"} | |
| {"current_steps": 850, "total_steps": 1638, "loss": 1.0341525077819824, "lr": 1.0910976471030428e-05, "epoch": 1.5567765567765568, "percentage": 51.89, "elapsed_time": "7:52:32", "remaining_time": "7:18:04"} | |
| {"current_steps": 852, "total_steps": 1638, "loss": 1.1761656999588013, "lr": 1.0875369010600317e-05, "epoch": 1.5604395604395604, "percentage": 52.01, "elapsed_time": "7:53:45", "remaining_time": "7:17:03"} | |
| {"current_steps": 854, "total_steps": 1638, "loss": 0.8638155460357666, "lr": 1.083976350129126e-05, "epoch": 1.564102564102564, "percentage": 52.14, "elapsed_time": "7:54:53", "remaining_time": "7:15:57"} | |
| {"current_steps": 856, "total_steps": 1638, "loss": 0.8107349872589111, "lr": 1.0804160500514062e-05, "epoch": 1.5677655677655677, "percentage": 52.26, "elapsed_time": "7:55:59", "remaining_time": "7:14:50"} | |
| {"current_steps": 858, "total_steps": 1638, "loss": 0.9437478184700012, "lr": 1.0768560565640252e-05, "epoch": 1.5714285714285714, "percentage": 52.38, "elapsed_time": "7:57:13", "remaining_time": "7:13:50"} | |
| {"current_steps": 860, "total_steps": 1638, "loss": 0.7484935522079468, "lr": 1.0732964253993343e-05, "epoch": 1.575091575091575, "percentage": 52.5, "elapsed_time": "7:58:09", "remaining_time": "7:12:34"} | |
| {"current_steps": 862, "total_steps": 1638, "loss": 1.1590977907180786, "lr": 1.0697372122840156e-05, "epoch": 1.578754578754579, "percentage": 52.63, "elapsed_time": "7:59:20", "remaining_time": "7:11:31"} | |
| {"current_steps": 864, "total_steps": 1638, "loss": 0.9178829789161682, "lr": 1.0661784729382036e-05, "epoch": 1.5824175824175826, "percentage": 52.75, "elapsed_time": "8:00:25", "remaining_time": "7:10:23"} | |
| {"current_steps": 866, "total_steps": 1638, "loss": 1.1874239444732666, "lr": 1.0626202630746183e-05, "epoch": 1.5860805860805862, "percentage": 52.87, "elapsed_time": "8:01:45", "remaining_time": "7:09:28"} | |
| {"current_steps": 868, "total_steps": 1638, "loss": 1.1853182315826416, "lr": 1.0590626383976894e-05, "epoch": 1.5897435897435899, "percentage": 52.99, "elapsed_time": "8:02:56", "remaining_time": "7:08:24"} | |
| {"current_steps": 870, "total_steps": 1638, "loss": 0.5874127149581909, "lr": 1.055505654602685e-05, "epoch": 1.5934065934065935, "percentage": 53.11, "elapsed_time": "8:03:54", "remaining_time": "7:07:10"} | |
| {"current_steps": 872, "total_steps": 1638, "loss": 1.1814969778060913, "lr": 1.0519493673748406e-05, "epoch": 1.5970695970695972, "percentage": 53.24, "elapsed_time": "8:05:05", "remaining_time": "7:06:07"} | |
| {"current_steps": 874, "total_steps": 1638, "loss": 1.0709137916564941, "lr": 1.0483938323884879e-05, "epoch": 1.6007326007326008, "percentage": 53.36, "elapsed_time": "8:06:19", "remaining_time": "7:05:07"} | |
| {"current_steps": 876, "total_steps": 1638, "loss": 0.7793064117431641, "lr": 1.0448391053061795e-05, "epoch": 1.6043956043956045, "percentage": 53.48, "elapsed_time": "8:07:16", "remaining_time": "7:03:51"} | |
| {"current_steps": 878, "total_steps": 1638, "loss": 0.9944717884063721, "lr": 1.0412852417778225e-05, "epoch": 1.6080586080586081, "percentage": 53.6, "elapsed_time": "8:08:16", "remaining_time": "7:02:38"} | |
| {"current_steps": 880, "total_steps": 1638, "loss": 0.9063097834587097, "lr": 1.037732297439804e-05, "epoch": 1.6117216117216118, "percentage": 53.72, "elapsed_time": "8:09:09", "remaining_time": "7:01:20"} | |
| {"current_steps": 882, "total_steps": 1638, "loss": 1.1387708187103271, "lr": 1.034180327914121e-05, "epoch": 1.6153846153846154, "percentage": 53.85, "elapsed_time": "8:10:27", "remaining_time": "7:00:23"} | |
| {"current_steps": 884, "total_steps": 1638, "loss": 0.9485580921173096, "lr": 1.030629388807509e-05, "epoch": 1.619047619047619, "percentage": 53.97, "elapsed_time": "8:11:23", "remaining_time": "6:59:08"} | |
| {"current_steps": 886, "total_steps": 1638, "loss": 1.145193099975586, "lr": 1.0270795357105738e-05, "epoch": 1.6227106227106227, "percentage": 54.09, "elapsed_time": "8:12:31", "remaining_time": "6:58:01"} | |
| {"current_steps": 888, "total_steps": 1638, "loss": 0.8466076254844666, "lr": 1.023530824196918e-05, "epoch": 1.6263736263736264, "percentage": 54.21, "elapsed_time": "8:13:38", "remaining_time": "6:56:55"} | |
| {"current_steps": 890, "total_steps": 1638, "loss": 1.2269943952560425, "lr": 1.019983309822273e-05, "epoch": 1.63003663003663, "percentage": 54.33, "elapsed_time": "8:14:39", "remaining_time": "6:55:44"} | |
| {"current_steps": 892, "total_steps": 1638, "loss": 0.9066356420516968, "lr": 1.0164370481236292e-05, "epoch": 1.6336996336996337, "percentage": 54.46, "elapsed_time": "8:15:39", "remaining_time": "6:54:31"} | |
| {"current_steps": 894, "total_steps": 1638, "loss": 1.1919015645980835, "lr": 1.0128920946183646e-05, "epoch": 1.6373626373626373, "percentage": 54.58, "elapsed_time": "8:16:49", "remaining_time": "6:53:28"} | |
| {"current_steps": 896, "total_steps": 1638, "loss": 0.7559452652931213, "lr": 1.0093485048033798e-05, "epoch": 1.641025641025641, "percentage": 54.7, "elapsed_time": "8:17:46", "remaining_time": "6:52:12"} | |
| {"current_steps": 898, "total_steps": 1638, "loss": 1.141265869140625, "lr": 1.0058063341542238e-05, "epoch": 1.6446886446886446, "percentage": 54.82, "elapsed_time": "8:19:01", "remaining_time": "6:51:13"} | |
| {"current_steps": 900, "total_steps": 1638, "loss": 0.835241973400116, "lr": 1.0022656381242297e-05, "epoch": 1.6483516483516483, "percentage": 54.95, "elapsed_time": "8:19:58", "remaining_time": "6:49:59"} | |
| {"current_steps": 902, "total_steps": 1638, "loss": 0.8866770267486572, "lr": 9.98726472143646e-06, "epoch": 1.652014652014652, "percentage": 55.07, "elapsed_time": "8:21:05", "remaining_time": "6:48:52"} | |
| {"current_steps": 904, "total_steps": 1638, "loss": 1.027202844619751, "lr": 9.951888916187662e-06, "epoch": 1.6556776556776556, "percentage": 55.19, "elapsed_time": "8:22:15", "remaining_time": "6:47:48"} | |
| {"current_steps": 906, "total_steps": 1638, "loss": 1.1409128904342651, "lr": 9.916529519310638e-06, "epoch": 1.6593406593406592, "percentage": 55.31, "elapsed_time": "8:23:28", "remaining_time": "6:46:46"} | |
| {"current_steps": 908, "total_steps": 1638, "loss": 0.7516112923622131, "lr": 9.881187084363257e-06, "epoch": 1.6630036630036629, "percentage": 55.43, "elapsed_time": "8:24:25", "remaining_time": "6:45:32"} | |
| {"current_steps": 910, "total_steps": 1638, "loss": 1.2104125022888184, "lr": 9.845862164637834e-06, "epoch": 1.6666666666666665, "percentage": 55.56, "elapsed_time": "8:25:32", "remaining_time": "6:44:25"} | |
| {"current_steps": 912, "total_steps": 1638, "loss": 1.2229918241500854, "lr": 9.810555313152486e-06, "epoch": 1.6703296703296702, "percentage": 55.68, "elapsed_time": "8:26:48", "remaining_time": "6:43:26"} | |
| {"current_steps": 914, "total_steps": 1638, "loss": 1.1026571989059448, "lr": 9.775267082642461e-06, "epoch": 1.673992673992674, "percentage": 55.8, "elapsed_time": "8:28:08", "remaining_time": "6:42:30"} | |
| {"current_steps": 916, "total_steps": 1638, "loss": 1.2173269987106323, "lr": 9.7399980255515e-06, "epoch": 1.6776556776556777, "percentage": 55.92, "elapsed_time": "8:29:12", "remaining_time": "6:41:22"} | |
| {"current_steps": 918, "total_steps": 1638, "loss": 0.8437496423721313, "lr": 9.704748694023183e-06, "epoch": 1.6813186813186813, "percentage": 56.04, "elapsed_time": "8:30:19", "remaining_time": "6:40:14"} | |
| {"current_steps": 920, "total_steps": 1638, "loss": 1.2237019538879395, "lr": 9.669519639892275e-06, "epoch": 1.684981684981685, "percentage": 56.17, "elapsed_time": "8:31:28", "remaining_time": "6:39:10"} | |
| {"current_steps": 922, "total_steps": 1638, "loss": 1.0298762321472168, "lr": 9.634311414676096e-06, "epoch": 1.6886446886446886, "percentage": 56.29, "elapsed_time": "8:32:42", "remaining_time": "6:38:09"} | |
| {"current_steps": 924, "total_steps": 1638, "loss": 0.8457880616188049, "lr": 9.599124569565887e-06, "epoch": 1.6923076923076923, "percentage": 56.41, "elapsed_time": "8:33:39", "remaining_time": "6:36:55"} | |
| {"current_steps": 926, "total_steps": 1638, "loss": 0.9856649041175842, "lr": 9.56395965541818e-06, "epoch": 1.695970695970696, "percentage": 56.53, "elapsed_time": "8:34:42", "remaining_time": "6:35:45"} | |
| {"current_steps": 928, "total_steps": 1638, "loss": 0.8122522234916687, "lr": 9.528817222746171e-06, "epoch": 1.6996336996336996, "percentage": 56.65, "elapsed_time": "8:35:48", "remaining_time": "6:34:38"} | |
| {"current_steps": 930, "total_steps": 1638, "loss": 0.9117051362991333, "lr": 9.493697821711116e-06, "epoch": 1.7032967032967035, "percentage": 56.78, "elapsed_time": "8:36:52", "remaining_time": "6:33:29"} | |
| {"current_steps": 932, "total_steps": 1638, "loss": 0.9280415773391724, "lr": 9.458602002113684e-06, "epoch": 1.7069597069597071, "percentage": 56.9, "elapsed_time": "8:37:59", "remaining_time": "6:32:23"} | |
| {"current_steps": 934, "total_steps": 1638, "loss": 1.3979382514953613, "lr": 9.423530313385395e-06, "epoch": 1.7106227106227108, "percentage": 57.02, "elapsed_time": "8:39:08", "remaining_time": "6:31:18"} | |
| {"current_steps": 936, "total_steps": 1638, "loss": 1.1958733797073364, "lr": 9.388483304579983e-06, "epoch": 1.7142857142857144, "percentage": 57.14, "elapsed_time": "8:40:21", "remaining_time": "6:30:16"} | |
| {"current_steps": 938, "total_steps": 1638, "loss": 0.48058995604515076, "lr": 9.353461524364814e-06, "epoch": 1.717948717948718, "percentage": 57.26, "elapsed_time": "8:41:30", "remaining_time": "6:29:11"} | |
| {"current_steps": 940, "total_steps": 1638, "loss": 0.556159257888794, "lr": 9.318465521012298e-06, "epoch": 1.7216117216117217, "percentage": 57.39, "elapsed_time": "8:42:24", "remaining_time": "6:27:54"} | |
| {"current_steps": 942, "total_steps": 1638, "loss": 1.130286693572998, "lr": 9.283495842391313e-06, "epoch": 1.7252747252747254, "percentage": 57.51, "elapsed_time": "8:43:37", "remaining_time": "6:26:52"} | |
| {"current_steps": 944, "total_steps": 1638, "loss": 0.9355916380882263, "lr": 9.248553035958596e-06, "epoch": 1.728937728937729, "percentage": 57.63, "elapsed_time": "8:44:42", "remaining_time": "6:25:44"} | |
| {"current_steps": 946, "total_steps": 1638, "loss": 1.1549943685531616, "lr": 9.213637648750217e-06, "epoch": 1.7326007326007327, "percentage": 57.75, "elapsed_time": "8:45:55", "remaining_time": "6:24:43"} | |
| {"current_steps": 948, "total_steps": 1638, "loss": 1.100421667098999, "lr": 9.178750227372983e-06, "epoch": 1.7362637362637363, "percentage": 57.88, "elapsed_time": "8:46:57", "remaining_time": "6:23:32"} | |
| {"current_steps": 950, "total_steps": 1638, "loss": 0.9944745302200317, "lr": 9.143891317995888e-06, "epoch": 1.73992673992674, "percentage": 58.0, "elapsed_time": "8:47:59", "remaining_time": "6:22:22"} | |
| {"current_steps": 952, "total_steps": 1638, "loss": 0.9477764368057251, "lr": 9.109061466341576e-06, "epoch": 1.7435897435897436, "percentage": 58.12, "elapsed_time": "8:49:02", "remaining_time": "6:21:13"} | |
| {"current_steps": 954, "total_steps": 1638, "loss": 1.2121840715408325, "lr": 9.074261217677771e-06, "epoch": 1.7472527472527473, "percentage": 58.24, "elapsed_time": "8:50:19", "remaining_time": "6:20:13"} | |
| {"current_steps": 956, "total_steps": 1638, "loss": 0.7902039885520935, "lr": 9.039491116808773e-06, "epoch": 1.750915750915751, "percentage": 58.36, "elapsed_time": "8:51:13", "remaining_time": "6:18:58"} | |
| {"current_steps": 958, "total_steps": 1638, "loss": 1.1979793310165405, "lr": 9.004751708066906e-06, "epoch": 1.7545787545787546, "percentage": 58.49, "elapsed_time": "8:52:20", "remaining_time": "6:17:51"} | |
| {"current_steps": 960, "total_steps": 1638, "loss": 0.5474309325218201, "lr": 8.970043535303999e-06, "epoch": 1.7582417582417582, "percentage": 58.61, "elapsed_time": "8:53:05", "remaining_time": "6:16:30"} | |
| {"current_steps": 962, "total_steps": 1638, "loss": 0.9992084503173828, "lr": 8.93536714188288e-06, "epoch": 1.7619047619047619, "percentage": 58.73, "elapsed_time": "8:54:12", "remaining_time": "6:15:23"} | |
| {"current_steps": 964, "total_steps": 1638, "loss": 1.0385854244232178, "lr": 8.900723070668869e-06, "epoch": 1.7655677655677655, "percentage": 58.85, "elapsed_time": "8:55:19", "remaining_time": "6:14:16"} | |
| {"current_steps": 966, "total_steps": 1638, "loss": 1.1807163953781128, "lr": 8.86611186402127e-06, "epoch": 1.7692307692307692, "percentage": 58.97, "elapsed_time": "8:56:26", "remaining_time": "6:13:10"} | |
| {"current_steps": 968, "total_steps": 1638, "loss": 0.5750354528427124, "lr": 8.831534063784891e-06, "epoch": 1.7728937728937728, "percentage": 59.1, "elapsed_time": "8:57:28", "remaining_time": "6:12:01"} | |
| {"current_steps": 970, "total_steps": 1638, "loss": 0.8479418158531189, "lr": 8.796990211281549e-06, "epoch": 1.7765567765567765, "percentage": 59.22, "elapsed_time": "8:58:40", "remaining_time": "6:10:57"} | |
| {"current_steps": 972, "total_steps": 1638, "loss": 0.9116554856300354, "lr": 8.76248084730161e-06, "epoch": 1.7802197802197801, "percentage": 59.34, "elapsed_time": "8:59:34", "remaining_time": "6:09:42"} | |
| {"current_steps": 974, "total_steps": 1638, "loss": 1.2301732301712036, "lr": 8.728006512095517e-06, "epoch": 1.7838827838827838, "percentage": 59.46, "elapsed_time": "9:00:41", "remaining_time": "6:08:35"} | |
| {"current_steps": 976, "total_steps": 1638, "loss": 1.1915199756622314, "lr": 8.693567745365325e-06, "epoch": 1.7875457875457874, "percentage": 59.58, "elapsed_time": "9:01:40", "remaining_time": "6:07:24"} | |
| {"current_steps": 978, "total_steps": 1638, "loss": 0.9201015830039978, "lr": 8.659165086256263e-06, "epoch": 1.791208791208791, "percentage": 59.71, "elapsed_time": "9:02:50", "remaining_time": "6:06:20"} | |
| {"current_steps": 980, "total_steps": 1638, "loss": 0.9540326595306396, "lr": 8.624799073348282e-06, "epoch": 1.7948717948717947, "percentage": 59.83, "elapsed_time": "9:03:54", "remaining_time": "6:05:11"} | |
| {"current_steps": 982, "total_steps": 1638, "loss": 1.1440948247909546, "lr": 8.590470244647643e-06, "epoch": 1.7985347985347986, "percentage": 59.95, "elapsed_time": "9:05:03", "remaining_time": "6:04:06"} | |
| {"current_steps": 984, "total_steps": 1638, "loss": 1.100319504737854, "lr": 8.556179137578461e-06, "epoch": 1.8021978021978022, "percentage": 60.07, "elapsed_time": "9:05:58", "remaining_time": "6:02:52"} | |
| {"current_steps": 986, "total_steps": 1638, "loss": 0.6495481133460999, "lr": 8.521926288974336e-06, "epoch": 1.8058608058608059, "percentage": 60.2, "elapsed_time": "9:06:59", "remaining_time": "6:01:42"} | |
| {"current_steps": 988, "total_steps": 1638, "loss": 0.8149735927581787, "lr": 8.487712235069901e-06, "epoch": 1.8095238095238095, "percentage": 60.32, "elapsed_time": "9:08:05", "remaining_time": "6:00:35"} | |
| {"current_steps": 990, "total_steps": 1638, "loss": 0.7469933032989502, "lr": 8.453537511492469e-06, "epoch": 1.8131868131868132, "percentage": 60.44, "elapsed_time": "9:09:03", "remaining_time": "5:59:23"} | |
| {"current_steps": 992, "total_steps": 1638, "loss": 0.7891843914985657, "lr": 8.419402653253623e-06, "epoch": 1.8168498168498168, "percentage": 60.56, "elapsed_time": "9:10:10", "remaining_time": "5:58:16"} | |
| {"current_steps": 994, "total_steps": 1638, "loss": 0.8571860790252686, "lr": 8.385308194740846e-06, "epoch": 1.8205128205128205, "percentage": 60.68, "elapsed_time": "9:11:25", "remaining_time": "5:57:15"} | |
| {"current_steps": 996, "total_steps": 1638, "loss": 1.1353570222854614, "lr": 8.35125466970915e-06, "epoch": 1.8241758241758241, "percentage": 60.81, "elapsed_time": "9:12:27", "remaining_time": "5:56:06"} | |
| {"current_steps": 998, "total_steps": 1638, "loss": 0.8522858023643494, "lr": 8.317242611272745e-06, "epoch": 1.8278388278388278, "percentage": 60.93, "elapsed_time": "9:13:34", "remaining_time": "5:55:00"} | |
| {"current_steps": 1000, "total_steps": 1638, "loss": 1.1177325248718262, "lr": 8.283272551896649e-06, "epoch": 1.8315018315018317, "percentage": 61.05, "elapsed_time": "9:14:47", "remaining_time": "5:53:57"} | |
| {"current_steps": 1002, "total_steps": 1638, "loss": 1.145124912261963, "lr": 8.249345023388393e-06, "epoch": 1.8351648351648353, "percentage": 61.17, "elapsed_time": "9:16:07", "remaining_time": "5:52:59"} | |
| {"current_steps": 1004, "total_steps": 1638, "loss": 1.1394211053848267, "lr": 8.21546055688968e-06, "epoch": 1.838827838827839, "percentage": 61.29, "elapsed_time": "9:17:20", "remaining_time": "5:51:56"} | |
| {"current_steps": 1006, "total_steps": 1638, "loss": 1.1420109272003174, "lr": 8.181619682868059e-06, "epoch": 1.8424908424908426, "percentage": 61.42, "elapsed_time": "9:18:31", "remaining_time": "5:50:52"} | |
| {"current_steps": 1008, "total_steps": 1638, "loss": 0.7952710390090942, "lr": 8.147822931108638e-06, "epoch": 1.8461538461538463, "percentage": 61.54, "elapsed_time": "9:19:37", "remaining_time": "5:49:46"} | |
| {"current_steps": 1010, "total_steps": 1638, "loss": 1.1106821298599243, "lr": 8.114070830705785e-06, "epoch": 1.84981684981685, "percentage": 61.66, "elapsed_time": "9:21:00", "remaining_time": "5:48:49"} | |
| {"current_steps": 1012, "total_steps": 1638, "loss": 0.7631734609603882, "lr": 8.080363910054833e-06, "epoch": 1.8534798534798536, "percentage": 61.78, "elapsed_time": "9:22:02", "remaining_time": "5:47:40"} | |
| {"current_steps": 1014, "total_steps": 1638, "loss": 1.162695288658142, "lr": 8.04670269684383e-06, "epoch": 1.8571428571428572, "percentage": 61.9, "elapsed_time": "9:23:16", "remaining_time": "5:46:37"} | |
| {"current_steps": 1016, "total_steps": 1638, "loss": 1.1480703353881836, "lr": 8.013087718045256e-06, "epoch": 1.8608058608058609, "percentage": 62.03, "elapsed_time": "9:24:27", "remaining_time": "5:45:34"} | |
| {"current_steps": 1018, "total_steps": 1638, "loss": 1.2293591499328613, "lr": 7.979519499907786e-06, "epoch": 1.8644688644688645, "percentage": 62.15, "elapsed_time": "9:25:44", "remaining_time": "5:44:33"} | |
| {"current_steps": 1020, "total_steps": 1638, "loss": 0.9643331170082092, "lr": 7.945998567948052e-06, "epoch": 1.8681318681318682, "percentage": 62.27, "elapsed_time": "9:26:42", "remaining_time": "5:43:21"} | |
| {"current_steps": 1022, "total_steps": 1638, "loss": 1.1193994283676147, "lr": 7.912525446942406e-06, "epoch": 1.8717948717948718, "percentage": 62.39, "elapsed_time": "9:27:49", "remaining_time": "5:42:14"} | |
| {"current_steps": 1024, "total_steps": 1638, "loss": 0.5604696869850159, "lr": 7.879100660918713e-06, "epoch": 1.8754578754578755, "percentage": 62.52, "elapsed_time": "9:28:42", "remaining_time": "5:41:00"} | |
| {"current_steps": 1026, "total_steps": 1638, "loss": 1.1571592092514038, "lr": 7.845724733148149e-06, "epoch": 1.879120879120879, "percentage": 62.64, "elapsed_time": "9:29:56", "remaining_time": "5:39:58"} | |
| {"current_steps": 1028, "total_steps": 1638, "loss": 0.954494059085846, "lr": 7.812398186136994e-06, "epoch": 1.8827838827838828, "percentage": 62.76, "elapsed_time": "9:31:00", "remaining_time": "5:38:49"} | |
| {"current_steps": 1030, "total_steps": 1638, "loss": 1.153045892715454, "lr": 7.779121541618478e-06, "epoch": 1.8864468864468864, "percentage": 62.88, "elapsed_time": "9:32:07", "remaining_time": "5:37:43"} | |
| {"current_steps": 1032, "total_steps": 1638, "loss": 0.9504430890083313, "lr": 7.74589532054459e-06, "epoch": 1.89010989010989, "percentage": 63.0, "elapsed_time": "9:33:15", "remaining_time": "5:36:37"} | |
| {"current_steps": 1034, "total_steps": 1638, "loss": 0.8016744256019592, "lr": 7.712720043077929e-06, "epoch": 1.8937728937728937, "percentage": 63.13, "elapsed_time": "9:34:32", "remaining_time": "5:35:36"} | |
| {"current_steps": 1036, "total_steps": 1638, "loss": 1.1903191804885864, "lr": 7.679596228583563e-06, "epoch": 1.8974358974358974, "percentage": 63.25, "elapsed_time": "9:35:42", "remaining_time": "5:34:32"} | |
| {"current_steps": 1038, "total_steps": 1638, "loss": 1.157071828842163, "lr": 7.646524395620908e-06, "epoch": 1.901098901098901, "percentage": 63.37, "elapsed_time": "9:36:44", "remaining_time": "5:33:22"} | |
| {"current_steps": 1040, "total_steps": 1638, "loss": 1.2254270315170288, "lr": 7.613505061935584e-06, "epoch": 1.9047619047619047, "percentage": 63.49, "elapsed_time": "9:37:55", "remaining_time": "5:32:18"} | |
| {"current_steps": 1042, "total_steps": 1638, "loss": 0.6230685710906982, "lr": 7.580538744451336e-06, "epoch": 1.9084249084249083, "percentage": 63.61, "elapsed_time": "9:39:07", "remaining_time": "5:31:14"} | |
| {"current_steps": 1044, "total_steps": 1638, "loss": 0.8747984766960144, "lr": 7.547625959261928e-06, "epoch": 1.912087912087912, "percentage": 63.74, "elapsed_time": "9:40:06", "remaining_time": "5:30:03"} | |
| {"current_steps": 1046, "total_steps": 1638, "loss": 1.117228388786316, "lr": 7.5147672216230605e-06, "epoch": 1.9157509157509156, "percentage": 63.86, "elapsed_time": "9:41:15", "remaining_time": "5:28:58"} | |
| {"current_steps": 1048, "total_steps": 1638, "loss": 0.45573827624320984, "lr": 7.481963045944318e-06, "epoch": 1.9194139194139193, "percentage": 63.98, "elapsed_time": "9:42:10", "remaining_time": "5:27:44"} | |
| {"current_steps": 1050, "total_steps": 1638, "loss": 0.8882296085357666, "lr": 7.449213945781102e-06, "epoch": 1.9230769230769231, "percentage": 64.1, "elapsed_time": "9:43:29", "remaining_time": "5:26:45"} | |
| {"current_steps": 1052, "total_steps": 1638, "loss": 0.8158991932868958, "lr": 7.416520433826599e-06, "epoch": 1.9267399267399268, "percentage": 64.22, "elapsed_time": "9:44:38", "remaining_time": "5:25:39"} | |
| {"current_steps": 1054, "total_steps": 1638, "loss": 1.1231168508529663, "lr": 7.383883021903755e-06, "epoch": 1.9304029304029304, "percentage": 64.35, "elapsed_time": "9:45:50", "remaining_time": "5:24:35"} | |
| {"current_steps": 1056, "total_steps": 1638, "loss": 0.748049259185791, "lr": 7.351302220957251e-06, "epoch": 1.934065934065934, "percentage": 64.47, "elapsed_time": "9:46:58", "remaining_time": "5:23:30"} | |
| {"current_steps": 1058, "total_steps": 1638, "loss": 0.9868760704994202, "lr": 7.318778541045517e-06, "epoch": 1.9377289377289377, "percentage": 64.59, "elapsed_time": "9:48:20", "remaining_time": "5:22:32"} | |
| {"current_steps": 1060, "total_steps": 1638, "loss": 1.0847361087799072, "lr": 7.286312491332754e-06, "epoch": 1.9413919413919414, "percentage": 64.71, "elapsed_time": "9:49:32", "remaining_time": "5:21:27"} | |
| {"current_steps": 1062, "total_steps": 1638, "loss": 0.84217369556427, "lr": 7.253904580080926e-06, "epoch": 1.945054945054945, "percentage": 64.84, "elapsed_time": "9:50:28", "remaining_time": "5:20:15"} | |
| {"current_steps": 1064, "total_steps": 1638, "loss": 0.8080073595046997, "lr": 7.221555314641853e-06, "epoch": 1.9487179487179487, "percentage": 64.96, "elapsed_time": "9:51:36", "remaining_time": "5:19:09"} | |
| {"current_steps": 1066, "total_steps": 1638, "loss": 1.1642062664031982, "lr": 7.18926520144924e-06, "epoch": 1.9523809523809523, "percentage": 65.08, "elapsed_time": "9:52:59", "remaining_time": "5:18:11"} | |
| {"current_steps": 1068, "total_steps": 1638, "loss": 1.1827412843704224, "lr": 7.1570347460107335e-06, "epoch": 1.9560439560439562, "percentage": 65.2, "elapsed_time": "9:54:07", "remaining_time": "5:17:05"} | |
| {"current_steps": 1070, "total_steps": 1638, "loss": 0.706343412399292, "lr": 7.124864452900049e-06, "epoch": 1.9597069597069599, "percentage": 65.32, "elapsed_time": "9:55:06", "remaining_time": "5:15:54"} | |
| {"current_steps": 1072, "total_steps": 1638, "loss": 0.8516585230827332, "lr": 7.0927548257490465e-06, "epoch": 1.9633699633699635, "percentage": 65.45, "elapsed_time": "9:56:08", "remaining_time": "5:14:45"} | |
| {"current_steps": 1074, "total_steps": 1638, "loss": 1.1490978002548218, "lr": 7.060706367239836e-06, "epoch": 1.9670329670329672, "percentage": 65.57, "elapsed_time": "9:57:32", "remaining_time": "5:13:47"} | |
| {"current_steps": 1076, "total_steps": 1638, "loss": 1.1234122514724731, "lr": 7.028719579096932e-06, "epoch": 1.9706959706959708, "percentage": 65.69, "elapsed_time": "9:58:39", "remaining_time": "5:12:41"} | |
| {"current_steps": 1078, "total_steps": 1638, "loss": 0.9970806241035461, "lr": 6.9967949620793854e-06, "epoch": 1.9743589743589745, "percentage": 65.81, "elapsed_time": "9:59:49", "remaining_time": "5:11:35"} | |
| {"current_steps": 1080, "total_steps": 1638, "loss": 0.9913358688354492, "lr": 6.964933015972947e-06, "epoch": 1.978021978021978, "percentage": 65.93, "elapsed_time": "10:00:54", "remaining_time": "5:10:28"} | |
| {"current_steps": 1082, "total_steps": 1638, "loss": 1.1181188821792603, "lr": 6.933134239582246e-06, "epoch": 1.9816849816849818, "percentage": 66.06, "elapsed_time": "10:02:18", "remaining_time": "5:09:30"} | |
| {"current_steps": 1084, "total_steps": 1638, "loss": 0.7213895916938782, "lr": 6.9013991307229745e-06, "epoch": 1.9853479853479854, "percentage": 66.18, "elapsed_time": "10:03:21", "remaining_time": "5:08:21"} | |
| {"current_steps": 1086, "total_steps": 1638, "loss": 0.9767944812774658, "lr": 6.869728186214093e-06, "epoch": 1.989010989010989, "percentage": 66.3, "elapsed_time": "10:04:24", "remaining_time": "5:07:13"} | |
| {"current_steps": 1088, "total_steps": 1638, "loss": 0.9789687395095825, "lr": 6.8381219018700675e-06, "epoch": 1.9926739926739927, "percentage": 66.42, "elapsed_time": "10:05:33", "remaining_time": "5:06:07"} | |
| {"current_steps": 1090, "total_steps": 1638, "loss": 0.930722713470459, "lr": 6.806580772493088e-06, "epoch": 1.9963369963369964, "percentage": 66.54, "elapsed_time": "10:06:43", "remaining_time": "5:05:02"} | |
| {"current_steps": 1092, "total_steps": 1638, "loss": 1.042896032333374, "lr": 6.775105291865343e-06, "epoch": 2.0, "percentage": 66.67, "elapsed_time": "10:07:49", "remaining_time": "5:03:54"} | |
| {"current_steps": 1094, "total_steps": 1638, "loss": 1.0761141777038574, "lr": 6.743695952741265e-06, "epoch": 2.0036630036630036, "percentage": 66.79, "elapsed_time": "10:09:02", "remaining_time": "5:02:50"} | |
| {"current_steps": 1096, "total_steps": 1638, "loss": 1.1419814825057983, "lr": 6.71235324683983e-06, "epoch": 2.0073260073260073, "percentage": 66.91, "elapsed_time": "10:10:15", "remaining_time": "5:01:47"} | |
| {"current_steps": 1098, "total_steps": 1638, "loss": 1.088594675064087, "lr": 6.681077664836872e-06, "epoch": 2.010989010989011, "percentage": 67.03, "elapsed_time": "10:11:38", "remaining_time": "5:00:48"} | |
| {"current_steps": 1100, "total_steps": 1638, "loss": 1.1661202907562256, "lr": 6.649869696357381e-06, "epoch": 2.0146520146520146, "percentage": 67.16, "elapsed_time": "10:12:48", "remaining_time": "4:59:43"} | |
| {"current_steps": 1102, "total_steps": 1638, "loss": 0.8467984795570374, "lr": 6.6187298299678295e-06, "epoch": 2.0183150183150182, "percentage": 67.28, "elapsed_time": "10:13:55", "remaining_time": "4:58:36"} | |
| {"current_steps": 1104, "total_steps": 1638, "loss": 1.142805576324463, "lr": 6.587658553168563e-06, "epoch": 2.021978021978022, "percentage": 67.4, "elapsed_time": "10:15:05", "remaining_time": "4:57:30"} | |
| {"current_steps": 1106, "total_steps": 1638, "loss": 0.7679157853126526, "lr": 6.556656352386135e-06, "epoch": 2.0256410256410255, "percentage": 67.52, "elapsed_time": "10:16:14", "remaining_time": "4:56:25"} | |
| {"current_steps": 1108, "total_steps": 1638, "loss": 1.1841180324554443, "lr": 6.525723712965698e-06, "epoch": 2.029304029304029, "percentage": 67.64, "elapsed_time": "10:17:28", "remaining_time": "4:55:21"} | |
| {"current_steps": 1110, "total_steps": 1638, "loss": 0.9058336019515991, "lr": 6.494861119163412e-06, "epoch": 2.032967032967033, "percentage": 67.77, "elapsed_time": "10:18:34", "remaining_time": "4:54:14"} | |
| {"current_steps": 1112, "total_steps": 1638, "loss": 0.6124511957168579, "lr": 6.464069054138853e-06, "epoch": 2.0366300366300365, "percentage": 67.89, "elapsed_time": "10:19:21", "remaining_time": "4:52:58"} | |
| {"current_steps": 1114, "total_steps": 1638, "loss": 0.845076322555542, "lr": 6.433347999947468e-06, "epoch": 2.04029304029304, "percentage": 68.01, "elapsed_time": "10:20:27", "remaining_time": "4:51:50"} | |
| {"current_steps": 1116, "total_steps": 1638, "loss": 1.1547578573226929, "lr": 6.402698437533012e-06, "epoch": 2.043956043956044, "percentage": 68.13, "elapsed_time": "10:21:36", "remaining_time": "4:50:45"} | |
| {"current_steps": 1118, "total_steps": 1638, "loss": 1.1199109554290771, "lr": 6.372120846720018e-06, "epoch": 2.0476190476190474, "percentage": 68.25, "elapsed_time": "10:22:45", "remaining_time": "4:49:39"} | |
| {"current_steps": 1120, "total_steps": 1638, "loss": 0.8209899067878723, "lr": 6.341615706206292e-06, "epoch": 2.051282051282051, "percentage": 68.38, "elapsed_time": "10:23:53", "remaining_time": "4:48:32"} | |
| {"current_steps": 1122, "total_steps": 1638, "loss": 1.3262028694152832, "lr": 6.311183493555426e-06, "epoch": 2.0549450549450547, "percentage": 68.5, "elapsed_time": "10:25:06", "remaining_time": "4:47:28"} | |
| {"current_steps": 1124, "total_steps": 1638, "loss": 1.1404337882995605, "lr": 6.280824685189296e-06, "epoch": 2.0586080586080584, "percentage": 68.62, "elapsed_time": "10:26:18", "remaining_time": "4:46:24"} | |
| {"current_steps": 1126, "total_steps": 1638, "loss": 0.7613569498062134, "lr": 6.25053975638064e-06, "epoch": 2.062271062271062, "percentage": 68.74, "elapsed_time": "10:27:16", "remaining_time": "4:45:13"} | |
| {"current_steps": 1128, "total_steps": 1638, "loss": 1.1030632257461548, "lr": 6.220329181245585e-06, "epoch": 2.065934065934066, "percentage": 68.86, "elapsed_time": "10:28:27", "remaining_time": "4:44:08"} | |
| {"current_steps": 1130, "total_steps": 1638, "loss": 1.098275899887085, "lr": 6.1901934327362355e-06, "epoch": 2.06959706959707, "percentage": 68.99, "elapsed_time": "10:29:44", "remaining_time": "4:43:06"} | |
| {"current_steps": 1132, "total_steps": 1638, "loss": 1.130845069885254, "lr": 6.16013298263328e-06, "epoch": 2.0732600732600734, "percentage": 69.11, "elapsed_time": "10:30:55", "remaining_time": "4:42:01"} | |
| {"current_steps": 1134, "total_steps": 1638, "loss": 1.1122570037841797, "lr": 6.130148301538601e-06, "epoch": 2.076923076923077, "percentage": 69.23, "elapsed_time": "10:32:05", "remaining_time": "4:40:55"} | |
| {"current_steps": 1136, "total_steps": 1638, "loss": 0.7130240201950073, "lr": 6.100239858867887e-06, "epoch": 2.0805860805860807, "percentage": 69.35, "elapsed_time": "10:33:10", "remaining_time": "4:39:48"} | |
| {"current_steps": 1138, "total_steps": 1638, "loss": 1.1177456378936768, "lr": 6.070408122843311e-06, "epoch": 2.0842490842490844, "percentage": 69.47, "elapsed_time": "10:34:31", "remaining_time": "4:38:47"} | |
| {"current_steps": 1140, "total_steps": 1638, "loss": 1.0220168828964233, "lr": 6.040653560486183e-06, "epoch": 2.087912087912088, "percentage": 69.6, "elapsed_time": "10:35:26", "remaining_time": "4:37:35"} | |
| {"current_steps": 1142, "total_steps": 1638, "loss": 1.106982707977295, "lr": 6.010976637609653e-06, "epoch": 2.0915750915750917, "percentage": 69.72, "elapsed_time": "10:36:37", "remaining_time": "4:36:30"} | |
| {"current_steps": 1144, "total_steps": 1638, "loss": 0.3823546767234802, "lr": 5.9813778188114125e-06, "epoch": 2.0952380952380953, "percentage": 69.84, "elapsed_time": "10:37:30", "remaining_time": "4:35:17"} | |
| {"current_steps": 1146, "total_steps": 1638, "loss": 0.9157997369766235, "lr": 5.951857567466401e-06, "epoch": 2.098901098901099, "percentage": 69.96, "elapsed_time": "10:38:47", "remaining_time": "4:34:14"} | |
| {"current_steps": 1148, "total_steps": 1638, "loss": 0.8090324997901917, "lr": 5.922416345719588e-06, "epoch": 2.1025641025641026, "percentage": 70.09, "elapsed_time": "10:39:48", "remaining_time": "4:33:05"} | |
| {"current_steps": 1150, "total_steps": 1638, "loss": 0.8260840773582458, "lr": 5.893054614478718e-06, "epoch": 2.1062271062271063, "percentage": 70.21, "elapsed_time": "10:40:53", "remaining_time": "4:31:57"} | |
| {"current_steps": 1152, "total_steps": 1638, "loss": 0.9580783843994141, "lr": 5.8637728334070905e-06, "epoch": 2.10989010989011, "percentage": 70.33, "elapsed_time": "10:42:02", "remaining_time": "4:30:51"} | |
| {"current_steps": 1154, "total_steps": 1638, "loss": 0.7878354787826538, "lr": 5.834571460916371e-06, "epoch": 2.1135531135531136, "percentage": 70.45, "elapsed_time": "10:43:00", "remaining_time": "4:29:40"} | |
| {"current_steps": 1156, "total_steps": 1638, "loss": 1.0745412111282349, "lr": 5.805450954159422e-06, "epoch": 2.1172161172161172, "percentage": 70.57, "elapsed_time": "10:44:09", "remaining_time": "4:28:35"} | |
| {"current_steps": 1158, "total_steps": 1638, "loss": 1.0261658430099487, "lr": 5.776411769023127e-06, "epoch": 2.120879120879121, "percentage": 70.7, "elapsed_time": "10:45:01", "remaining_time": "4:27:21"} | |
| {"current_steps": 1160, "total_steps": 1638, "loss": 0.870224118232727, "lr": 5.747454360121274e-06, "epoch": 2.1245421245421245, "percentage": 70.82, "elapsed_time": "10:46:08", "remaining_time": "4:26:15"} | |
| {"current_steps": 1162, "total_steps": 1638, "loss": 0.7795557379722595, "lr": 5.718579180787425e-06, "epoch": 2.128205128205128, "percentage": 70.94, "elapsed_time": "10:47:20", "remaining_time": "4:25:10"} | |
| {"current_steps": 1164, "total_steps": 1638, "loss": 0.918286144733429, "lr": 5.689786683067817e-06, "epoch": 2.131868131868132, "percentage": 71.06, "elapsed_time": "10:48:19", "remaining_time": "4:24:00"} | |
| {"current_steps": 1166, "total_steps": 1638, "loss": 0.42682868242263794, "lr": 5.661077317714303e-06, "epoch": 2.1355311355311355, "percentage": 71.18, "elapsed_time": "10:49:20", "remaining_time": "4:22:51"} | |
| {"current_steps": 1168, "total_steps": 1638, "loss": 0.4123497009277344, "lr": 5.632451534177276e-06, "epoch": 2.139194139194139, "percentage": 71.31, "elapsed_time": "10:50:22", "remaining_time": "4:21:42"} | |
| {"current_steps": 1170, "total_steps": 1638, "loss": 0.8927979469299316, "lr": 5.603909780598644e-06, "epoch": 2.142857142857143, "percentage": 71.43, "elapsed_time": "10:51:26", "remaining_time": "4:20:34"} | |
| {"current_steps": 1172, "total_steps": 1638, "loss": 1.1349587440490723, "lr": 5.575452503804805e-06, "epoch": 2.1465201465201464, "percentage": 71.55, "elapsed_time": "10:52:31", "remaining_time": "4:19:26"} | |
| {"current_steps": 1174, "total_steps": 1638, "loss": 1.2815641164779663, "lr": 5.5470801492996605e-06, "epoch": 2.15018315018315, "percentage": 71.67, "elapsed_time": "10:53:41", "remaining_time": "4:18:21"} | |
| {"current_steps": 1176, "total_steps": 1638, "loss": 0.7825716137886047, "lr": 5.518793161257641e-06, "epoch": 2.1538461538461537, "percentage": 71.79, "elapsed_time": "10:54:39", "remaining_time": "4:17:11"} | |
| {"current_steps": 1178, "total_steps": 1638, "loss": 1.142047643661499, "lr": 5.490591982516749e-06, "epoch": 2.1575091575091574, "percentage": 71.92, "elapsed_time": "10:55:56", "remaining_time": "4:16:08"} | |
| {"current_steps": 1180, "total_steps": 1638, "loss": 1.127107858657837, "lr": 5.462477054571617e-06, "epoch": 2.161172161172161, "percentage": 72.04, "elapsed_time": "10:57:17", "remaining_time": "4:15:07"} | |
| {"current_steps": 1182, "total_steps": 1638, "loss": 1.1234813928604126, "lr": 5.4344488175666154e-06, "epoch": 2.1648351648351647, "percentage": 72.16, "elapsed_time": "10:58:32", "remaining_time": "4:14:03"} | |
| {"current_steps": 1184, "total_steps": 1638, "loss": 1.1429002285003662, "lr": 5.406507710288955e-06, "epoch": 2.1684981684981683, "percentage": 72.28, "elapsed_time": "10:59:42", "remaining_time": "4:12:57"} | |
| {"current_steps": 1186, "total_steps": 1638, "loss": 0.318652480840683, "lr": 5.378654170161805e-06, "epoch": 2.172161172161172, "percentage": 72.41, "elapsed_time": "11:00:30", "remaining_time": "4:11:43"} | |
| {"current_steps": 1188, "total_steps": 1638, "loss": 1.2102477550506592, "lr": 5.3508886332374534e-06, "epoch": 2.1758241758241756, "percentage": 72.53, "elapsed_time": "11:01:36", "remaining_time": "4:10:36"} | |
| {"current_steps": 1190, "total_steps": 1638, "loss": 0.7540980577468872, "lr": 5.323211534190496e-06, "epoch": 2.1794871794871793, "percentage": 72.65, "elapsed_time": "11:02:40", "remaining_time": "4:09:28"} | |
| {"current_steps": 1192, "total_steps": 1638, "loss": 0.9759551286697388, "lr": 5.295623306310999e-06, "epoch": 2.183150183150183, "percentage": 72.77, "elapsed_time": "11:03:49", "remaining_time": "4:08:22"} | |
| {"current_steps": 1194, "total_steps": 1638, "loss": 0.802385687828064, "lr": 5.268124381497755e-06, "epoch": 2.186813186813187, "percentage": 72.89, "elapsed_time": "11:04:52", "remaining_time": "4:07:14"} | |
| {"current_steps": 1196, "total_steps": 1638, "loss": 0.9069569110870361, "lr": 5.240715190251484e-06, "epoch": 2.1904761904761907, "percentage": 73.02, "elapsed_time": "11:05:58", "remaining_time": "4:06:07"} | |
| {"current_steps": 1198, "total_steps": 1638, "loss": 1.1158097982406616, "lr": 5.213396161668111e-06, "epoch": 2.1941391941391943, "percentage": 73.14, "elapsed_time": "11:07:10", "remaining_time": "4:05:02"} | |
| {"current_steps": 1200, "total_steps": 1638, "loss": 0.7542502284049988, "lr": 5.186167723432061e-06, "epoch": 2.197802197802198, "percentage": 73.26, "elapsed_time": "11:08:05", "remaining_time": "4:03:51"} | |
| {"current_steps": 1202, "total_steps": 1638, "loss": 1.1818705797195435, "lr": 5.159030301809534e-06, "epoch": 2.2014652014652016, "percentage": 73.38, "elapsed_time": "11:09:25", "remaining_time": "4:02:49"} | |
| {"current_steps": 1204, "total_steps": 1638, "loss": 1.1314865350723267, "lr": 5.131984321641865e-06, "epoch": 2.2051282051282053, "percentage": 73.5, "elapsed_time": "11:10:33", "remaining_time": "4:01:42"} | |
| {"current_steps": 1206, "total_steps": 1638, "loss": 0.4661150276660919, "lr": 5.105030206338843e-06, "epoch": 2.208791208791209, "percentage": 73.63, "elapsed_time": "11:11:34", "remaining_time": "4:00:33"} | |
| {"current_steps": 1208, "total_steps": 1638, "loss": 0.7386508584022522, "lr": 5.0781683778720965e-06, "epoch": 2.2124542124542126, "percentage": 73.75, "elapsed_time": "11:12:41", "remaining_time": "3:59:27"} | |
| {"current_steps": 1210, "total_steps": 1638, "loss": 0.6242368817329407, "lr": 5.051399256768498e-06, "epoch": 2.2161172161172162, "percentage": 73.87, "elapsed_time": "11:13:42", "remaining_time": "3:58:18"} | |
| {"current_steps": 1212, "total_steps": 1638, "loss": 1.0724856853485107, "lr": 5.024723262103559e-06, "epoch": 2.21978021978022, "percentage": 73.99, "elapsed_time": "11:14:53", "remaining_time": "3:57:12"} | |
| {"current_steps": 1214, "total_steps": 1638, "loss": 0.8155311942100525, "lr": 4.998140811494881e-06, "epoch": 2.2234432234432235, "percentage": 74.11, "elapsed_time": "11:16:01", "remaining_time": "3:56:06"} | |
| {"current_steps": 1216, "total_steps": 1638, "loss": 0.8991819024085999, "lr": 4.971652321095614e-06, "epoch": 2.227106227106227, "percentage": 74.24, "elapsed_time": "11:16:58", "remaining_time": "3:54:56"} | |
| {"current_steps": 1218, "total_steps": 1638, "loss": 1.1150569915771484, "lr": 4.945258205587955e-06, "epoch": 2.230769230769231, "percentage": 74.36, "elapsed_time": "11:18:14", "remaining_time": "3:53:52"} | |
| {"current_steps": 1220, "total_steps": 1638, "loss": 1.1184298992156982, "lr": 4.918958878176628e-06, "epoch": 2.2344322344322345, "percentage": 74.48, "elapsed_time": "11:19:26", "remaining_time": "3:52:47"} | |
| {"current_steps": 1222, "total_steps": 1638, "loss": 1.1249486207962036, "lr": 4.8927547505824465e-06, "epoch": 2.238095238095238, "percentage": 74.6, "elapsed_time": "11:20:35", "remaining_time": "3:51:41"} | |
| {"current_steps": 1224, "total_steps": 1638, "loss": 0.49614107608795166, "lr": 4.866646233035845e-06, "epoch": 2.241758241758242, "percentage": 74.73, "elapsed_time": "11:21:38", "remaining_time": "3:50:33"} | |
| {"current_steps": 1226, "total_steps": 1638, "loss": 1.0849982500076294, "lr": 4.840633734270464e-06, "epoch": 2.2454212454212454, "percentage": 74.85, "elapsed_time": "11:22:58", "remaining_time": "3:49:30"} | |
| {"current_steps": 1228, "total_steps": 1638, "loss": 0.6019390821456909, "lr": 4.814717661516762e-06, "epoch": 2.249084249084249, "percentage": 74.97, "elapsed_time": "11:24:01", "remaining_time": "3:48:22"} | |
| {"current_steps": 1230, "total_steps": 1638, "loss": 0.5977374911308289, "lr": 4.788898420495622e-06, "epoch": 2.2527472527472527, "percentage": 75.09, "elapsed_time": "11:24:58", "remaining_time": "3:47:12"} | |
| {"current_steps": 1232, "total_steps": 1638, "loss": 0.6154077649116516, "lr": 4.763176415412006e-06, "epoch": 2.2564102564102564, "percentage": 75.21, "elapsed_time": "11:25:50", "remaining_time": "3:46:00"} | |
| {"current_steps": 1234, "total_steps": 1638, "loss": 1.1026445627212524, "lr": 4.7375520489486395e-06, "epoch": 2.26007326007326, "percentage": 75.34, "elapsed_time": "11:27:07", "remaining_time": "3:44:57"} | |
| {"current_steps": 1236, "total_steps": 1638, "loss": 0.7703139185905457, "lr": 4.71202572225969e-06, "epoch": 2.2637362637362637, "percentage": 75.46, "elapsed_time": "11:28:10", "remaining_time": "3:43:49"} | |
| {"current_steps": 1238, "total_steps": 1638, "loss": 0.9660443067550659, "lr": 4.686597834964499e-06, "epoch": 2.2673992673992673, "percentage": 75.58, "elapsed_time": "11:29:14", "remaining_time": "3:42:41"} | |
| {"current_steps": 1240, "total_steps": 1638, "loss": 0.8756354451179504, "lr": 4.661268785141316e-06, "epoch": 2.271062271062271, "percentage": 75.7, "elapsed_time": "11:30:17", "remaining_time": "3:41:33"} | |
| {"current_steps": 1242, "total_steps": 1638, "loss": 0.9074218273162842, "lr": 4.636038969321073e-06, "epoch": 2.2747252747252746, "percentage": 75.82, "elapsed_time": "11:31:33", "remaining_time": "3:40:29"} | |
| {"current_steps": 1244, "total_steps": 1638, "loss": 1.1067166328430176, "lr": 4.610908782481179e-06, "epoch": 2.2783882783882783, "percentage": 75.95, "elapsed_time": "11:32:44", "remaining_time": "3:39:24"} | |
| {"current_steps": 1246, "total_steps": 1638, "loss": 0.730255126953125, "lr": 4.5858786180393326e-06, "epoch": 2.282051282051282, "percentage": 76.07, "elapsed_time": "11:33:42", "remaining_time": "3:38:14"} | |
| {"current_steps": 1248, "total_steps": 1638, "loss": 0.714017391204834, "lr": 4.560948867847359e-06, "epoch": 2.2857142857142856, "percentage": 76.19, "elapsed_time": "11:34:47", "remaining_time": "3:37:07"} | |
| {"current_steps": 1250, "total_steps": 1638, "loss": 1.1044319868087769, "lr": 4.536119922185082e-06, "epoch": 2.2893772893772892, "percentage": 76.31, "elapsed_time": "11:36:03", "remaining_time": "3:36:03"} | |
| {"current_steps": 1252, "total_steps": 1638, "loss": 0.839243471622467, "lr": 4.511392169754214e-06, "epoch": 2.293040293040293, "percentage": 76.43, "elapsed_time": "11:37:04", "remaining_time": "3:34:54"} | |
| {"current_steps": 1254, "total_steps": 1638, "loss": 1.1593670845031738, "lr": 4.486765997672263e-06, "epoch": 2.2967032967032965, "percentage": 76.56, "elapsed_time": "11:38:13", "remaining_time": "3:33:48"} | |
| {"current_steps": 1256, "total_steps": 1638, "loss": 1.1839113235473633, "lr": 4.46224179146649e-06, "epoch": 2.3003663003663, "percentage": 76.68, "elapsed_time": "11:39:23", "remaining_time": "3:32:42"} | |
| {"current_steps": 1258, "total_steps": 1638, "loss": 0.7787933945655823, "lr": 4.437819935067847e-06, "epoch": 2.304029304029304, "percentage": 76.8, "elapsed_time": "11:40:36", "remaining_time": "3:31:37"} | |
| {"current_steps": 1260, "total_steps": 1638, "loss": 1.1648868322372437, "lr": 4.413500810804986e-06, "epoch": 2.3076923076923075, "percentage": 76.92, "elapsed_time": "11:41:31", "remaining_time": "3:30:27"} | |
| {"current_steps": 1262, "total_steps": 1638, "loss": 1.0294675827026367, "lr": 4.389284799398276e-06, "epoch": 2.311355311355311, "percentage": 77.05, "elapsed_time": "11:42:38", "remaining_time": "3:29:20"} | |
| {"current_steps": 1264, "total_steps": 1638, "loss": 0.8878557085990906, "lr": 4.365172279953825e-06, "epoch": 2.315018315018315, "percentage": 77.17, "elapsed_time": "11:43:42", "remaining_time": "3:28:13"} | |
| {"current_steps": 1266, "total_steps": 1638, "loss": 0.7896403670310974, "lr": 4.34116362995756e-06, "epoch": 2.3186813186813184, "percentage": 77.29, "elapsed_time": "11:44:54", "remaining_time": "3:27:07"} | |
| {"current_steps": 1268, "total_steps": 1638, "loss": 0.8231415748596191, "lr": 4.317259225269313e-06, "epoch": 2.3223443223443225, "percentage": 77.41, "elapsed_time": "11:45:49", "remaining_time": "3:25:57"} | |
| {"current_steps": 1270, "total_steps": 1638, "loss": 1.1856201887130737, "lr": 4.293459440116935e-06, "epoch": 2.326007326007326, "percentage": 77.53, "elapsed_time": "11:46:56", "remaining_time": "3:24:50"} | |
| {"current_steps": 1272, "total_steps": 1638, "loss": 1.0062392950057983, "lr": 4.269764647090442e-06, "epoch": 2.32967032967033, "percentage": 77.66, "elapsed_time": "11:48:04", "remaining_time": "3:23:44"} | |
| {"current_steps": 1274, "total_steps": 1638, "loss": 0.6258052587509155, "lr": 4.246175217136176e-06, "epoch": 2.3333333333333335, "percentage": 77.78, "elapsed_time": "11:49:04", "remaining_time": "3:22:35"} | |
| {"current_steps": 1276, "total_steps": 1638, "loss": 1.0182530879974365, "lr": 4.2226915195509954e-06, "epoch": 2.336996336996337, "percentage": 77.9, "elapsed_time": "11:50:13", "remaining_time": "3:21:29"} | |
| {"current_steps": 1278, "total_steps": 1638, "loss": 0.8344160318374634, "lr": 4.199313921976511e-06, "epoch": 2.340659340659341, "percentage": 78.02, "elapsed_time": "11:51:22", "remaining_time": "3:20:23"} | |
| {"current_steps": 1280, "total_steps": 1638, "loss": 0.7739337682723999, "lr": 4.176042790393313e-06, "epoch": 2.3443223443223444, "percentage": 78.14, "elapsed_time": "11:52:29", "remaining_time": "3:19:16"} | |
| {"current_steps": 1282, "total_steps": 1638, "loss": 0.6238831877708435, "lr": 4.152878489115244e-06, "epoch": 2.347985347985348, "percentage": 78.27, "elapsed_time": "11:53:31", "remaining_time": "3:18:08"} | |
| {"current_steps": 1284, "total_steps": 1638, "loss": 1.091771125793457, "lr": 4.129821380783698e-06, "epoch": 2.3516483516483517, "percentage": 78.39, "elapsed_time": "11:54:43", "remaining_time": "3:17:03"} | |
| {"current_steps": 1286, "total_steps": 1638, "loss": 0.6089442372322083, "lr": 4.106871826361952e-06, "epoch": 2.3553113553113554, "percentage": 78.51, "elapsed_time": "11:55:46", "remaining_time": "3:15:55"} | |
| {"current_steps": 1288, "total_steps": 1638, "loss": 0.9349772334098816, "lr": 4.084030185129495e-06, "epoch": 2.358974358974359, "percentage": 78.63, "elapsed_time": "11:56:51", "remaining_time": "3:14:47"} | |
| {"current_steps": 1290, "total_steps": 1638, "loss": 0.896765947341919, "lr": 4.061296814676429e-06, "epoch": 2.3626373626373627, "percentage": 78.75, "elapsed_time": "11:58:01", "remaining_time": "3:13:41"} | |
| {"current_steps": 1292, "total_steps": 1638, "loss": 0.7659744024276733, "lr": 4.038672070897844e-06, "epoch": 2.3663003663003663, "percentage": 78.88, "elapsed_time": "11:59:01", "remaining_time": "3:12:33"} | |
| {"current_steps": 1294, "total_steps": 1638, "loss": 0.8882443308830261, "lr": 4.016156307988262e-06, "epoch": 2.36996336996337, "percentage": 79.0, "elapsed_time": "11:59:57", "remaining_time": "3:11:23"} | |
| {"current_steps": 1296, "total_steps": 1638, "loss": 1.2214046716690063, "lr": 3.9937498784361e-06, "epoch": 2.3736263736263736, "percentage": 79.12, "elapsed_time": "12:01:01", "remaining_time": "3:10:16"} | |
| {"current_steps": 1298, "total_steps": 1638, "loss": 1.1040786504745483, "lr": 3.9714531330181275e-06, "epoch": 2.3772893772893773, "percentage": 79.24, "elapsed_time": "12:02:17", "remaining_time": "3:09:11"} | |
| {"current_steps": 1300, "total_steps": 1638, "loss": 1.1567643880844116, "lr": 3.949266420793999e-06, "epoch": 2.380952380952381, "percentage": 79.37, "elapsed_time": "12:03:29", "remaining_time": "3:08:06"} | |
| {"current_steps": 1302, "total_steps": 1638, "loss": 0.8775418996810913, "lr": 3.9271900891007734e-06, "epoch": 2.3846153846153846, "percentage": 79.49, "elapsed_time": "12:04:32", "remaining_time": "3:06:58"} | |
| {"current_steps": 1304, "total_steps": 1638, "loss": 0.47357720136642456, "lr": 3.905224483547479e-06, "epoch": 2.3882783882783882, "percentage": 79.61, "elapsed_time": "12:05:34", "remaining_time": "3:05:50"} | |
| {"current_steps": 1306, "total_steps": 1638, "loss": 0.9344196915626526, "lr": 3.883369948009714e-06, "epoch": 2.391941391941392, "percentage": 79.73, "elapsed_time": "12:06:50", "remaining_time": "3:04:46"} | |
| {"current_steps": 1308, "total_steps": 1638, "loss": 1.1155997514724731, "lr": 3.861626824624258e-06, "epoch": 2.3956043956043955, "percentage": 79.85, "elapsed_time": "12:08:00", "remaining_time": "3:03:40"} | |
| {"current_steps": 1310, "total_steps": 1638, "loss": 0.5117136836051941, "lr": 3.839995453783694e-06, "epoch": 2.399267399267399, "percentage": 79.98, "elapsed_time": "12:09:00", "remaining_time": "3:02:31"} | |
| {"current_steps": 1312, "total_steps": 1638, "loss": 1.11769437789917, "lr": 3.818476174131118e-06, "epoch": 2.402930402930403, "percentage": 80.1, "elapsed_time": "12:10:09", "remaining_time": "3:01:25"} | |
| {"current_steps": 1314, "total_steps": 1638, "loss": 0.8328091502189636, "lr": 3.7970693225548116e-06, "epoch": 2.4065934065934065, "percentage": 80.22, "elapsed_time": "12:11:20", "remaining_time": "3:00:19"} | |
| {"current_steps": 1316, "total_steps": 1638, "loss": 1.115455985069275, "lr": 3.7757752341829723e-06, "epoch": 2.41025641025641, "percentage": 80.34, "elapsed_time": "12:12:31", "remaining_time": "2:59:14"} | |
| {"current_steps": 1318, "total_steps": 1638, "loss": 0.7884094715118408, "lr": 3.754594242378466e-06, "epoch": 2.413919413919414, "percentage": 80.46, "elapsed_time": "12:13:38", "remaining_time": "2:58:07"} | |
| {"current_steps": 1320, "total_steps": 1638, "loss": 0.7719835042953491, "lr": 3.7335266787336194e-06, "epoch": 2.4175824175824174, "percentage": 80.59, "elapsed_time": "12:14:35", "remaining_time": "2:56:58"} | |
| {"current_steps": 1322, "total_steps": 1638, "loss": 0.6111771464347839, "lr": 3.712572873065012e-06, "epoch": 2.421245421245421, "percentage": 80.71, "elapsed_time": "12:15:40", "remaining_time": "2:55:51"} | |
| {"current_steps": 1324, "total_steps": 1638, "loss": 0.7621108889579773, "lr": 3.69173315340833e-06, "epoch": 2.4249084249084247, "percentage": 80.83, "elapsed_time": "12:16:46", "remaining_time": "2:54:43"} | |
| {"current_steps": 1326, "total_steps": 1638, "loss": 0.7748345732688904, "lr": 3.6710078460132137e-06, "epoch": 2.4285714285714284, "percentage": 80.95, "elapsed_time": "12:17:51", "remaining_time": "2:53:36"} | |
| {"current_steps": 1328, "total_steps": 1638, "loss": 0.6837164163589478, "lr": 3.650397275338161e-06, "epoch": 2.4322344322344325, "percentage": 81.07, "elapsed_time": "12:18:55", "remaining_time": "2:52:29"} | |
| {"current_steps": 1330, "total_steps": 1638, "loss": 0.8087068796157837, "lr": 3.6299017640454516e-06, "epoch": 2.435897435897436, "percentage": 81.2, "elapsed_time": "12:20:17", "remaining_time": "2:51:26"} | |
| {"current_steps": 1332, "total_steps": 1638, "loss": 1.0505911111831665, "lr": 3.6095216329960786e-06, "epoch": 2.4395604395604398, "percentage": 81.32, "elapsed_time": "12:21:23", "remaining_time": "2:50:19"} | |
| {"current_steps": 1334, "total_steps": 1638, "loss": 0.7000587582588196, "lr": 3.5892572012447457e-06, "epoch": 2.4432234432234434, "percentage": 81.44, "elapsed_time": "12:22:29", "remaining_time": "2:49:12"} | |
| {"current_steps": 1336, "total_steps": 1638, "loss": 0.6738724708557129, "lr": 3.5691087860348577e-06, "epoch": 2.446886446886447, "percentage": 81.56, "elapsed_time": "12:23:23", "remaining_time": "2:48:02"} | |
| {"current_steps": 1338, "total_steps": 1638, "loss": 1.0319753885269165, "lr": 3.549076702793557e-06, "epoch": 2.4505494505494507, "percentage": 81.68, "elapsed_time": "12:24:37", "remaining_time": "2:46:57"} | |
| {"current_steps": 1340, "total_steps": 1638, "loss": 0.8778097033500671, "lr": 3.529161265126795e-06, "epoch": 2.4542124542124544, "percentage": 81.81, "elapsed_time": "12:25:30", "remaining_time": "2:45:47"} | |
| {"current_steps": 1342, "total_steps": 1638, "loss": 0.6379270553588867, "lr": 3.5093627848144128e-06, "epoch": 2.457875457875458, "percentage": 81.93, "elapsed_time": "12:26:23", "remaining_time": "2:44:37"} | |
| {"current_steps": 1344, "total_steps": 1638, "loss": 0.8156915903091431, "lr": 3.4896815718052534e-06, "epoch": 2.4615384615384617, "percentage": 82.05, "elapsed_time": "12:27:30", "remaining_time": "2:43:30"} | |
| {"current_steps": 1346, "total_steps": 1638, "loss": 1.0697602033615112, "lr": 3.4701179342123313e-06, "epoch": 2.4652014652014653, "percentage": 82.17, "elapsed_time": "12:28:33", "remaining_time": "2:42:23"} | |
| {"current_steps": 1348, "total_steps": 1638, "loss": 1.1031157970428467, "lr": 3.4506721783079925e-06, "epoch": 2.468864468864469, "percentage": 82.3, "elapsed_time": "12:29:44", "remaining_time": "2:41:17"} | |
| {"current_steps": 1350, "total_steps": 1638, "loss": 0.7289459705352783, "lr": 3.4313446085191203e-06, "epoch": 2.4725274725274726, "percentage": 82.42, "elapsed_time": "12:30:41", "remaining_time": "2:40:08"} | |
| {"current_steps": 1352, "total_steps": 1638, "loss": 0.8309732675552368, "lr": 3.4121355274223727e-06, "epoch": 2.4761904761904763, "percentage": 82.54, "elapsed_time": "12:31:37", "remaining_time": "2:38:59"} | |
| {"current_steps": 1354, "total_steps": 1638, "loss": 0.9143206477165222, "lr": 3.3930452357394473e-06, "epoch": 2.47985347985348, "percentage": 82.66, "elapsed_time": "12:32:54", "remaining_time": "2:37:55"} | |
| {"current_steps": 1356, "total_steps": 1638, "loss": 0.9112240672111511, "lr": 3.3740740323323705e-06, "epoch": 2.4835164835164836, "percentage": 82.78, "elapsed_time": "12:33:51", "remaining_time": "2:36:46"} | |
| {"current_steps": 1358, "total_steps": 1638, "loss": 1.0814073085784912, "lr": 3.3552222141988257e-06, "epoch": 2.4871794871794872, "percentage": 82.91, "elapsed_time": "12:34:58", "remaining_time": "2:35:39"} | |
| {"current_steps": 1360, "total_steps": 1638, "loss": 0.8779569268226624, "lr": 3.336490076467489e-06, "epoch": 2.490842490842491, "percentage": 83.03, "elapsed_time": "12:36:06", "remaining_time": "2:34:33"} | |
| {"current_steps": 1362, "total_steps": 1638, "loss": 0.8037658929824829, "lr": 3.31787791239342e-06, "epoch": 2.4945054945054945, "percentage": 83.15, "elapsed_time": "12:37:17", "remaining_time": "2:33:27"} | |
| {"current_steps": 1364, "total_steps": 1638, "loss": 0.8425225615501404, "lr": 3.2993860133534763e-06, "epoch": 2.498168498168498, "percentage": 83.27, "elapsed_time": "12:38:23", "remaining_time": "2:32:20"} | |
| {"current_steps": 1366, "total_steps": 1638, "loss": 1.1095960140228271, "lr": 3.2810146688417304e-06, "epoch": 2.501831501831502, "percentage": 83.39, "elapsed_time": "12:39:31", "remaining_time": "2:31:14"} | |
| {"current_steps": 1368, "total_steps": 1638, "loss": 0.7693407535552979, "lr": 3.2627641664649666e-06, "epoch": 2.5054945054945055, "percentage": 83.52, "elapsed_time": "12:40:23", "remaining_time": "2:30:04"} | |
| {"current_steps": 1370, "total_steps": 1638, "loss": 0.9375527501106262, "lr": 3.2446347919381533e-06, "epoch": 2.509157509157509, "percentage": 83.64, "elapsed_time": "12:41:22", "remaining_time": "2:28:56"} | |
| {"current_steps": 1372, "total_steps": 1638, "loss": 0.6353393197059631, "lr": 3.226626829079979e-06, "epoch": 2.5128205128205128, "percentage": 83.76, "elapsed_time": "12:42:19", "remaining_time": "2:27:47"} | |
| {"current_steps": 1374, "total_steps": 1638, "loss": 0.7712477445602417, "lr": 3.2087405598084194e-06, "epoch": 2.5164835164835164, "percentage": 83.88, "elapsed_time": "12:43:18", "remaining_time": "2:26:39"} | |
| {"current_steps": 1376, "total_steps": 1638, "loss": 0.9633672833442688, "lr": 3.1909762641363083e-06, "epoch": 2.52014652014652, "percentage": 84.0, "elapsed_time": "12:44:28", "remaining_time": "2:25:33"} | |
| {"current_steps": 1378, "total_steps": 1638, "loss": 0.7830007076263428, "lr": 3.173334220166962e-06, "epoch": 2.5238095238095237, "percentage": 84.13, "elapsed_time": "12:45:38", "remaining_time": "2:24:27"} | |
| {"current_steps": 1380, "total_steps": 1638, "loss": 0.8970922827720642, "lr": 3.155814704089823e-06, "epoch": 2.5274725274725274, "percentage": 84.25, "elapsed_time": "12:46:35", "remaining_time": "2:23:19"} | |
| {"current_steps": 1382, "total_steps": 1638, "loss": 0.8635251522064209, "lr": 3.1384179901761343e-06, "epoch": 2.531135531135531, "percentage": 84.37, "elapsed_time": "12:47:28", "remaining_time": "2:22:10"} | |
| {"current_steps": 1384, "total_steps": 1638, "loss": 0.7926411628723145, "lr": 3.1211443507746546e-06, "epoch": 2.5347985347985347, "percentage": 84.49, "elapsed_time": "12:48:31", "remaining_time": "2:21:02"} | |
| {"current_steps": 1386, "total_steps": 1638, "loss": 1.1008884906768799, "lr": 3.1039940563073894e-06, "epoch": 2.5384615384615383, "percentage": 84.62, "elapsed_time": "12:49:49", "remaining_time": "2:19:58"} | |
| {"current_steps": 1388, "total_steps": 1638, "loss": 0.7606490254402161, "lr": 3.0869673752653447e-06, "epoch": 2.542124542124542, "percentage": 84.74, "elapsed_time": "12:50:52", "remaining_time": "2:18:50"} | |
| {"current_steps": 1390, "total_steps": 1638, "loss": 0.8070803880691528, "lr": 3.0700645742043476e-06, "epoch": 2.5457875457875456, "percentage": 84.86, "elapsed_time": "12:52:01", "remaining_time": "2:17:44"} | |
| {"current_steps": 1392, "total_steps": 1638, "loss": 0.983840823173523, "lr": 3.0532859177408587e-06, "epoch": 2.5494505494505493, "percentage": 84.98, "elapsed_time": "12:53:00", "remaining_time": "2:16:36"} | |
| {"current_steps": 1394, "total_steps": 1638, "loss": 0.7210864424705505, "lr": 3.03663166854783e-06, "epoch": 2.553113553113553, "percentage": 85.1, "elapsed_time": "12:54:09", "remaining_time": "2:15:30"} | |
| {"current_steps": 1396, "total_steps": 1638, "loss": 0.34565648436546326, "lr": 3.020102087350594e-06, "epoch": 2.5567765567765566, "percentage": 85.23, "elapsed_time": "12:55:02", "remaining_time": "2:14:21"} | |
| {"current_steps": 1398, "total_steps": 1638, "loss": 1.1138232946395874, "lr": 3.0036974329227862e-06, "epoch": 2.5604395604395602, "percentage": 85.35, "elapsed_time": "12:56:10", "remaining_time": "2:13:14"} | |
| {"current_steps": 1400, "total_steps": 1638, "loss": 1.1061241626739502, "lr": 2.9874179620822856e-06, "epoch": 2.564102564102564, "percentage": 85.47, "elapsed_time": "12:57:22", "remaining_time": "2:12:09"} | |
| {"current_steps": 1402, "total_steps": 1638, "loss": 0.8952687978744507, "lr": 2.971263929687207e-06, "epoch": 2.5677655677655675, "percentage": 85.59, "elapsed_time": "12:58:17", "remaining_time": "2:11:00"} | |
| {"current_steps": 1404, "total_steps": 1638, "loss": 1.105149269104004, "lr": 2.9552355886318968e-06, "epoch": 2.571428571428571, "percentage": 85.71, "elapsed_time": "12:59:28", "remaining_time": "2:09:54"} | |
| {"current_steps": 1406, "total_steps": 1638, "loss": 0.5482126474380493, "lr": 2.9393331898429777e-06, "epoch": 2.575091575091575, "percentage": 85.84, "elapsed_time": "13:00:30", "remaining_time": "2:08:47"} | |
| {"current_steps": 1408, "total_steps": 1638, "loss": 1.1531801223754883, "lr": 2.9235569822754317e-06, "epoch": 2.578754578754579, "percentage": 85.96, "elapsed_time": "13:01:38", "remaining_time": "2:07:40"} | |
| {"current_steps": 1410, "total_steps": 1638, "loss": 0.8873265981674194, "lr": 2.9079072129086906e-06, "epoch": 2.5824175824175826, "percentage": 86.08, "elapsed_time": "13:02:50", "remaining_time": "2:06:35"} | |
| {"current_steps": 1412, "total_steps": 1638, "loss": 0.8655451536178589, "lr": 2.89238412674277e-06, "epoch": 2.586080586080586, "percentage": 86.2, "elapsed_time": "13:03:58", "remaining_time": "2:05:28"} | |
| {"current_steps": 1414, "total_steps": 1638, "loss": 0.953106164932251, "lr": 2.8769879667944393e-06, "epoch": 2.58974358974359, "percentage": 86.32, "elapsed_time": "13:05:02", "remaining_time": "2:04:21"} | |
| {"current_steps": 1416, "total_steps": 1638, "loss": 1.0878214836120605, "lr": 2.8617189740934113e-06, "epoch": 2.5934065934065935, "percentage": 86.45, "elapsed_time": "13:06:14", "remaining_time": "2:03:15"} | |
| {"current_steps": 1418, "total_steps": 1638, "loss": 0.7813395261764526, "lr": 2.8465773876785786e-06, "epoch": 2.597069597069597, "percentage": 86.57, "elapsed_time": "13:07:04", "remaining_time": "2:02:06"} | |
| {"current_steps": 1420, "total_steps": 1638, "loss": 1.085912823677063, "lr": 2.8315634445942623e-06, "epoch": 2.600732600732601, "percentage": 86.69, "elapsed_time": "13:08:16", "remaining_time": "2:01:01"} | |
| {"current_steps": 1422, "total_steps": 1638, "loss": 1.0346283912658691, "lr": 2.8166773798864978e-06, "epoch": 2.6043956043956045, "percentage": 86.81, "elapsed_time": "13:09:25", "remaining_time": "1:59:54"} | |
| {"current_steps": 1424, "total_steps": 1638, "loss": 1.1859427690505981, "lr": 2.8019194265993683e-06, "epoch": 2.608058608058608, "percentage": 86.94, "elapsed_time": "13:10:37", "remaining_time": "1:58:48"} | |
| {"current_steps": 1426, "total_steps": 1638, "loss": 1.2661553621292114, "lr": 2.787289815771348e-06, "epoch": 2.6117216117216118, "percentage": 87.06, "elapsed_time": "13:11:51", "remaining_time": "1:57:43"} | |
| {"current_steps": 1428, "total_steps": 1638, "loss": 1.1397687196731567, "lr": 2.7727887764316835e-06, "epoch": 2.6153846153846154, "percentage": 87.18, "elapsed_time": "13:13:01", "remaining_time": "1:56:37"} | |
| {"current_steps": 1430, "total_steps": 1638, "loss": 1.078932285308838, "lr": 2.758416535596812e-06, "epoch": 2.619047619047619, "percentage": 87.3, "elapsed_time": "13:14:11", "remaining_time": "1:55:31"} | |
| {"current_steps": 1432, "total_steps": 1638, "loss": 0.9404812455177307, "lr": 2.744173318266809e-06, "epoch": 2.6227106227106227, "percentage": 87.42, "elapsed_time": "13:15:13", "remaining_time": "1:54:23"} | |
| {"current_steps": 1434, "total_steps": 1638, "loss": 0.944557785987854, "lr": 2.7300593474218583e-06, "epoch": 2.6263736263736264, "percentage": 87.55, "elapsed_time": "13:16:22", "remaining_time": "1:53:17"} | |
| {"current_steps": 1436, "total_steps": 1638, "loss": 1.3000890016555786, "lr": 2.7160748440187736e-06, "epoch": 2.63003663003663, "percentage": 87.67, "elapsed_time": "13:17:30", "remaining_time": "1:52:11"} | |
| {"current_steps": 1438, "total_steps": 1638, "loss": 1.1358200311660767, "lr": 2.702220026987525e-06, "epoch": 2.6336996336996337, "percentage": 87.79, "elapsed_time": "13:18:38", "remaining_time": "1:51:04"} | |
| {"current_steps": 1440, "total_steps": 1638, "loss": 1.0983126163482666, "lr": 2.6884951132278185e-06, "epoch": 2.6373626373626373, "percentage": 87.91, "elapsed_time": "13:19:49", "remaining_time": "1:49:58"} | |
| {"current_steps": 1442, "total_steps": 1638, "loss": 1.1217743158340454, "lr": 2.6749003176057092e-06, "epoch": 2.641025641025641, "percentage": 88.03, "elapsed_time": "13:21:07", "remaining_time": "1:48:53"} | |
| {"current_steps": 1444, "total_steps": 1638, "loss": 1.0780593156814575, "lr": 2.6614358529502165e-06, "epoch": 2.6446886446886446, "percentage": 88.16, "elapsed_time": "13:22:03", "remaining_time": "1:47:45"} | |
| {"current_steps": 1446, "total_steps": 1638, "loss": 0.7865286469459534, "lr": 2.6481019300500166e-06, "epoch": 2.6483516483516483, "percentage": 88.28, "elapsed_time": "13:23:10", "remaining_time": "1:46:38"} | |
| {"current_steps": 1448, "total_steps": 1638, "loss": 0.8131098747253418, "lr": 2.634898757650121e-06, "epoch": 2.652014652014652, "percentage": 88.4, "elapsed_time": "13:24:14", "remaining_time": "1:45:31"} | |
| {"current_steps": 1450, "total_steps": 1638, "loss": 1.0926539897918701, "lr": 2.6218265424486233e-06, "epoch": 2.6556776556776556, "percentage": 88.52, "elapsed_time": "13:25:34", "remaining_time": "1:44:26"} | |
| {"current_steps": 1452, "total_steps": 1638, "loss": 0.6913706660270691, "lr": 2.608885489093455e-06, "epoch": 2.659340659340659, "percentage": 88.64, "elapsed_time": "13:26:28", "remaining_time": "1:43:18"} | |
| {"current_steps": 1454, "total_steps": 1638, "loss": 0.6983692646026611, "lr": 2.5960758001791893e-06, "epoch": 2.663003663003663, "percentage": 88.77, "elapsed_time": "13:27:26", "remaining_time": "1:42:10"} | |
| {"current_steps": 1456, "total_steps": 1638, "loss": 1.070138692855835, "lr": 2.5833976762438605e-06, "epoch": 2.6666666666666665, "percentage": 88.89, "elapsed_time": "13:28:36", "remaining_time": "1:41:04"} | |
| {"current_steps": 1458, "total_steps": 1638, "loss": 0.7558898329734802, "lr": 2.5708513157658295e-06, "epoch": 2.67032967032967, "percentage": 89.01, "elapsed_time": "13:29:51", "remaining_time": "1:39:58"} | |
| {"current_steps": 1460, "total_steps": 1638, "loss": 0.7775593400001526, "lr": 2.5584369151606785e-06, "epoch": 2.6739926739926743, "percentage": 89.13, "elapsed_time": "13:30:48", "remaining_time": "1:38:51"} | |
| {"current_steps": 1462, "total_steps": 1638, "loss": 0.7822065949440002, "lr": 2.5461546687781325e-06, "epoch": 2.677655677655678, "percentage": 89.26, "elapsed_time": "13:31:55", "remaining_time": "1:37:44"} | |
| {"current_steps": 1464, "total_steps": 1638, "loss": 1.0643365383148193, "lr": 2.5340047688990142e-06, "epoch": 2.6813186813186816, "percentage": 89.38, "elapsed_time": "13:32:57", "remaining_time": "1:36:37"} | |
| {"current_steps": 1466, "total_steps": 1638, "loss": 0.8780837655067444, "lr": 2.5219874057322453e-06, "epoch": 2.684981684981685, "percentage": 89.5, "elapsed_time": "13:34:10", "remaining_time": "1:35:31"} | |
| {"current_steps": 1468, "total_steps": 1638, "loss": 1.0327012538909912, "lr": 2.5101027674118523e-06, "epoch": 2.688644688644689, "percentage": 89.62, "elapsed_time": "13:35:10", "remaining_time": "1:34:24"} | |
| {"current_steps": 1470, "total_steps": 1638, "loss": 0.6416606903076172, "lr": 2.4983510399940377e-06, "epoch": 2.6923076923076925, "percentage": 89.74, "elapsed_time": "13:36:13", "remaining_time": "1:33:16"} | |
| {"current_steps": 1472, "total_steps": 1638, "loss": 0.7808413505554199, "lr": 2.4867324074542525e-06, "epoch": 2.695970695970696, "percentage": 89.87, "elapsed_time": "13:37:07", "remaining_time": "1:32:08"} | |
| {"current_steps": 1474, "total_steps": 1638, "loss": 0.4357774257659912, "lr": 2.4752470516843257e-06, "epoch": 2.6996336996337, "percentage": 89.99, "elapsed_time": "13:38:20", "remaining_time": "1:31:02"} | |
| {"current_steps": 1476, "total_steps": 1638, "loss": 1.1387816667556763, "lr": 2.463895152489617e-06, "epoch": 2.7032967032967035, "percentage": 90.11, "elapsed_time": "13:39:33", "remaining_time": "1:29:57"} | |
| {"current_steps": 1478, "total_steps": 1638, "loss": 0.6536518335342407, "lr": 2.4526768875861938e-06, "epoch": 2.706959706959707, "percentage": 90.23, "elapsed_time": "13:40:41", "remaining_time": "1:28:50"} | |
| {"current_steps": 1480, "total_steps": 1638, "loss": 1.0885263681411743, "lr": 2.4415924325980575e-06, "epoch": 2.7106227106227108, "percentage": 90.35, "elapsed_time": "13:41:53", "remaining_time": "1:27:44"} | |
| {"current_steps": 1482, "total_steps": 1638, "loss": 1.111185073852539, "lr": 2.4306419610543885e-06, "epoch": 2.7142857142857144, "percentage": 90.48, "elapsed_time": "13:43:14", "remaining_time": "1:26:39"} | |
| {"current_steps": 1484, "total_steps": 1638, "loss": 1.0014547109603882, "lr": 2.4198256443868327e-06, "epoch": 2.717948717948718, "percentage": 90.6, "elapsed_time": "13:44:26", "remaining_time": "1:25:33"} | |
| {"current_steps": 1486, "total_steps": 1638, "loss": 1.1393983364105225, "lr": 2.4091436519268167e-06, "epoch": 2.7216117216117217, "percentage": 90.72, "elapsed_time": "13:45:34", "remaining_time": "1:24:26"} | |
| {"current_steps": 1488, "total_steps": 1638, "loss": 1.1307849884033203, "lr": 2.3985961509028994e-06, "epoch": 2.7252747252747254, "percentage": 90.84, "elapsed_time": "13:46:46", "remaining_time": "1:23:20"} | |
| {"current_steps": 1490, "total_steps": 1638, "loss": 0.5263049006462097, "lr": 2.3881833064381478e-06, "epoch": 2.728937728937729, "percentage": 90.96, "elapsed_time": "13:47:58", "remaining_time": "1:22:14"} | |
| {"current_steps": 1492, "total_steps": 1638, "loss": 1.1333121061325073, "lr": 2.3779052815475553e-06, "epoch": 2.7326007326007327, "percentage": 91.09, "elapsed_time": "13:49:10", "remaining_time": "1:21:08"} | |
| {"current_steps": 1494, "total_steps": 1638, "loss": 0.6400864124298096, "lr": 2.3677622371354932e-06, "epoch": 2.7362637362637363, "percentage": 91.21, "elapsed_time": "13:50:10", "remaining_time": "1:20:00"} | |
| {"current_steps": 1496, "total_steps": 1638, "loss": 1.201290249824524, "lr": 2.357754331993187e-06, "epoch": 2.73992673992674, "percentage": 91.33, "elapsed_time": "13:51:19", "remaining_time": "1:18:54"} | |
| {"current_steps": 1498, "total_steps": 1638, "loss": 1.0851210355758667, "lr": 2.347881722796234e-06, "epoch": 2.7435897435897436, "percentage": 91.45, "elapsed_time": "13:52:38", "remaining_time": "1:17:49"} | |
| {"current_steps": 1500, "total_steps": 1638, "loss": 0.7578325271606445, "lr": 2.3381445641021445e-06, "epoch": 2.7472527472527473, "percentage": 91.58, "elapsed_time": "13:53:40", "remaining_time": "1:16:41"} | |
| {"current_steps": 1502, "total_steps": 1638, "loss": 1.035079002380371, "lr": 2.328543008347928e-06, "epoch": 2.750915750915751, "percentage": 91.7, "elapsed_time": "13:54:51", "remaining_time": "1:15:35"} | |
| {"current_steps": 1504, "total_steps": 1638, "loss": 1.1079963445663452, "lr": 2.31907720584771e-06, "epoch": 2.7545787545787546, "percentage": 91.82, "elapsed_time": "13:56:04", "remaining_time": "1:14:29"} | |
| {"current_steps": 1506, "total_steps": 1638, "loss": 1.1286638975143433, "lr": 2.3097473047903645e-06, "epoch": 2.758241758241758, "percentage": 91.94, "elapsed_time": "13:57:18", "remaining_time": "1:13:23"} | |
| {"current_steps": 1508, "total_steps": 1638, "loss": 0.9173861145973206, "lr": 2.3005534512372106e-06, "epoch": 2.761904761904762, "percentage": 92.06, "elapsed_time": "13:58:16", "remaining_time": "1:12:15"} | |
| {"current_steps": 1510, "total_steps": 1638, "loss": 1.0080313682556152, "lr": 2.2914957891197182e-06, "epoch": 2.7655677655677655, "percentage": 92.19, "elapsed_time": "13:59:25", "remaining_time": "1:11:09"} | |
| {"current_steps": 1512, "total_steps": 1638, "loss": 0.9287357330322266, "lr": 2.2825744602372506e-06, "epoch": 2.769230769230769, "percentage": 92.31, "elapsed_time": "14:00:35", "remaining_time": "1:10:02"} | |
| {"current_steps": 1514, "total_steps": 1638, "loss": 1.0911868810653687, "lr": 2.2737896042548537e-06, "epoch": 2.772893772893773, "percentage": 92.43, "elapsed_time": "14:01:55", "remaining_time": "1:08:57"} | |
| {"current_steps": 1516, "total_steps": 1638, "loss": 1.0172020196914673, "lr": 2.2651413587010634e-06, "epoch": 2.7765567765567765, "percentage": 92.55, "elapsed_time": "14:03:04", "remaining_time": "1:07:50"} | |
| {"current_steps": 1518, "total_steps": 1638, "loss": 0.9788475036621094, "lr": 2.2566298589657546e-06, "epoch": 2.78021978021978, "percentage": 92.67, "elapsed_time": "14:04:02", "remaining_time": "1:06:43"} | |
| {"current_steps": 1520, "total_steps": 1638, "loss": 0.5113797187805176, "lr": 2.2482552382980194e-06, "epoch": 2.7838827838827838, "percentage": 92.8, "elapsed_time": "14:05:05", "remaining_time": "1:05:36"} | |
| {"current_steps": 1522, "total_steps": 1638, "loss": 0.7732734084129333, "lr": 2.240017627804088e-06, "epoch": 2.7875457875457874, "percentage": 92.92, "elapsed_time": "14:06:18", "remaining_time": "1:04:30"} | |
| {"current_steps": 1524, "total_steps": 1638, "loss": 0.8295901417732239, "lr": 2.231917156445265e-06, "epoch": 2.791208791208791, "percentage": 93.04, "elapsed_time": "14:07:25", "remaining_time": "1:03:23"} | |
| {"current_steps": 1526, "total_steps": 1638, "loss": 1.143431544303894, "lr": 2.223953951035919e-06, "epoch": 2.7948717948717947, "percentage": 93.16, "elapsed_time": "14:08:35", "remaining_time": "1:02:16"} | |
| {"current_steps": 1528, "total_steps": 1638, "loss": 1.099791407585144, "lr": 2.216128136241497e-06, "epoch": 2.7985347985347984, "percentage": 93.28, "elapsed_time": "14:09:44", "remaining_time": "1:01:10"} | |
| {"current_steps": 1530, "total_steps": 1638, "loss": 1.0971970558166504, "lr": 2.208439834576568e-06, "epoch": 2.802197802197802, "percentage": 93.41, "elapsed_time": "14:11:06", "remaining_time": "1:00:04"} | |
| {"current_steps": 1532, "total_steps": 1638, "loss": 0.9817790389060974, "lr": 2.200889166402908e-06, "epoch": 2.8058608058608057, "percentage": 93.53, "elapsed_time": "14:12:22", "remaining_time": "0:58:58"} | |
| {"current_steps": 1534, "total_steps": 1638, "loss": 0.7601557970046997, "lr": 2.193476249927617e-06, "epoch": 2.8095238095238093, "percentage": 93.65, "elapsed_time": "14:13:24", "remaining_time": "0:57:51"} | |
| {"current_steps": 1536, "total_steps": 1638, "loss": 1.2858803272247314, "lr": 2.1862012012012647e-06, "epoch": 2.813186813186813, "percentage": 93.77, "elapsed_time": "14:14:31", "remaining_time": "0:56:44"} | |
| {"current_steps": 1538, "total_steps": 1638, "loss": 0.9057199954986572, "lr": 2.179064134116078e-06, "epoch": 2.8168498168498166, "percentage": 93.89, "elapsed_time": "14:15:49", "remaining_time": "0:55:38"} | |
| {"current_steps": 1540, "total_steps": 1638, "loss": 0.7820447683334351, "lr": 2.1720651604041543e-06, "epoch": 2.8205128205128203, "percentage": 94.02, "elapsed_time": "14:16:55", "remaining_time": "0:54:31"} | |
| {"current_steps": 1542, "total_steps": 1638, "loss": 0.8878074288368225, "lr": 2.1652043896357132e-06, "epoch": 2.824175824175824, "percentage": 94.14, "elapsed_time": "14:18:00", "remaining_time": "0:53:24"} | |
| {"current_steps": 1544, "total_steps": 1638, "loss": 1.0764187574386597, "lr": 2.1584819292173844e-06, "epoch": 2.8278388278388276, "percentage": 94.26, "elapsed_time": "14:19:10", "remaining_time": "0:52:18"} | |
| {"current_steps": 1546, "total_steps": 1638, "loss": 1.1286197900772095, "lr": 2.1518978843905204e-06, "epoch": 2.8315018315018317, "percentage": 94.38, "elapsed_time": "14:20:30", "remaining_time": "0:51:12"} | |
| {"current_steps": 1548, "total_steps": 1638, "loss": 1.1888434886932373, "lr": 2.1454523582295567e-06, "epoch": 2.8351648351648353, "percentage": 94.51, "elapsed_time": "14:21:38", "remaining_time": "0:50:05"} | |
| {"current_steps": 1550, "total_steps": 1638, "loss": 0.8518368601799011, "lr": 2.1391454516403876e-06, "epoch": 2.838827838827839, "percentage": 94.63, "elapsed_time": "14:22:43", "remaining_time": "0:48:58"} | |
| {"current_steps": 1552, "total_steps": 1638, "loss": 0.5600649118423462, "lr": 2.1329772633587976e-06, "epoch": 2.8424908424908426, "percentage": 94.75, "elapsed_time": "14:23:41", "remaining_time": "0:47:51"} | |
| {"current_steps": 1554, "total_steps": 1638, "loss": 1.1281017065048218, "lr": 2.1269478899489068e-06, "epoch": 2.8461538461538463, "percentage": 94.87, "elapsed_time": "14:25:01", "remaining_time": "0:46:45"} | |
| {"current_steps": 1556, "total_steps": 1638, "loss": 0.9187098741531372, "lr": 2.1210574258016675e-06, "epoch": 2.84981684981685, "percentage": 94.99, "elapsed_time": "14:26:11", "remaining_time": "0:45:38"} | |
| {"current_steps": 1558, "total_steps": 1638, "loss": 1.0671217441558838, "lr": 2.1153059631333785e-06, "epoch": 2.8534798534798536, "percentage": 95.12, "elapsed_time": "14:27:20", "remaining_time": "0:44:32"} | |
| {"current_steps": 1560, "total_steps": 1638, "loss": 0.5967673063278198, "lr": 2.1096935919842434e-06, "epoch": 2.857142857142857, "percentage": 95.24, "elapsed_time": "14:28:23", "remaining_time": "0:43:25"} | |
| {"current_steps": 1562, "total_steps": 1638, "loss": 0.705269992351532, "lr": 2.104220400216967e-06, "epoch": 2.860805860805861, "percentage": 95.36, "elapsed_time": "14:29:16", "remaining_time": "0:42:17"} | |
| {"current_steps": 1564, "total_steps": 1638, "loss": 0.8660311102867126, "lr": 2.0988864735153724e-06, "epoch": 2.8644688644688645, "percentage": 95.48, "elapsed_time": "14:30:25", "remaining_time": "0:41:11"} | |
| {"current_steps": 1566, "total_steps": 1638, "loss": 0.6954091787338257, "lr": 2.0936918953830633e-06, "epoch": 2.868131868131868, "percentage": 95.6, "elapsed_time": "14:31:10", "remaining_time": "0:40:03"} | |
| {"current_steps": 1568, "total_steps": 1638, "loss": 0.7069787979125977, "lr": 2.088636747142114e-06, "epoch": 2.871794871794872, "percentage": 95.73, "elapsed_time": "14:32:17", "remaining_time": "0:38:56"} | |
| {"current_steps": 1570, "total_steps": 1638, "loss": 0.7010313868522644, "lr": 2.083721107931803e-06, "epoch": 2.8754578754578755, "percentage": 95.85, "elapsed_time": "14:33:20", "remaining_time": "0:37:49"} | |
| {"current_steps": 1572, "total_steps": 1638, "loss": 0.66036057472229, "lr": 2.0789450547073634e-06, "epoch": 2.879120879120879, "percentage": 95.97, "elapsed_time": "14:34:24", "remaining_time": "0:36:42"} | |
| {"current_steps": 1574, "total_steps": 1638, "loss": 0.9868662357330322, "lr": 2.074308662238789e-06, "epoch": 2.8827838827838828, "percentage": 96.09, "elapsed_time": "14:35:27", "remaining_time": "0:35:35"} | |
| {"current_steps": 1576, "total_steps": 1638, "loss": 0.9745535254478455, "lr": 2.069812003109654e-06, "epoch": 2.8864468864468864, "percentage": 96.21, "elapsed_time": "14:36:27", "remaining_time": "0:34:28"} | |
| {"current_steps": 1578, "total_steps": 1638, "loss": 0.8887320160865784, "lr": 2.0654551477159868e-06, "epoch": 2.89010989010989, "percentage": 96.34, "elapsed_time": "14:37:35", "remaining_time": "0:33:22"} | |
| {"current_steps": 1580, "total_steps": 1638, "loss": 1.170695185661316, "lr": 2.0612381642651584e-06, "epoch": 2.8937728937728937, "percentage": 96.46, "elapsed_time": "14:38:46", "remaining_time": "0:32:15"} | |
| {"current_steps": 1582, "total_steps": 1638, "loss": 0.8388773798942566, "lr": 2.057161118774821e-06, "epoch": 2.8974358974358974, "percentage": 96.58, "elapsed_time": "14:39:54", "remaining_time": "0:31:08"} | |
| {"current_steps": 1584, "total_steps": 1638, "loss": 0.7783088684082031, "lr": 2.05322407507187e-06, "epoch": 2.901098901098901, "percentage": 96.7, "elapsed_time": "14:40:51", "remaining_time": "0:30:01"} | |
| {"current_steps": 1586, "total_steps": 1638, "loss": 0.788719892501831, "lr": 2.0494270947914507e-06, "epoch": 2.9047619047619047, "percentage": 96.83, "elapsed_time": "14:41:57", "remaining_time": "0:28:54"} | |
| {"current_steps": 1588, "total_steps": 1638, "loss": 1.1323091983795166, "lr": 2.0457702373759864e-06, "epoch": 2.9084249084249083, "percentage": 96.95, "elapsed_time": "14:43:03", "remaining_time": "0:27:48"} | |
| {"current_steps": 1590, "total_steps": 1638, "loss": 1.1913877725601196, "lr": 2.0422535600742526e-06, "epoch": 2.912087912087912, "percentage": 97.07, "elapsed_time": "14:44:13", "remaining_time": "0:26:41"} | |
| {"current_steps": 1592, "total_steps": 1638, "loss": 0.5050473809242249, "lr": 2.03887711794048e-06, "epoch": 2.9157509157509156, "percentage": 97.19, "elapsed_time": "14:45:15", "remaining_time": "0:25:34"} | |
| {"current_steps": 1594, "total_steps": 1638, "loss": 1.1414501667022705, "lr": 2.0356409638334902e-06, "epoch": 2.9194139194139193, "percentage": 97.31, "elapsed_time": "14:46:33", "remaining_time": "0:24:28"} | |
| {"current_steps": 1596, "total_steps": 1638, "loss": 0.747880756855011, "lr": 2.032545148415871e-06, "epoch": 2.9230769230769234, "percentage": 97.44, "elapsed_time": "14:47:36", "remaining_time": "0:23:21"} | |
| {"current_steps": 1598, "total_steps": 1638, "loss": 1.1563737392425537, "lr": 2.0295897201531838e-06, "epoch": 2.926739926739927, "percentage": 97.56, "elapsed_time": "14:48:46", "remaining_time": "0:22:14"} | |
| {"current_steps": 1600, "total_steps": 1638, "loss": 0.7789967060089111, "lr": 2.026774725313199e-06, "epoch": 2.9304029304029307, "percentage": 97.68, "elapsed_time": "14:49:50", "remaining_time": "0:21:08"} | |
| {"current_steps": 1602, "total_steps": 1638, "loss": 1.1291173696517944, "lr": 2.0241002079651803e-06, "epoch": 2.9340659340659343, "percentage": 97.8, "elapsed_time": "14:51:09", "remaining_time": "0:20:01"} | |
| {"current_steps": 1604, "total_steps": 1638, "loss": 0.8520828485488892, "lr": 2.0215662099791874e-06, "epoch": 2.937728937728938, "percentage": 97.92, "elapsed_time": "14:52:02", "remaining_time": "0:18:54"} | |
| {"current_steps": 1606, "total_steps": 1638, "loss": 1.1100882291793823, "lr": 2.019172771025426e-06, "epoch": 2.9413919413919416, "percentage": 98.05, "elapsed_time": "14:53:14", "remaining_time": "0:17:47"} | |
| {"current_steps": 1608, "total_steps": 1638, "loss": 0.7092351317405701, "lr": 2.0169199285736234e-06, "epoch": 2.9450549450549453, "percentage": 98.17, "elapsed_time": "14:54:21", "remaining_time": "0:16:41"} | |
| {"current_steps": 1610, "total_steps": 1638, "loss": 1.0054452419281006, "lr": 2.0148077178924412e-06, "epoch": 2.948717948717949, "percentage": 98.29, "elapsed_time": "14:55:24", "remaining_time": "0:15:34"} | |
| {"current_steps": 1612, "total_steps": 1638, "loss": 0.8747723698616028, "lr": 2.0128361720489263e-06, "epoch": 2.9523809523809526, "percentage": 98.41, "elapsed_time": "14:56:18", "remaining_time": "0:14:27"} | |
| {"current_steps": 1614, "total_steps": 1638, "loss": 0.6871626377105713, "lr": 2.0110053219079927e-06, "epoch": 2.956043956043956, "percentage": 98.53, "elapsed_time": "14:57:21", "remaining_time": "0:13:20"} | |
| {"current_steps": 1616, "total_steps": 1638, "loss": 0.8241419792175293, "lr": 2.009315196131934e-06, "epoch": 2.95970695970696, "percentage": 98.66, "elapsed_time": "14:58:19", "remaining_time": "0:12:13"} | |
| {"current_steps": 1618, "total_steps": 1638, "loss": 1.3680229187011719, "lr": 2.0077658211799823e-06, "epoch": 2.9633699633699635, "percentage": 98.78, "elapsed_time": "14:59:35", "remaining_time": "0:11:07"} | |
| {"current_steps": 1620, "total_steps": 1638, "loss": 1.2273290157318115, "lr": 2.0063572213078856e-06, "epoch": 2.967032967032967, "percentage": 98.9, "elapsed_time": "15:00:44", "remaining_time": "0:10:00"} | |
| {"current_steps": 1622, "total_steps": 1638, "loss": 0.9176530838012695, "lr": 2.0050894185675354e-06, "epoch": 2.970695970695971, "percentage": 99.02, "elapsed_time": "15:01:52", "remaining_time": "0:08:53"} | |
| {"current_steps": 1624, "total_steps": 1638, "loss": 0.780877411365509, "lr": 2.0039624328066154e-06, "epoch": 2.9743589743589745, "percentage": 99.15, "elapsed_time": "15:02:58", "remaining_time": "0:07:47"} | |
| {"current_steps": 1626, "total_steps": 1638, "loss": 0.8718687295913696, "lr": 2.0029762816682963e-06, "epoch": 2.978021978021978, "percentage": 99.27, "elapsed_time": "15:04:00", "remaining_time": "0:06:40"} | |
| {"current_steps": 1628, "total_steps": 1638, "loss": 0.9456213116645813, "lr": 2.0021309805909546e-06, "epoch": 2.9816849816849818, "percentage": 99.39, "elapsed_time": "15:05:06", "remaining_time": "0:05:33"} | |
| {"current_steps": 1630, "total_steps": 1638, "loss": 1.4017192125320435, "lr": 2.001426542807935e-06, "epoch": 2.9853479853479854, "percentage": 99.51, "elapsed_time": "15:06:14", "remaining_time": "0:04:26"} | |
| {"current_steps": 1632, "total_steps": 1638, "loss": 1.0045907497406006, "lr": 2.000862979347339e-06, "epoch": 2.989010989010989, "percentage": 99.63, "elapsed_time": "15:07:24", "remaining_time": "0:03:20"} | |
| {"current_steps": 1634, "total_steps": 1638, "loss": 0.7610074281692505, "lr": 2.0004402990318574e-06, "epoch": 2.9926739926739927, "percentage": 99.76, "elapsed_time": "15:08:34", "remaining_time": "0:02:13"} | |
| {"current_steps": 1636, "total_steps": 1638, "loss": 1.2684826850891113, "lr": 2.000158508478629e-06, "epoch": 2.9963369963369964, "percentage": 99.88, "elapsed_time": "15:09:31", "remaining_time": "0:01:06"} | |
| {"current_steps": 1638, "total_steps": 1638, "loss": 1.06321382522583, "lr": 2.0000176120991345e-06, "epoch": 3.0, "percentage": 100.0, "elapsed_time": "15:10:43", "remaining_time": "0:00:00"} | |
| {"current_steps": 1638, "total_steps": 1638, "epoch": 3.0, "percentage": 100.0, "elapsed_time": "15:10:43", "remaining_time": "0:00:00"} | |
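
The rows above are easier to read once summarized. The following is a minimal sketch, not part of the original training setup, that assumes each row keeps the shape shown above (one JSON object between table pipes) and that `elapsed_time` stays in `H:MM:SS` form. The names `SAMPLE_ROWS`, `parse_rows`, and `to_seconds` are hypothetical helpers introduced only for illustration; the sample rows are copied from the end of the log.

```python
import json
from statistics import mean

# Rows copied verbatim from the end of the log above. The last row is the
# final summary entry, which carries no "loss" or "lr" field.
SAMPLE_ROWS = """
| {"current_steps": 1636, "total_steps": 1638, "loss": 1.2684826850891113, "lr": 2.000158508478629e-06, "epoch": 2.9963369963369964, "percentage": 99.88, "elapsed_time": "15:09:31", "remaining_time": "0:01:06"} | |
| {"current_steps": 1638, "total_steps": 1638, "loss": 1.06321382522583, "lr": 2.0000176120991345e-06, "epoch": 3.0, "percentage": 100.0, "elapsed_time": "15:10:43", "remaining_time": "0:00:00"} | |
| {"current_steps": 1638, "total_steps": 1638, "epoch": 3.0, "percentage": 100.0, "elapsed_time": "15:10:43", "remaining_time": "0:00:00"} | |
"""


def parse_rows(text):
    """Extract the JSON object from each table row, skipping rows without one."""
    records = []
    for line in text.splitlines():
        start, end = line.find("{"), line.rfind("}")
        if start == -1 or end == -1:
            continue
        records.append(json.loads(line[start:end + 1]))
    return records


def to_seconds(hms):
    """Convert an 'H:MM:SS' string to seconds (assumes no day component)."""
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return hours * 3600 + minutes * 60 + seconds


records = parse_rows(SAMPLE_ROWS)

# Only per-step entries report a loss; the final summary entry does not.
logged = [r for r in records if "loss" in r]
mean_loss = mean(r["loss"] for r in logged)

# Average wall-clock time per optimizer step over the whole run.
final = records[-1]
sec_per_step = to_seconds(final["elapsed_time"]) / final["current_steps"]

print(f"entries with loss: {len(logged)}")
print(f"mean loss over sample: {mean_loss:.4f}")
print(f"seconds per step: {sec_per_step:.2f}")
```

To summarize the whole run, pass the full table text to `parse_rows` instead of `SAMPLE_ROWS`. Because the per-step loss swings widely between adjacent entries (roughly 0.44 to 1.40 in the rows above), a mean over a window of entries is likely more informative than any single value.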