Image-Text-to-Text
Transformers
Safetensors
qwen3_5
llama-factory
full
Generated from Trainer
conversational
Instructions to use furproxy/9b-136 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use furproxy/9b-136 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-text-to-text", model="furproxy/9b-136") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] pipe(text=messages)# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("furproxy/9b-136") model = AutoModelForImageTextToText.from_pretrained("furproxy/9b-136") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "What animal is on the candy?"} ] }, ] inputs = processor.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use furproxy/9b-136 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "furproxy/9b-136" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "furproxy/9b-136", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker
docker model run hf.co/furproxy/9b-136
- SGLang
How to use furproxy/9b-136 with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "furproxy/9b-136" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "furproxy/9b-136", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "furproxy/9b-136" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "furproxy/9b-136", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "Describe this image in one sentence." }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }' - Docker Model Runner
How to use furproxy/9b-136 with Docker Model Runner:
docker model run hf.co/furproxy/9b-136
| {"current_steps": 2516, "total_steps": 3564, "loss": 0.8136914968490601, "lr": 5.938831903690887e-07, "epoch": 2.1178451178451176, "percentage": 70.59, "elapsed_time": "1:13:42", "remaining_time": "0:30:42"} | |
| {"current_steps": 2518, "total_steps": 3564, "loss": 0.13099154829978943, "lr": 5.925026752463027e-07, "epoch": 2.1195286195286194, "percentage": 70.65, "elapsed_time": "1:13:45", "remaining_time": "0:30:38"} | |
| {"current_steps": 2520, "total_steps": 3564, "loss": 0.33465084433555603, "lr": 5.911239086653345e-07, "epoch": 2.121212121212121, "percentage": 70.71, "elapsed_time": "1:13:49", "remaining_time": "0:30:35"} | |
| {"current_steps": 2522, "total_steps": 3564, "loss": 0.2251596450805664, "lr": 5.89746895376614e-07, "epoch": 2.122895622895623, "percentage": 70.76, "elapsed_time": "1:13:53", "remaining_time": "0:30:31"} | |
| {"current_steps": 2524, "total_steps": 3564, "loss": 0.41063302755355835, "lr": 5.883716401245329e-07, "epoch": 2.1245791245791246, "percentage": 70.82, "elapsed_time": "1:13:56", "remaining_time": "0:30:28"} | |
| {"current_steps": 2526, "total_steps": 3564, "loss": 0.32705599069595337, "lr": 5.869981476474235e-07, "epoch": 2.1262626262626263, "percentage": 70.88, "elapsed_time": "1:13:59", "remaining_time": "0:30:24"} | |
| {"current_steps": 2528, "total_steps": 3564, "loss": 0.28738293051719666, "lr": 5.856264226775451e-07, "epoch": 2.127946127946128, "percentage": 70.93, "elapsed_time": "1:14:03", "remaining_time": "0:30:20"} | |
| {"current_steps": 2530, "total_steps": 3564, "loss": 0.5695469379425049, "lr": 5.842564699410676e-07, "epoch": 2.1296296296296298, "percentage": 70.99, "elapsed_time": "1:14:07", "remaining_time": "0:30:17"} | |
| {"current_steps": 2532, "total_steps": 3564, "loss": 0.7862983345985413, "lr": 5.828882941580548e-07, "epoch": 2.1313131313131315, "percentage": 71.04, "elapsed_time": "1:14:10", "remaining_time": "0:30:14"} | |
| {"current_steps": 2534, "total_steps": 3564, "loss": 0.32265302538871765, "lr": 5.815219000424475e-07, "epoch": 2.1329966329966332, "percentage": 71.1, "elapsed_time": "1:14:13", "remaining_time": "0:30:10"} | |
| {"current_steps": 2536, "total_steps": 3564, "loss": 0.6733647584915161, "lr": 5.801572923020486e-07, "epoch": 2.1346801346801345, "percentage": 71.16, "elapsed_time": "1:14:17", "remaining_time": "0:30:06"} | |
| {"current_steps": 2538, "total_steps": 3564, "loss": 0.34301066398620605, "lr": 5.787944756385061e-07, "epoch": 2.1363636363636362, "percentage": 71.21, "elapsed_time": "1:14:20", "remaining_time": "0:30:03"} | |
| {"current_steps": 2540, "total_steps": 3564, "loss": 0.31534287333488464, "lr": 5.774334547472963e-07, "epoch": 2.138047138047138, "percentage": 71.27, "elapsed_time": "1:14:23", "remaining_time": "0:29:59"} | |
| {"current_steps": 2542, "total_steps": 3564, "loss": 0.6951263546943665, "lr": 5.760742343177091e-07, "epoch": 2.1397306397306397, "percentage": 71.32, "elapsed_time": "1:14:26", "remaining_time": "0:29:55"} | |
| {"current_steps": 2544, "total_steps": 3564, "loss": 0.09168624877929688, "lr": 5.747168190328313e-07, "epoch": 2.1414141414141414, "percentage": 71.38, "elapsed_time": "1:14:29", "remaining_time": "0:29:52"} | |
| {"current_steps": 2546, "total_steps": 3564, "loss": 0.34088313579559326, "lr": 5.73361213569529e-07, "epoch": 2.143097643097643, "percentage": 71.44, "elapsed_time": "1:14:32", "remaining_time": "0:29:48"} | |
| {"current_steps": 2548, "total_steps": 3564, "loss": 0.6928970813751221, "lr": 5.720074225984335e-07, "epoch": 2.144781144781145, "percentage": 71.49, "elapsed_time": "1:14:36", "remaining_time": "0:29:44"} | |
| {"current_steps": 2550, "total_steps": 3564, "loss": 0.8698376417160034, "lr": 5.706554507839247e-07, "epoch": 2.1464646464646466, "percentage": 71.55, "elapsed_time": "1:14:39", "remaining_time": "0:29:41"} | |
| {"current_steps": 2552, "total_steps": 3564, "loss": 0.5156476497650146, "lr": 5.693053027841139e-07, "epoch": 2.148148148148148, "percentage": 71.6, "elapsed_time": "1:14:42", "remaining_time": "0:29:37"} | |
| {"current_steps": 2554, "total_steps": 3564, "loss": 0.14811789989471436, "lr": 5.679569832508294e-07, "epoch": 2.1498316498316496, "percentage": 71.66, "elapsed_time": "1:14:45", "remaining_time": "0:29:33"} | |
| {"current_steps": 2556, "total_steps": 3564, "loss": 0.4402310848236084, "lr": 5.666104968295993e-07, "epoch": 2.1515151515151514, "percentage": 71.72, "elapsed_time": "1:14:48", "remaining_time": "0:29:30"} | |
| {"current_steps": 2558, "total_steps": 3564, "loss": 0.6228591203689575, "lr": 5.652658481596355e-07, "epoch": 2.153198653198653, "percentage": 71.77, "elapsed_time": "1:14:51", "remaining_time": "0:29:26"} | |
| {"current_steps": 2560, "total_steps": 3564, "loss": 0.3809899091720581, "lr": 5.639230418738186e-07, "epoch": 2.154882154882155, "percentage": 71.83, "elapsed_time": "1:14:55", "remaining_time": "0:29:23"} | |
| {"current_steps": 2562, "total_steps": 3564, "loss": 0.4754774570465088, "lr": 5.625820825986818e-07, "epoch": 2.1565656565656566, "percentage": 71.89, "elapsed_time": "1:14:59", "remaining_time": "0:29:19"} | |
| {"current_steps": 2564, "total_steps": 3564, "loss": 0.7122776508331299, "lr": 5.61242974954393e-07, "epoch": 2.1582491582491583, "percentage": 71.94, "elapsed_time": "1:15:02", "remaining_time": "0:29:16"} | |
| {"current_steps": 2566, "total_steps": 3564, "loss": 0.45209017395973206, "lr": 5.599057235547422e-07, "epoch": 2.15993265993266, "percentage": 72.0, "elapsed_time": "1:15:06", "remaining_time": "0:29:12"} | |
| {"current_steps": 2568, "total_steps": 3564, "loss": 0.3703120946884155, "lr": 5.585703330071232e-07, "epoch": 2.1616161616161618, "percentage": 72.05, "elapsed_time": "1:15:09", "remaining_time": "0:29:09"} | |
| {"current_steps": 2570, "total_steps": 3564, "loss": 0.8958742618560791, "lr": 5.572368079125177e-07, "epoch": 2.1632996632996635, "percentage": 72.11, "elapsed_time": "1:15:13", "remaining_time": "0:29:05"} | |
| {"current_steps": 2572, "total_steps": 3564, "loss": 1.0562491416931152, "lr": 5.559051528654812e-07, "epoch": 2.164983164983165, "percentage": 72.17, "elapsed_time": "1:15:17", "remaining_time": "0:29:02"} | |
| {"current_steps": 2574, "total_steps": 3564, "loss": 0.7664850950241089, "lr": 5.545753724541259e-07, "epoch": 2.1666666666666665, "percentage": 72.22, "elapsed_time": "1:15:20", "remaining_time": "0:28:58"} | |
| {"current_steps": 2576, "total_steps": 3564, "loss": 0.2349638044834137, "lr": 5.532474712601041e-07, "epoch": 2.1683501683501682, "percentage": 72.28, "elapsed_time": "1:15:24", "remaining_time": "0:28:55"} | |
| {"current_steps": 2578, "total_steps": 3564, "loss": 0.5862404108047485, "lr": 5.519214538585945e-07, "epoch": 2.17003367003367, "percentage": 72.33, "elapsed_time": "1:15:27", "remaining_time": "0:28:51"} | |
| {"current_steps": 2580, "total_steps": 3564, "loss": 0.25796785950660706, "lr": 5.505973248182854e-07, "epoch": 2.1717171717171717, "percentage": 72.39, "elapsed_time": "1:15:31", "remaining_time": "0:28:48"} | |
| {"current_steps": 2582, "total_steps": 3564, "loss": 0.40474733710289, "lr": 5.492750887013576e-07, "epoch": 2.1734006734006734, "percentage": 72.45, "elapsed_time": "1:15:34", "remaining_time": "0:28:44"} | |
| {"current_steps": 2584, "total_steps": 3564, "loss": 0.25570929050445557, "lr": 5.479547500634716e-07, "epoch": 2.175084175084175, "percentage": 72.5, "elapsed_time": "1:15:37", "remaining_time": "0:28:40"} | |
| {"current_steps": 2586, "total_steps": 3564, "loss": 0.582108736038208, "lr": 5.466363134537495e-07, "epoch": 2.176767676767677, "percentage": 72.56, "elapsed_time": "1:15:40", "remaining_time": "0:28:37"} | |
| {"current_steps": 2588, "total_steps": 3564, "loss": 0.5546954274177551, "lr": 5.453197834147596e-07, "epoch": 2.1784511784511786, "percentage": 72.62, "elapsed_time": "1:15:44", "remaining_time": "0:28:33"} | |
| {"current_steps": 2590, "total_steps": 3564, "loss": 0.6109448671340942, "lr": 5.440051644825024e-07, "epoch": 2.18013468013468, "percentage": 72.67, "elapsed_time": "1:15:47", "remaining_time": "0:28:30"} | |
| {"current_steps": 2592, "total_steps": 3564, "loss": 0.4381883144378662, "lr": 5.426924611863932e-07, "epoch": 2.1818181818181817, "percentage": 72.73, "elapsed_time": "1:15:51", "remaining_time": "0:28:26"} | |
| {"current_steps": 2594, "total_steps": 3564, "loss": 0.28566718101501465, "lr": 5.413816780492464e-07, "epoch": 2.1835016835016834, "percentage": 72.78, "elapsed_time": "1:15:54", "remaining_time": "0:28:23"} | |
| {"current_steps": 2596, "total_steps": 3564, "loss": 0.6839703321456909, "lr": 5.400728195872627e-07, "epoch": 2.185185185185185, "percentage": 72.84, "elapsed_time": "1:15:57", "remaining_time": "0:28:19"} | |
| {"current_steps": 2598, "total_steps": 3564, "loss": 0.7969393134117126, "lr": 5.387658903100093e-07, "epoch": 2.186868686868687, "percentage": 72.9, "elapsed_time": "1:16:01", "remaining_time": "0:28:15"} | |
| {"current_steps": 2600, "total_steps": 3564, "loss": 0.5756024122238159, "lr": 5.374608947204078e-07, "epoch": 2.1885521885521886, "percentage": 72.95, "elapsed_time": "1:16:03", "remaining_time": "0:28:12"} | |
| {"current_steps": 2602, "total_steps": 3564, "loss": 0.8270890116691589, "lr": 5.361578373147173e-07, "epoch": 2.1902356902356903, "percentage": 73.01, "elapsed_time": "1:16:06", "remaining_time": "0:28:08"} | |
| {"current_steps": 2604, "total_steps": 3564, "loss": 0.7463648319244385, "lr": 5.348567225825182e-07, "epoch": 2.191919191919192, "percentage": 73.06, "elapsed_time": "1:16:10", "remaining_time": "0:28:04"} | |
| {"current_steps": 2606, "total_steps": 3564, "loss": 0.3755905032157898, "lr": 5.335575550066987e-07, "epoch": 2.1936026936026938, "percentage": 73.12, "elapsed_time": "1:16:13", "remaining_time": "0:28:01"} | |
| {"current_steps": 2608, "total_steps": 3564, "loss": 0.828824520111084, "lr": 5.322603390634379e-07, "epoch": 2.1952861952861955, "percentage": 73.18, "elapsed_time": "1:16:16", "remaining_time": "0:27:57"} | |
| {"current_steps": 2610, "total_steps": 3564, "loss": 0.7120569944381714, "lr": 5.3096507922219e-07, "epoch": 2.196969696969697, "percentage": 73.23, "elapsed_time": "1:16:20", "remaining_time": "0:27:54"} | |
| {"current_steps": 2612, "total_steps": 3564, "loss": 0.2670977711677551, "lr": 5.296717799456703e-07, "epoch": 2.1986531986531985, "percentage": 73.29, "elapsed_time": "1:16:23", "remaining_time": "0:27:50"} | |
| {"current_steps": 2614, "total_steps": 3564, "loss": 0.7222539782524109, "lr": 5.283804456898393e-07, "epoch": 2.2003367003367003, "percentage": 73.34, "elapsed_time": "1:16:26", "remaining_time": "0:27:46"} | |
| {"current_steps": 2616, "total_steps": 3564, "loss": 0.5107656717300415, "lr": 5.270910809038866e-07, "epoch": 2.202020202020202, "percentage": 73.4, "elapsed_time": "1:16:29", "remaining_time": "0:27:43"} | |
| {"current_steps": 2618, "total_steps": 3564, "loss": 0.44302040338516235, "lr": 5.258036900302162e-07, "epoch": 2.2037037037037037, "percentage": 73.46, "elapsed_time": "1:16:32", "remaining_time": "0:27:39"} | |
| {"current_steps": 2620, "total_steps": 3564, "loss": 0.28953254222869873, "lr": 5.245182775044319e-07, "epoch": 2.2053872053872055, "percentage": 73.51, "elapsed_time": "1:16:34", "remaining_time": "0:27:35"} | |
| {"current_steps": 2622, "total_steps": 3564, "loss": 0.5604819655418396, "lr": 5.2323484775532e-07, "epoch": 2.207070707070707, "percentage": 73.57, "elapsed_time": "1:16:37", "remaining_time": "0:27:31"} | |
| {"current_steps": 2624, "total_steps": 3564, "loss": 0.4838787317276001, "lr": 5.219534052048364e-07, "epoch": 2.208754208754209, "percentage": 73.63, "elapsed_time": "1:16:41", "remaining_time": "0:27:28"} | |
| {"current_steps": 2626, "total_steps": 3564, "loss": 0.41042160987854004, "lr": 5.206739542680903e-07, "epoch": 2.2104377104377106, "percentage": 73.68, "elapsed_time": "1:16:44", "remaining_time": "0:27:24"} | |
| {"current_steps": 2628, "total_steps": 3564, "loss": 0.5403867959976196, "lr": 5.193964993533275e-07, "epoch": 2.212121212121212, "percentage": 73.74, "elapsed_time": "1:16:47", "remaining_time": "0:27:20"} | |
| {"current_steps": 2630, "total_steps": 3564, "loss": 0.25527873635292053, "lr": 5.181210448619185e-07, "epoch": 2.2138047138047137, "percentage": 73.79, "elapsed_time": "1:16:50", "remaining_time": "0:27:17"} | |
| {"current_steps": 2632, "total_steps": 3564, "loss": 0.404461145401001, "lr": 5.168475951883405e-07, "epoch": 2.2154882154882154, "percentage": 73.85, "elapsed_time": "1:16:53", "remaining_time": "0:27:13"} | |
| {"current_steps": 2634, "total_steps": 3564, "loss": 0.07407370954751968, "lr": 5.155761547201631e-07, "epoch": 2.217171717171717, "percentage": 73.91, "elapsed_time": "1:16:55", "remaining_time": "0:27:09"} | |
| {"current_steps": 2636, "total_steps": 3564, "loss": 0.7165415287017822, "lr": 5.143067278380339e-07, "epoch": 2.218855218855219, "percentage": 73.96, "elapsed_time": "1:16:59", "remaining_time": "0:27:06"} | |
| {"current_steps": 2638, "total_steps": 3564, "loss": 1.0603926181793213, "lr": 5.13039318915663e-07, "epoch": 2.2205387205387206, "percentage": 74.02, "elapsed_time": "1:17:02", "remaining_time": "0:27:02"} | |
| {"current_steps": 2640, "total_steps": 3564, "loss": 0.997651219367981, "lr": 5.117739323198067e-07, "epoch": 2.2222222222222223, "percentage": 74.07, "elapsed_time": "1:17:06", "remaining_time": "0:26:59"} | |
| {"current_steps": 2642, "total_steps": 3564, "loss": 0.6530795097351074, "lr": 5.105105724102547e-07, "epoch": 2.223905723905724, "percentage": 74.13, "elapsed_time": "1:17:09", "remaining_time": "0:26:55"} | |
| {"current_steps": 2644, "total_steps": 3564, "loss": 0.6192750930786133, "lr": 5.092492435398137e-07, "epoch": 2.225589225589226, "percentage": 74.19, "elapsed_time": "1:17:12", "remaining_time": "0:26:51"} | |
| {"current_steps": 2646, "total_steps": 3564, "loss": 0.5436962246894836, "lr": 5.079899500542917e-07, "epoch": 2.227272727272727, "percentage": 74.24, "elapsed_time": "1:17:15", "remaining_time": "0:26:48"} | |
| {"current_steps": 2648, "total_steps": 3564, "loss": 0.2577816843986511, "lr": 5.067326962924848e-07, "epoch": 2.228956228956229, "percentage": 74.3, "elapsed_time": "1:17:18", "remaining_time": "0:26:44"} | |
| {"current_steps": 2650, "total_steps": 3564, "loss": 0.9602568745613098, "lr": 5.054774865861617e-07, "epoch": 2.2306397306397305, "percentage": 74.35, "elapsed_time": "1:17:22", "remaining_time": "0:26:41"} | |
| {"current_steps": 2652, "total_steps": 3564, "loss": 0.5225367546081543, "lr": 5.042243252600475e-07, "epoch": 2.2323232323232323, "percentage": 74.41, "elapsed_time": "1:17:25", "remaining_time": "0:26:37"} | |
| {"current_steps": 2654, "total_steps": 3564, "loss": 0.47632715106010437, "lr": 5.029732166318106e-07, "epoch": 2.234006734006734, "percentage": 74.47, "elapsed_time": "1:17:29", "remaining_time": "0:26:34"} | |
| {"current_steps": 2656, "total_steps": 3564, "loss": 0.5418964624404907, "lr": 5.017241650120462e-07, "epoch": 2.2356902356902357, "percentage": 74.52, "elapsed_time": "1:17:32", "remaining_time": "0:26:30"} | |
| {"current_steps": 2658, "total_steps": 3564, "loss": 0.8024328351020813, "lr": 5.004771747042631e-07, "epoch": 2.2373737373737375, "percentage": 74.58, "elapsed_time": "1:17:36", "remaining_time": "0:26:27"} | |
| {"current_steps": 2660, "total_steps": 3564, "loss": 0.5871691703796387, "lr": 4.992322500048673e-07, "epoch": 2.239057239057239, "percentage": 74.64, "elapsed_time": "1:17:39", "remaining_time": "0:26:23"} | |
| {"current_steps": 2662, "total_steps": 3564, "loss": 0.7337244153022766, "lr": 4.979893952031483e-07, "epoch": 2.240740740740741, "percentage": 74.69, "elapsed_time": "1:17:43", "remaining_time": "0:26:20"} | |
| {"current_steps": 2664, "total_steps": 3564, "loss": 0.3517826795578003, "lr": 4.96748614581264e-07, "epoch": 2.242424242424242, "percentage": 74.75, "elapsed_time": "1:17:46", "remaining_time": "0:26:16"} | |
| {"current_steps": 2666, "total_steps": 3564, "loss": 0.7348419427871704, "lr": 4.955099124142251e-07, "epoch": 2.244107744107744, "percentage": 74.8, "elapsed_time": "1:17:49", "remaining_time": "0:26:12"} | |
| {"current_steps": 2668, "total_steps": 3564, "loss": 0.5416382551193237, "lr": 4.942732929698827e-07, "epoch": 2.2457912457912457, "percentage": 74.86, "elapsed_time": "1:17:53", "remaining_time": "0:26:09"} | |
| {"current_steps": 2670, "total_steps": 3564, "loss": 0.44201749563217163, "lr": 4.930387605089104e-07, "epoch": 2.2474747474747474, "percentage": 74.92, "elapsed_time": "1:17:56", "remaining_time": "0:26:05"} | |
| {"current_steps": 2672, "total_steps": 3564, "loss": 0.34817391633987427, "lr": 4.918063192847921e-07, "epoch": 2.249158249158249, "percentage": 74.97, "elapsed_time": "1:17:59", "remaining_time": "0:26:02"} | |
| {"current_steps": 2674, "total_steps": 3564, "loss": 0.6200217008590698, "lr": 4.905759735438068e-07, "epoch": 2.250841750841751, "percentage": 75.03, "elapsed_time": "1:18:03", "remaining_time": "0:25:58"} | |
| {"current_steps": 2676, "total_steps": 3564, "loss": 0.7119044065475464, "lr": 4.893477275250127e-07, "epoch": 2.2525252525252526, "percentage": 75.08, "elapsed_time": "1:18:06", "remaining_time": "0:25:55"} | |
| {"current_steps": 2678, "total_steps": 3564, "loss": 0.4421549141407013, "lr": 4.881215854602342e-07, "epoch": 2.2542087542087543, "percentage": 75.14, "elapsed_time": "1:18:09", "remaining_time": "0:25:51"} | |
| {"current_steps": 2680, "total_steps": 3564, "loss": 0.835530161857605, "lr": 4.868975515740471e-07, "epoch": 2.255892255892256, "percentage": 75.2, "elapsed_time": "1:18:13", "remaining_time": "0:25:48"} | |
| {"current_steps": 2682, "total_steps": 3564, "loss": 0.19798390567302704, "lr": 4.856756300837625e-07, "epoch": 2.257575757575758, "percentage": 75.25, "elapsed_time": "1:18:16", "remaining_time": "0:25:44"} | |
| {"current_steps": 2684, "total_steps": 3564, "loss": 0.1048535406589508, "lr": 4.844558251994146e-07, "epoch": 2.259259259259259, "percentage": 75.31, "elapsed_time": "1:18:19", "remaining_time": "0:25:40"} | |
| {"current_steps": 2686, "total_steps": 3564, "loss": 0.604271650314331, "lr": 4.832381411237444e-07, "epoch": 2.260942760942761, "percentage": 75.36, "elapsed_time": "1:18:23", "remaining_time": "0:25:37"} | |
| {"current_steps": 2688, "total_steps": 3564, "loss": 0.36290663480758667, "lr": 4.820225820521855e-07, "epoch": 2.2626262626262625, "percentage": 75.42, "elapsed_time": "1:18:26", "remaining_time": "0:25:33"} | |
| {"current_steps": 2690, "total_steps": 3564, "loss": 0.8970327377319336, "lr": 4.808091521728506e-07, "epoch": 2.2643097643097643, "percentage": 75.48, "elapsed_time": "1:18:30", "remaining_time": "0:25:30"} | |
| {"current_steps": 2692, "total_steps": 3564, "loss": 0.8129058480262756, "lr": 4.795978556665165e-07, "epoch": 2.265993265993266, "percentage": 75.53, "elapsed_time": "1:18:33", "remaining_time": "0:25:26"} | |
| {"current_steps": 2694, "total_steps": 3564, "loss": 0.653793454170227, "lr": 4.783886967066088e-07, "epoch": 2.2676767676767677, "percentage": 75.59, "elapsed_time": "1:18:37", "remaining_time": "0:25:23"} | |
| {"current_steps": 2696, "total_steps": 3564, "loss": 0.5345746874809265, "lr": 4.77181679459189e-07, "epoch": 2.2693602693602695, "percentage": 75.65, "elapsed_time": "1:18:40", "remaining_time": "0:25:19"} | |
| {"current_steps": 2698, "total_steps": 3564, "loss": 0.638217568397522, "lr": 4.759768080829399e-07, "epoch": 2.271043771043771, "percentage": 75.7, "elapsed_time": "1:18:44", "remaining_time": "0:25:16"} | |
| {"current_steps": 2700, "total_steps": 3564, "loss": 0.7549663782119751, "lr": 4.747740867291497e-07, "epoch": 2.2727272727272725, "percentage": 75.76, "elapsed_time": "1:18:47", "remaining_time": "0:25:12"} | |
| {"current_steps": 2702, "total_steps": 3564, "loss": 0.5037040114402771, "lr": 4.7357351954169973e-07, "epoch": 2.274410774410774, "percentage": 75.81, "elapsed_time": "1:18:50", "remaining_time": "0:25:09"} | |
| {"current_steps": 2704, "total_steps": 3564, "loss": 0.8505884408950806, "lr": 4.7237511065704933e-07, "epoch": 2.276094276094276, "percentage": 75.87, "elapsed_time": "1:18:54", "remaining_time": "0:25:05"} | |
| {"current_steps": 2706, "total_steps": 3564, "loss": 0.9292435050010681, "lr": 4.7117886420422094e-07, "epoch": 2.2777777777777777, "percentage": 75.93, "elapsed_time": "1:18:57", "remaining_time": "0:25:02"} | |
| {"current_steps": 2708, "total_steps": 3564, "loss": 0.4456526041030884, "lr": 4.6998478430478714e-07, "epoch": 2.2794612794612794, "percentage": 75.98, "elapsed_time": "1:19:00", "remaining_time": "0:24:58"} | |
| {"current_steps": 2710, "total_steps": 3564, "loss": 0.49354496598243713, "lr": 4.6879287507285596e-07, "epoch": 2.281144781144781, "percentage": 76.04, "elapsed_time": "1:19:03", "remaining_time": "0:24:54"} | |
| {"current_steps": 2712, "total_steps": 3564, "loss": 0.517022430896759, "lr": 4.676031406150555e-07, "epoch": 2.282828282828283, "percentage": 76.09, "elapsed_time": "1:19:07", "remaining_time": "0:24:51"} | |
| {"current_steps": 2714, "total_steps": 3564, "loss": 0.42631667852401733, "lr": 4.66415585030522e-07, "epoch": 2.2845117845117846, "percentage": 76.15, "elapsed_time": "1:19:10", "remaining_time": "0:24:47"} | |
| {"current_steps": 2716, "total_steps": 3564, "loss": 0.7113944292068481, "lr": 4.6523021241088416e-07, "epoch": 2.2861952861952863, "percentage": 76.21, "elapsed_time": "1:19:14", "remaining_time": "0:24:44"} | |
| {"current_steps": 2718, "total_steps": 3564, "loss": 0.5162969827651978, "lr": 4.6404702684024905e-07, "epoch": 2.287878787878788, "percentage": 76.26, "elapsed_time": "1:19:17", "remaining_time": "0:24:40"} | |
| {"current_steps": 2720, "total_steps": 3564, "loss": 0.5146564841270447, "lr": 4.628660323951891e-07, "epoch": 2.28956228956229, "percentage": 76.32, "elapsed_time": "1:19:20", "remaining_time": "0:24:37"} | |
| {"current_steps": 2722, "total_steps": 3564, "loss": 0.6732128262519836, "lr": 4.616872331447272e-07, "epoch": 2.291245791245791, "percentage": 76.37, "elapsed_time": "1:19:24", "remaining_time": "0:24:33"} | |
| {"current_steps": 2724, "total_steps": 3564, "loss": 0.6910574436187744, "lr": 4.605106331503223e-07, "epoch": 2.292929292929293, "percentage": 76.43, "elapsed_time": "1:19:27", "remaining_time": "0:24:30"} | |
| {"current_steps": 2726, "total_steps": 3564, "loss": 0.6672347784042358, "lr": 4.5933623646585683e-07, "epoch": 2.2946127946127945, "percentage": 76.49, "elapsed_time": "1:19:31", "remaining_time": "0:24:26"} | |
| {"current_steps": 2728, "total_steps": 3564, "loss": 0.509329617023468, "lr": 4.581640471376215e-07, "epoch": 2.2962962962962963, "percentage": 76.54, "elapsed_time": "1:19:35", "remaining_time": "0:24:23"} | |
| {"current_steps": 2730, "total_steps": 3564, "loss": 0.9162227511405945, "lr": 4.5699406920430155e-07, "epoch": 2.297979797979798, "percentage": 76.6, "elapsed_time": "1:19:37", "remaining_time": "0:24:19"} | |
| {"current_steps": 2732, "total_steps": 3564, "loss": 0.46352601051330566, "lr": 4.5582630669696324e-07, "epoch": 2.2996632996632997, "percentage": 76.66, "elapsed_time": "1:19:41", "remaining_time": "0:24:16"} | |
| {"current_steps": 2734, "total_steps": 3564, "loss": 0.44609200954437256, "lr": 4.5466076363904e-07, "epoch": 2.3013468013468015, "percentage": 76.71, "elapsed_time": "1:19:45", "remaining_time": "0:24:12"} | |
| {"current_steps": 2736, "total_steps": 3564, "loss": 0.38603392243385315, "lr": 4.5349744404631785e-07, "epoch": 2.303030303030303, "percentage": 76.77, "elapsed_time": "1:19:48", "remaining_time": "0:24:09"} | |
| {"current_steps": 2738, "total_steps": 3564, "loss": 0.5370512008666992, "lr": 4.5233635192692206e-07, "epoch": 2.3047138047138045, "percentage": 76.82, "elapsed_time": "1:19:52", "remaining_time": "0:24:05"} | |
| {"current_steps": 2740, "total_steps": 3564, "loss": 0.35465237498283386, "lr": 4.511774912813043e-07, "epoch": 2.3063973063973062, "percentage": 76.88, "elapsed_time": "1:19:56", "remaining_time": "0:24:02"} | |
| {"current_steps": 2742, "total_steps": 3564, "loss": 0.7493946552276611, "lr": 4.5002086610222626e-07, "epoch": 2.308080808080808, "percentage": 76.94, "elapsed_time": "1:19:59", "remaining_time": "0:23:58"} | |
| {"current_steps": 2744, "total_steps": 3564, "loss": 0.7291615009307861, "lr": 4.488664803747487e-07, "epoch": 2.3097643097643097, "percentage": 76.99, "elapsed_time": "1:20:03", "remaining_time": "0:23:55"} | |
| {"current_steps": 2746, "total_steps": 3564, "loss": 0.8265661001205444, "lr": 4.4771433807621644e-07, "epoch": 2.3114478114478114, "percentage": 77.05, "elapsed_time": "1:20:06", "remaining_time": "0:23:51"} | |
| {"current_steps": 2748, "total_steps": 3564, "loss": 0.6443151831626892, "lr": 4.4656444317624397e-07, "epoch": 2.313131313131313, "percentage": 77.1, "elapsed_time": "1:20:10", "remaining_time": "0:23:48"} | |
| {"current_steps": 2750, "total_steps": 3564, "loss": 0.0978798121213913, "lr": 4.454167996367032e-07, "epoch": 2.314814814814815, "percentage": 77.16, "elapsed_time": "1:20:12", "remaining_time": "0:23:44"} | |
| {"current_steps": 2752, "total_steps": 3564, "loss": 0.2580530345439911, "lr": 4.442714114117092e-07, "epoch": 2.3164983164983166, "percentage": 77.22, "elapsed_time": "1:20:16", "remaining_time": "0:23:41"} | |
| {"current_steps": 2754, "total_steps": 3564, "loss": 0.46834707260131836, "lr": 4.4312828244760613e-07, "epoch": 2.3181818181818183, "percentage": 77.27, "elapsed_time": "1:20:19", "remaining_time": "0:23:37"} | |
| {"current_steps": 2756, "total_steps": 3564, "loss": 0.900390625, "lr": 4.4198741668295425e-07, "epoch": 2.31986531986532, "percentage": 77.33, "elapsed_time": "1:20:23", "remaining_time": "0:23:34"} | |
| {"current_steps": 2758, "total_steps": 3564, "loss": 0.6006342172622681, "lr": 4.4084881804851644e-07, "epoch": 2.3215488215488214, "percentage": 77.38, "elapsed_time": "1:20:26", "remaining_time": "0:23:30"} | |
| {"current_steps": 2760, "total_steps": 3564, "loss": 0.7037711143493652, "lr": 4.397124904672437e-07, "epoch": 2.323232323232323, "percentage": 77.44, "elapsed_time": "1:20:30", "remaining_time": "0:23:27"} | |
| {"current_steps": 2762, "total_steps": 3564, "loss": 0.4606119990348816, "lr": 4.3857843785426263e-07, "epoch": 2.324915824915825, "percentage": 77.5, "elapsed_time": "1:20:32", "remaining_time": "0:23:23"} | |
| {"current_steps": 2764, "total_steps": 3564, "loss": 0.9028510451316833, "lr": 4.374466641168622e-07, "epoch": 2.3265993265993266, "percentage": 77.55, "elapsed_time": "1:20:35", "remaining_time": "0:23:19"} | |
| {"current_steps": 2766, "total_steps": 3564, "loss": 0.6837437152862549, "lr": 4.363171731544786e-07, "epoch": 2.3282828282828283, "percentage": 77.61, "elapsed_time": "1:20:38", "remaining_time": "0:23:16"} | |
| {"current_steps": 2768, "total_steps": 3564, "loss": 0.5506434440612793, "lr": 4.351899688586834e-07, "epoch": 2.32996632996633, "percentage": 77.67, "elapsed_time": "1:20:42", "remaining_time": "0:23:12"} | |
| {"current_steps": 2770, "total_steps": 3564, "loss": 0.6231704354286194, "lr": 4.3406505511317025e-07, "epoch": 2.3316498316498318, "percentage": 77.72, "elapsed_time": "1:20:46", "remaining_time": "0:23:09"} | |
| {"current_steps": 2772, "total_steps": 3564, "loss": 0.5775326490402222, "lr": 4.329424357937397e-07, "epoch": 2.3333333333333335, "percentage": 77.78, "elapsed_time": "1:20:49", "remaining_time": "0:23:05"} | |
| {"current_steps": 2774, "total_steps": 3564, "loss": 0.6728795766830444, "lr": 4.318221147682879e-07, "epoch": 2.3350168350168348, "percentage": 77.83, "elapsed_time": "1:20:53", "remaining_time": "0:23:02"} | |
| {"current_steps": 2776, "total_steps": 3564, "loss": 0.7195960879325867, "lr": 4.307040958967924e-07, "epoch": 2.3367003367003365, "percentage": 77.89, "elapsed_time": "1:20:56", "remaining_time": "0:22:58"} | |
| {"current_steps": 2778, "total_steps": 3564, "loss": 0.3605208098888397, "lr": 4.2958838303129817e-07, "epoch": 2.3383838383838382, "percentage": 77.95, "elapsed_time": "1:20:59", "remaining_time": "0:22:54"} | |
| {"current_steps": 2780, "total_steps": 3564, "loss": 0.6560809016227722, "lr": 4.2847498001590573e-07, "epoch": 2.34006734006734, "percentage": 78.0, "elapsed_time": "1:21:03", "remaining_time": "0:22:51"} | |
| {"current_steps": 2782, "total_steps": 3564, "loss": 0.5723754167556763, "lr": 4.273638906867573e-07, "epoch": 2.3417508417508417, "percentage": 78.06, "elapsed_time": "1:21:06", "remaining_time": "0:22:47"} | |
| {"current_steps": 2784, "total_steps": 3564, "loss": 0.786733090877533, "lr": 4.2625511887202225e-07, "epoch": 2.3434343434343434, "percentage": 78.11, "elapsed_time": "1:21:09", "remaining_time": "0:22:44"} | |
| {"current_steps": 2786, "total_steps": 3564, "loss": 0.5187538862228394, "lr": 4.2514866839188657e-07, "epoch": 2.345117845117845, "percentage": 78.17, "elapsed_time": "1:21:13", "remaining_time": "0:22:40"} | |
| {"current_steps": 2788, "total_steps": 3564, "loss": 0.9200822114944458, "lr": 4.2404454305853796e-07, "epoch": 2.346801346801347, "percentage": 78.23, "elapsed_time": "1:21:16", "remaining_time": "0:22:37"} | |
| {"current_steps": 2790, "total_steps": 3564, "loss": 0.7082578539848328, "lr": 4.229427466761522e-07, "epoch": 2.3484848484848486, "percentage": 78.28, "elapsed_time": "1:21:20", "remaining_time": "0:22:33"} | |
| {"current_steps": 2792, "total_steps": 3564, "loss": 0.5452355146408081, "lr": 4.2184328304088164e-07, "epoch": 2.3501683501683504, "percentage": 78.34, "elapsed_time": "1:21:24", "remaining_time": "0:22:30"} | |
| {"current_steps": 2794, "total_steps": 3564, "loss": 0.5780555009841919, "lr": 4.2074615594084146e-07, "epoch": 2.351851851851852, "percentage": 78.4, "elapsed_time": "1:21:27", "remaining_time": "0:22:26"} | |
| {"current_steps": 2796, "total_steps": 3564, "loss": 0.9775782823562622, "lr": 4.1965136915609543e-07, "epoch": 2.3535353535353534, "percentage": 78.45, "elapsed_time": "1:21:30", "remaining_time": "0:22:23"} | |
| {"current_steps": 2798, "total_steps": 3564, "loss": 0.4702543616294861, "lr": 4.1855892645864513e-07, "epoch": 2.355218855218855, "percentage": 78.51, "elapsed_time": "1:21:34", "remaining_time": "0:22:19"} | |
| {"current_steps": 2800, "total_steps": 3564, "loss": 1.041868805885315, "lr": 4.1746883161241555e-07, "epoch": 2.356902356902357, "percentage": 78.56, "elapsed_time": "1:21:38", "remaining_time": "0:22:16"} | |
| {"current_steps": 2802, "total_steps": 3564, "loss": 0.8972384333610535, "lr": 4.1638108837324137e-07, "epoch": 2.3585858585858586, "percentage": 78.62, "elapsed_time": "1:21:42", "remaining_time": "0:22:13"} | |
| {"current_steps": 2804, "total_steps": 3564, "loss": 0.8051435947418213, "lr": 4.152957004888563e-07, "epoch": 2.3602693602693603, "percentage": 78.68, "elapsed_time": "1:21:45", "remaining_time": "0:22:09"} | |
| {"current_steps": 2806, "total_steps": 3564, "loss": 0.805417001247406, "lr": 4.142126716988784e-07, "epoch": 2.361952861952862, "percentage": 78.73, "elapsed_time": "1:21:49", "remaining_time": "0:22:06"} | |
| {"current_steps": 2808, "total_steps": 3564, "loss": 0.7631466388702393, "lr": 4.131320057347969e-07, "epoch": 2.3636363636363638, "percentage": 78.79, "elapsed_time": "1:21:52", "remaining_time": "0:22:02"} | |
| {"current_steps": 2810, "total_steps": 3564, "loss": 0.9656248688697815, "lr": 4.120537063199612e-07, "epoch": 2.3653198653198655, "percentage": 78.84, "elapsed_time": "1:21:56", "remaining_time": "0:21:59"} | |
| {"current_steps": 2812, "total_steps": 3564, "loss": 0.6510505676269531, "lr": 4.109777771695663e-07, "epoch": 2.3670033670033668, "percentage": 78.9, "elapsed_time": "1:21:59", "remaining_time": "0:21:55"} | |
| {"current_steps": 2814, "total_steps": 3564, "loss": 0.5992385745048523, "lr": 4.0990422199064103e-07, "epoch": 2.3686868686868685, "percentage": 78.96, "elapsed_time": "1:22:03", "remaining_time": "0:21:52"} | |
| {"current_steps": 2816, "total_steps": 3564, "loss": 0.6755191087722778, "lr": 4.0883304448203477e-07, "epoch": 2.3703703703703702, "percentage": 79.01, "elapsed_time": "1:22:05", "remaining_time": "0:21:48"} | |
| {"current_steps": 2818, "total_steps": 3564, "loss": 0.6416581869125366, "lr": 4.077642483344044e-07, "epoch": 2.372053872053872, "percentage": 79.07, "elapsed_time": "1:22:08", "remaining_time": "0:21:44"} | |
| {"current_steps": 2820, "total_steps": 3564, "loss": 0.7114299535751343, "lr": 4.066978372302025e-07, "epoch": 2.3737373737373737, "percentage": 79.12, "elapsed_time": "1:22:12", "remaining_time": "0:21:41"} | |
| {"current_steps": 2822, "total_steps": 3564, "loss": 0.38672173023223877, "lr": 4.056338148436643e-07, "epoch": 2.3754208754208754, "percentage": 79.18, "elapsed_time": "1:22:15", "remaining_time": "0:21:37"} | |
| {"current_steps": 2824, "total_steps": 3564, "loss": 0.9695321321487427, "lr": 4.0457218484079414e-07, "epoch": 2.377104377104377, "percentage": 79.24, "elapsed_time": "1:22:18", "remaining_time": "0:21:34"} | |
| {"current_steps": 2826, "total_steps": 3564, "loss": 0.899653971195221, "lr": 4.035129508793542e-07, "epoch": 2.378787878787879, "percentage": 79.29, "elapsed_time": "1:22:21", "remaining_time": "0:21:30"} | |
| {"current_steps": 2828, "total_steps": 3564, "loss": 0.4069860577583313, "lr": 4.024561166088516e-07, "epoch": 2.3804713804713806, "percentage": 79.35, "elapsed_time": "1:22:25", "remaining_time": "0:21:27"} | |
| {"current_steps": 2830, "total_steps": 3564, "loss": 0.90252685546875, "lr": 4.0140168567052447e-07, "epoch": 2.3821548821548824, "percentage": 79.41, "elapsed_time": "1:22:28", "remaining_time": "0:21:23"} | |
| {"current_steps": 2832, "total_steps": 3564, "loss": 0.6742314100265503, "lr": 4.003496616973312e-07, "epoch": 2.3838383838383836, "percentage": 79.46, "elapsed_time": "1:22:31", "remaining_time": "0:21:19"} | |
| {"current_steps": 2834, "total_steps": 3564, "loss": 0.5178687572479248, "lr": 3.9930004831393757e-07, "epoch": 2.3855218855218854, "percentage": 79.52, "elapsed_time": "1:22:35", "remaining_time": "0:21:16"} | |
| {"current_steps": 2836, "total_steps": 3564, "loss": 0.5686367154121399, "lr": 3.982528491367025e-07, "epoch": 2.387205387205387, "percentage": 79.57, "elapsed_time": "1:22:39", "remaining_time": "0:21:12"} | |
| {"current_steps": 2838, "total_steps": 3564, "loss": 0.4284480810165405, "lr": 3.9720806777366817e-07, "epoch": 2.388888888888889, "percentage": 79.63, "elapsed_time": "1:22:42", "remaining_time": "0:21:09"} | |
| {"current_steps": 2840, "total_steps": 3564, "loss": 0.7795579433441162, "lr": 3.961657078245462e-07, "epoch": 2.3905723905723906, "percentage": 79.69, "elapsed_time": "1:22:45", "remaining_time": "0:21:05"} | |
| {"current_steps": 2842, "total_steps": 3564, "loss": 0.3763793110847473, "lr": 3.9512577288070487e-07, "epoch": 2.3922558922558923, "percentage": 79.74, "elapsed_time": "1:22:48", "remaining_time": "0:21:02"} | |
| {"current_steps": 2844, "total_steps": 3564, "loss": 0.9840795993804932, "lr": 3.940882665251576e-07, "epoch": 2.393939393939394, "percentage": 79.8, "elapsed_time": "1:22:52", "remaining_time": "0:20:58"} | |
| {"current_steps": 2846, "total_steps": 3564, "loss": 0.7532452344894409, "lr": 3.930531923325506e-07, "epoch": 2.3956228956228958, "percentage": 79.85, "elapsed_time": "1:22:55", "remaining_time": "0:20:55"} | |
| {"current_steps": 2848, "total_steps": 3564, "loss": 0.9117331504821777, "lr": 3.920205538691497e-07, "epoch": 2.3973063973063975, "percentage": 79.91, "elapsed_time": "1:22:58", "remaining_time": "0:20:51"} | |
| {"current_steps": 2850, "total_steps": 3564, "loss": 0.7445226907730103, "lr": 3.9099035469282906e-07, "epoch": 2.398989898989899, "percentage": 79.97, "elapsed_time": "1:23:01", "remaining_time": "0:20:48"} | |
| {"current_steps": 2852, "total_steps": 3564, "loss": 0.3813757598400116, "lr": 3.8996259835305835e-07, "epoch": 2.4006734006734005, "percentage": 80.02, "elapsed_time": "1:23:05", "remaining_time": "0:20:44"} | |
| {"current_steps": 2854, "total_steps": 3564, "loss": 0.589090883731842, "lr": 3.8893728839089035e-07, "epoch": 2.4023569023569022, "percentage": 80.08, "elapsed_time": "1:23:09", "remaining_time": "0:20:41"} | |
| {"current_steps": 2856, "total_steps": 3564, "loss": 0.5158854126930237, "lr": 3.879144283389495e-07, "epoch": 2.404040404040404, "percentage": 80.13, "elapsed_time": "1:23:12", "remaining_time": "0:20:37"} | |
| {"current_steps": 2858, "total_steps": 3564, "loss": 0.6101418733596802, "lr": 3.8689402172141915e-07, "epoch": 2.4057239057239057, "percentage": 80.19, "elapsed_time": "1:23:16", "remaining_time": "0:20:34"} | |
| {"current_steps": 2860, "total_steps": 3564, "loss": 0.3425447642803192, "lr": 3.8587607205402916e-07, "epoch": 2.4074074074074074, "percentage": 80.25, "elapsed_time": "1:23:20", "remaining_time": "0:20:30"} | |
| {"current_steps": 2862, "total_steps": 3564, "loss": 0.7518799901008606, "lr": 3.848605828440444e-07, "epoch": 2.409090909090909, "percentage": 80.3, "elapsed_time": "1:23:24", "remaining_time": "0:20:27"} | |
| {"current_steps": 2864, "total_steps": 3564, "loss": 0.4169810712337494, "lr": 3.8384755759025313e-07, "epoch": 2.410774410774411, "percentage": 80.36, "elapsed_time": "1:23:27", "remaining_time": "0:20:23"} | |
| {"current_steps": 2866, "total_steps": 3564, "loss": 0.6622034907341003, "lr": 3.828369997829528e-07, "epoch": 2.4124579124579126, "percentage": 80.42, "elapsed_time": "1:23:30", "remaining_time": "0:20:20"} | |
| {"current_steps": 2868, "total_steps": 3564, "loss": 0.7845497131347656, "lr": 3.818289129039405e-07, "epoch": 2.4141414141414144, "percentage": 80.47, "elapsed_time": "1:23:34", "remaining_time": "0:20:16"} | |
| {"current_steps": 2870, "total_steps": 3564, "loss": 0.5676144361495972, "lr": 3.808233004264997e-07, "epoch": 2.4158249158249157, "percentage": 80.53, "elapsed_time": "1:23:37", "remaining_time": "0:20:13"} | |
| {"current_steps": 2872, "total_steps": 3564, "loss": 0.4738210439682007, "lr": 3.79820165815389e-07, "epoch": 2.4175084175084174, "percentage": 80.58, "elapsed_time": "1:23:40", "remaining_time": "0:20:09"} | |
| {"current_steps": 2874, "total_steps": 3564, "loss": 0.8427296876907349, "lr": 3.788195125268284e-07, "epoch": 2.419191919191919, "percentage": 80.64, "elapsed_time": "1:23:43", "remaining_time": "0:20:06"} | |
| {"current_steps": 2876, "total_steps": 3564, "loss": 0.7298943996429443, "lr": 3.7782134400848995e-07, "epoch": 2.420875420875421, "percentage": 80.7, "elapsed_time": "1:23:47", "remaining_time": "0:20:02"} | |
| {"current_steps": 2878, "total_steps": 3564, "loss": 0.4356338381767273, "lr": 3.768256636994843e-07, "epoch": 2.4225589225589226, "percentage": 80.75, "elapsed_time": "1:23:50", "remaining_time": "0:19:59"} | |
| {"current_steps": 2880, "total_steps": 3564, "loss": 0.7260875701904297, "lr": 3.7583247503034864e-07, "epoch": 2.4242424242424243, "percentage": 80.81, "elapsed_time": "1:23:54", "remaining_time": "0:19:55"} | |
| {"current_steps": 2882, "total_steps": 3564, "loss": 0.5450549721717834, "lr": 3.7484178142303625e-07, "epoch": 2.425925925925926, "percentage": 80.86, "elapsed_time": "1:23:58", "remaining_time": "0:19:52"} | |
| {"current_steps": 2884, "total_steps": 3564, "loss": 0.4824645519256592, "lr": 3.738535862909031e-07, "epoch": 2.4276094276094278, "percentage": 80.92, "elapsed_time": "1:24:01", "remaining_time": "0:19:48"} | |
| {"current_steps": 2886, "total_steps": 3564, "loss": 0.4984836280345917, "lr": 3.7286789303869735e-07, "epoch": 2.429292929292929, "percentage": 80.98, "elapsed_time": "1:24:04", "remaining_time": "0:19:45"} | |
| {"current_steps": 2888, "total_steps": 3564, "loss": 0.6126713156700134, "lr": 3.7188470506254744e-07, "epoch": 2.430976430976431, "percentage": 81.03, "elapsed_time": "1:24:08", "remaining_time": "0:19:41"} | |
| {"current_steps": 2890, "total_steps": 3564, "loss": 0.5302858352661133, "lr": 3.7090402574994885e-07, "epoch": 2.4326599326599325, "percentage": 81.09, "elapsed_time": "1:24:11", "remaining_time": "0:19:38"} | |
| {"current_steps": 2892, "total_steps": 3564, "loss": 0.5883275270462036, "lr": 3.699258584797548e-07, "epoch": 2.4343434343434343, "percentage": 81.14, "elapsed_time": "1:24:14", "remaining_time": "0:19:34"} | |
| {"current_steps": 2894, "total_steps": 3564, "loss": 0.8630578517913818, "lr": 3.6895020662216326e-07, "epoch": 2.436026936026936, "percentage": 81.2, "elapsed_time": "1:24:18", "remaining_time": "0:19:31"} | |
| {"current_steps": 2896, "total_steps": 3564, "loss": 0.720264732837677, "lr": 3.679770735387052e-07, "epoch": 2.4377104377104377, "percentage": 81.26, "elapsed_time": "1:24:22", "remaining_time": "0:19:27"} | |
| {"current_steps": 2898, "total_steps": 3564, "loss": 0.6094503998756409, "lr": 3.6700646258223343e-07, "epoch": 2.4393939393939394, "percentage": 81.31, "elapsed_time": "1:24:25", "remaining_time": "0:19:24"} | |
| {"current_steps": 2900, "total_steps": 3564, "loss": 0.40544137358665466, "lr": 3.6603837709691153e-07, "epoch": 2.441077441077441, "percentage": 81.37, "elapsed_time": "1:24:29", "remaining_time": "0:19:20"} | |
| {"current_steps": 2902, "total_steps": 3564, "loss": 0.8314005136489868, "lr": 3.6507282041820085e-07, "epoch": 2.442760942760943, "percentage": 81.43, "elapsed_time": "1:24:32", "remaining_time": "0:19:17"} | |
| {"current_steps": 2904, "total_steps": 3564, "loss": 0.49147939682006836, "lr": 3.641097958728506e-07, "epoch": 2.4444444444444446, "percentage": 81.48, "elapsed_time": "1:24:36", "remaining_time": "0:19:13"} | |
| {"current_steps": 2906, "total_steps": 3564, "loss": 0.34731265902519226, "lr": 3.631493067788858e-07, "epoch": 2.4461279461279464, "percentage": 81.54, "elapsed_time": "1:24:39", "remaining_time": "0:19:10"} | |
| {"current_steps": 2908, "total_steps": 3564, "loss": 0.5173161029815674, "lr": 3.6219135644559506e-07, "epoch": 2.4478114478114477, "percentage": 81.59, "elapsed_time": "1:24:43", "remaining_time": "0:19:06"} | |
| {"current_steps": 2910, "total_steps": 3564, "loss": 0.6695667505264282, "lr": 3.6123594817352046e-07, "epoch": 2.4494949494949494, "percentage": 81.65, "elapsed_time": "1:24:46", "remaining_time": "0:19:03"} | |
| {"current_steps": 2912, "total_steps": 3564, "loss": 0.4327901005744934, "lr": 3.602830852544458e-07, "epoch": 2.451178451178451, "percentage": 81.71, "elapsed_time": "1:24:50", "remaining_time": "0:18:59"} | |
| {"current_steps": 2914, "total_steps": 3564, "loss": 0.7913680672645569, "lr": 3.593327709713844e-07, "epoch": 2.452861952861953, "percentage": 81.76, "elapsed_time": "1:24:54", "remaining_time": "0:18:56"} | |
| {"current_steps": 2916, "total_steps": 3564, "loss": 0.6534749865531921, "lr": 3.5838500859856893e-07, "epoch": 2.4545454545454546, "percentage": 81.82, "elapsed_time": "1:24:57", "remaining_time": "0:18:52"} | |
| {"current_steps": 2918, "total_steps": 3564, "loss": 0.19182810187339783, "lr": 3.5743980140143975e-07, "epoch": 2.4562289562289563, "percentage": 81.87, "elapsed_time": "1:25:00", "remaining_time": "0:18:49"} | |
| {"current_steps": 2920, "total_steps": 3564, "loss": 0.8050523996353149, "lr": 3.5649715263663297e-07, "epoch": 2.457912457912458, "percentage": 81.93, "elapsed_time": "1:25:03", "remaining_time": "0:18:45"} | |
| {"current_steps": 2922, "total_steps": 3564, "loss": 0.3782300353050232, "lr": 3.5555706555197043e-07, "epoch": 2.45959595959596, "percentage": 81.99, "elapsed_time": "1:25:07", "remaining_time": "0:18:42"} | |
| {"current_steps": 2924, "total_steps": 3564, "loss": 0.316059410572052, "lr": 3.5461954338644795e-07, "epoch": 2.461279461279461, "percentage": 82.04, "elapsed_time": "1:25:10", "remaining_time": "0:18:38"} | |
| {"current_steps": 2926, "total_steps": 3564, "loss": 0.5723974704742432, "lr": 3.536845893702234e-07, "epoch": 2.462962962962963, "percentage": 82.1, "elapsed_time": "1:25:13", "remaining_time": "0:18:35"} | |
| {"current_steps": 2928, "total_steps": 3564, "loss": 0.5091125965118408, "lr": 3.527522067246068e-07, "epoch": 2.4646464646464645, "percentage": 82.15, "elapsed_time": "1:25:17", "remaining_time": "0:18:31"} | |
| {"current_steps": 2930, "total_steps": 3564, "loss": 0.3073745667934418, "lr": 3.518223986620491e-07, "epoch": 2.4663299663299663, "percentage": 82.21, "elapsed_time": "1:25:20", "remaining_time": "0:18:27"} | |
| {"current_steps": 2932, "total_steps": 3564, "loss": 0.6242831945419312, "lr": 3.5089516838612986e-07, "epoch": 2.468013468013468, "percentage": 82.27, "elapsed_time": "1:25:23", "remaining_time": "0:18:24"} | |
| {"current_steps": 2934, "total_steps": 3564, "loss": 0.627583384513855, "lr": 3.499705190915476e-07, "epoch": 2.4696969696969697, "percentage": 82.32, "elapsed_time": "1:25:27", "remaining_time": "0:18:20"} | |
| {"current_steps": 2936, "total_steps": 3564, "loss": 0.43692106008529663, "lr": 3.4904845396410854e-07, "epoch": 2.4713804713804715, "percentage": 82.38, "elapsed_time": "1:25:30", "remaining_time": "0:18:17"} | |
| {"current_steps": 2938, "total_steps": 3564, "loss": 0.5572280883789062, "lr": 3.4812897618071445e-07, "epoch": 2.473063973063973, "percentage": 82.44, "elapsed_time": "1:25:33", "remaining_time": "0:18:13"} | |
| {"current_steps": 2940, "total_steps": 3564, "loss": 0.5607247352600098, "lr": 3.472120889093536e-07, "epoch": 2.474747474747475, "percentage": 82.49, "elapsed_time": "1:25:36", "remaining_time": "0:18:10"} | |
| {"current_steps": 2942, "total_steps": 3564, "loss": 0.3747951090335846, "lr": 3.462977953090884e-07, "epoch": 2.4764309764309766, "percentage": 82.55, "elapsed_time": "1:25:40", "remaining_time": "0:18:06"} | |
| {"current_steps": 2944, "total_steps": 3564, "loss": 0.43182575702667236, "lr": 3.453860985300446e-07, "epoch": 2.478114478114478, "percentage": 82.6, "elapsed_time": "1:25:43", "remaining_time": "0:18:03"} | |
| {"current_steps": 2946, "total_steps": 3564, "loss": 0.9047005772590637, "lr": 3.4447700171340164e-07, "epoch": 2.4797979797979797, "percentage": 82.66, "elapsed_time": "1:25:47", "remaining_time": "0:17:59"} | |
| {"current_steps": 2948, "total_steps": 3564, "loss": 0.938655436038971, "lr": 3.4357050799138053e-07, "epoch": 2.4814814814814814, "percentage": 82.72, "elapsed_time": "1:25:51", "remaining_time": "0:17:56"} | |
| {"current_steps": 2950, "total_steps": 3564, "loss": 1.013432502746582, "lr": 3.4266662048723337e-07, "epoch": 2.483164983164983, "percentage": 82.77, "elapsed_time": "1:25:55", "remaining_time": "0:17:52"} | |
| {"current_steps": 2952, "total_steps": 3564, "loss": 0.8985989093780518, "lr": 3.417653423152329e-07, "epoch": 2.484848484848485, "percentage": 82.83, "elapsed_time": "1:25:58", "remaining_time": "0:17:49"} | |
| {"current_steps": 2954, "total_steps": 3564, "loss": 0.5609415769577026, "lr": 3.4086667658066186e-07, "epoch": 2.4865319865319866, "percentage": 82.88, "elapsed_time": "1:26:01", "remaining_time": "0:17:45"} | |
| {"current_steps": 2956, "total_steps": 3564, "loss": 0.8369396924972534, "lr": 3.3997062637980167e-07, "epoch": 2.4882154882154883, "percentage": 82.94, "elapsed_time": "1:26:05", "remaining_time": "0:17:42"} | |
| {"current_steps": 2958, "total_steps": 3564, "loss": 0.5242006182670593, "lr": 3.390771947999224e-07, "epoch": 2.48989898989899, "percentage": 83.0, "elapsed_time": "1:26:09", "remaining_time": "0:17:38"} | |
| {"current_steps": 2960, "total_steps": 3564, "loss": 0.8243865370750427, "lr": 3.381863849192718e-07, "epoch": 2.4915824915824913, "percentage": 83.05, "elapsed_time": "1:26:12", "remaining_time": "0:17:35"} | |
| {"current_steps": 2962, "total_steps": 3564, "loss": 0.5058671832084656, "lr": 3.3729819980706444e-07, "epoch": 2.493265993265993, "percentage": 83.11, "elapsed_time": "1:26:15", "remaining_time": "0:17:31"} | |
| {"current_steps": 2964, "total_steps": 3564, "loss": 0.7412878274917603, "lr": 3.364126425234719e-07, "epoch": 2.494949494949495, "percentage": 83.16, "elapsed_time": "1:26:18", "remaining_time": "0:17:28"} | |
| {"current_steps": 2966, "total_steps": 3564, "loss": 0.5835074186325073, "lr": 3.3552971611961187e-07, "epoch": 2.4966329966329965, "percentage": 83.22, "elapsed_time": "1:26:21", "remaining_time": "0:17:24"} | |
| {"current_steps": 2968, "total_steps": 3564, "loss": 0.8192091584205627, "lr": 3.34649423637537e-07, "epoch": 2.4983164983164983, "percentage": 83.28, "elapsed_time": "1:26:25", "remaining_time": "0:17:21"} | |
| {"current_steps": 2970, "total_steps": 3564, "loss": 0.8428059816360474, "lr": 3.337717681102253e-07, "epoch": 2.5, "percentage": 83.33, "elapsed_time": "1:26:28", "remaining_time": "0:17:17"} | |
| {"current_steps": 2972, "total_steps": 3564, "loss": 0.39063435792922974, "lr": 3.328967525615697e-07, "epoch": 2.5016835016835017, "percentage": 83.39, "elapsed_time": "1:26:32", "remaining_time": "0:17:14"} | |
| {"current_steps": 2974, "total_steps": 3564, "loss": 0.47806400060653687, "lr": 3.3202438000636634e-07, "epoch": 2.5033670033670035, "percentage": 83.45, "elapsed_time": "1:26:36", "remaining_time": "0:17:10"} | |
| {"current_steps": 2976, "total_steps": 3564, "loss": 0.6802424788475037, "lr": 3.311546534503061e-07, "epoch": 2.505050505050505, "percentage": 83.5, "elapsed_time": "1:26:39", "remaining_time": "0:17:07"} | |
| {"current_steps": 2978, "total_steps": 3564, "loss": 0.38681331276893616, "lr": 3.3028757588996303e-07, "epoch": 2.506734006734007, "percentage": 83.56, "elapsed_time": "1:26:43", "remaining_time": "0:17:03"} | |
| {"current_steps": 2980, "total_steps": 3564, "loss": 0.7302665710449219, "lr": 3.294231503127839e-07, "epoch": 2.5084175084175087, "percentage": 83.61, "elapsed_time": "1:26:47", "remaining_time": "0:17:00"} | |
| {"current_steps": 2982, "total_steps": 3564, "loss": 0.7972818613052368, "lr": 3.2856137969707847e-07, "epoch": 2.51010101010101, "percentage": 83.67, "elapsed_time": "1:26:50", "remaining_time": "0:16:57"} | |
| {"current_steps": 2984, "total_steps": 3564, "loss": 0.39771410822868347, "lr": 3.277022670120095e-07, "epoch": 2.5117845117845117, "percentage": 83.73, "elapsed_time": "1:26:54", "remaining_time": "0:16:53"} | |
| {"current_steps": 2986, "total_steps": 3564, "loss": 0.7731115818023682, "lr": 3.268458152175813e-07, "epoch": 2.5134680134680134, "percentage": 83.78, "elapsed_time": "1:26:58", "remaining_time": "0:16:50"} | |
| {"current_steps": 2988, "total_steps": 3564, "loss": 0.5933781862258911, "lr": 3.2599202726463084e-07, "epoch": 2.515151515151515, "percentage": 83.84, "elapsed_time": "1:27:02", "remaining_time": "0:16:46"} | |
| {"current_steps": 2990, "total_steps": 3564, "loss": 0.09502522647380829, "lr": 3.2514090609481683e-07, "epoch": 2.516835016835017, "percentage": 83.89, "elapsed_time": "1:27:05", "remaining_time": "0:16:43"} | |
| {"current_steps": 2992, "total_steps": 3564, "loss": 0.8891875147819519, "lr": 3.2429245464060965e-07, "epoch": 2.5185185185185186, "percentage": 83.95, "elapsed_time": "1:27:09", "remaining_time": "0:16:39"} | |
| {"current_steps": 2994, "total_steps": 3564, "loss": 0.5735270977020264, "lr": 3.234466758252818e-07, "epoch": 2.5202020202020203, "percentage": 84.01, "elapsed_time": "1:27:13", "remaining_time": "0:16:36"} | |
| {"current_steps": 2996, "total_steps": 3564, "loss": 0.7090741395950317, "lr": 3.2260357256289715e-07, "epoch": 2.5218855218855216, "percentage": 84.06, "elapsed_time": "1:27:16", "remaining_time": "0:16:32"} | |
| {"current_steps": 2998, "total_steps": 3564, "loss": 0.5537684559822083, "lr": 3.217631477583009e-07, "epoch": 2.5235690235690234, "percentage": 84.12, "elapsed_time": "1:27:20", "remaining_time": "0:16:29"} | |
| {"current_steps": 3000, "total_steps": 3564, "loss": 0.5045433044433594, "lr": 3.2092540430711044e-07, "epoch": 2.525252525252525, "percentage": 84.18, "elapsed_time": "1:27:24", "remaining_time": "0:16:25"} | |
| {"current_steps": 3002, "total_steps": 3564, "loss": 0.4958549439907074, "lr": 3.200903450957044e-07, "epoch": 2.526936026936027, "percentage": 84.23, "elapsed_time": "1:27:27", "remaining_time": "0:16:22"} | |
| {"current_steps": 3004, "total_steps": 3564, "loss": 0.9713015556335449, "lr": 3.192579730012129e-07, "epoch": 2.5286195286195285, "percentage": 84.29, "elapsed_time": "1:27:31", "remaining_time": "0:16:18"} | |
| {"current_steps": 3006, "total_steps": 3564, "loss": 0.7774836421012878, "lr": 3.184282908915081e-07, "epoch": 2.5303030303030303, "percentage": 84.34, "elapsed_time": "1:27:35", "remaining_time": "0:16:15"} | |
| {"current_steps": 3008, "total_steps": 3564, "loss": 0.6949951648712158, "lr": 3.1760130162519427e-07, "epoch": 2.531986531986532, "percentage": 84.4, "elapsed_time": "1:27:37", "remaining_time": "0:16:11"} | |
| {"current_steps": 3010, "total_steps": 3564, "loss": 0.2635032832622528, "lr": 3.16777008051597e-07, "epoch": 2.5336700336700337, "percentage": 84.46, "elapsed_time": "1:27:41", "remaining_time": "0:16:08"} | |
| {"current_steps": 3012, "total_steps": 3564, "loss": 0.7169020771980286, "lr": 3.159554130107546e-07, "epoch": 2.5353535353535355, "percentage": 84.51, "elapsed_time": "1:27:44", "remaining_time": "0:16:04"} | |
| {"current_steps": 3014, "total_steps": 3564, "loss": 0.6434400677680969, "lr": 3.1513651933340797e-07, "epoch": 2.537037037037037, "percentage": 84.57, "elapsed_time": "1:27:48", "remaining_time": "0:16:01"} | |
| {"current_steps": 3016, "total_steps": 3564, "loss": 0.522533655166626, "lr": 3.143203298409899e-07, "epoch": 2.538720538720539, "percentage": 84.62, "elapsed_time": "1:27:51", "remaining_time": "0:15:57"} | |
| {"current_steps": 3018, "total_steps": 3564, "loss": 0.8724677562713623, "lr": 3.1350684734561676e-07, "epoch": 2.5404040404040407, "percentage": 84.68, "elapsed_time": "1:27:55", "remaining_time": "0:15:54"} | |
| {"current_steps": 3020, "total_steps": 3564, "loss": 0.6959270238876343, "lr": 3.126960746500784e-07, "epoch": 2.542087542087542, "percentage": 84.74, "elapsed_time": "1:27:59", "remaining_time": "0:15:50"} | |
| {"current_steps": 3022, "total_steps": 3564, "loss": 0.7995277643203735, "lr": 3.118880145478274e-07, "epoch": 2.5437710437710437, "percentage": 84.79, "elapsed_time": "1:28:02", "remaining_time": "0:15:47"} | |
| {"current_steps": 3024, "total_steps": 3564, "loss": 0.9624471664428711, "lr": 3.110826698229711e-07, "epoch": 2.5454545454545454, "percentage": 84.85, "elapsed_time": "1:28:06", "remaining_time": "0:15:43"} | |
| {"current_steps": 3026, "total_steps": 3564, "loss": 0.22170954942703247, "lr": 3.102800432502607e-07, "epoch": 2.547138047138047, "percentage": 84.9, "elapsed_time": "1:28:09", "remaining_time": "0:15:40"} | |
| {"current_steps": 3028, "total_steps": 3564, "loss": 0.5246233344078064, "lr": 3.0948013759508274e-07, "epoch": 2.548821548821549, "percentage": 84.96, "elapsed_time": "1:28:13", "remaining_time": "0:15:36"} | |
| {"current_steps": 3030, "total_steps": 3564, "loss": 0.4475906491279602, "lr": 3.0868295561344874e-07, "epoch": 2.5505050505050506, "percentage": 85.02, "elapsed_time": "1:28:16", "remaining_time": "0:15:33"} | |
| {"current_steps": 3032, "total_steps": 3564, "loss": 0.4590218961238861, "lr": 3.078885000519858e-07, "epoch": 2.5521885521885523, "percentage": 85.07, "elapsed_time": "1:28:19", "remaining_time": "0:15:29"} | |
| {"current_steps": 3034, "total_steps": 3564, "loss": 0.8541072607040405, "lr": 3.0709677364792767e-07, "epoch": 2.5538720538720536, "percentage": 85.13, "elapsed_time": "1:28:23", "remaining_time": "0:15:26"} | |
| {"current_steps": 3036, "total_steps": 3564, "loss": 0.9300471544265747, "lr": 3.0630777912910533e-07, "epoch": 2.5555555555555554, "percentage": 85.19, "elapsed_time": "1:28:27", "remaining_time": "0:15:22"} | |
| {"current_steps": 3038, "total_steps": 3564, "loss": 0.6171663999557495, "lr": 3.0552151921393633e-07, "epoch": 2.557239057239057, "percentage": 85.24, "elapsed_time": "1:28:30", "remaining_time": "0:15:19"} | |
| {"current_steps": 3040, "total_steps": 3564, "loss": 0.865818977355957, "lr": 3.0473799661141707e-07, "epoch": 2.558922558922559, "percentage": 85.3, "elapsed_time": "1:28:33", "remaining_time": "0:15:15"} | |
| {"current_steps": 3042, "total_steps": 3564, "loss": 0.6238538026809692, "lr": 3.0395721402111286e-07, "epoch": 2.5606060606060606, "percentage": 85.35, "elapsed_time": "1:28:37", "remaining_time": "0:15:12"} | |
| {"current_steps": 3044, "total_steps": 3564, "loss": 0.778638482093811, "lr": 3.031791741331478e-07, "epoch": 2.5622895622895623, "percentage": 85.41, "elapsed_time": "1:28:40", "remaining_time": "0:15:08"} | |
| {"current_steps": 3046, "total_steps": 3564, "loss": 0.6787006855010986, "lr": 3.0240387962819695e-07, "epoch": 2.563973063973064, "percentage": 85.47, "elapsed_time": "1:28:44", "remaining_time": "0:15:05"} | |
| {"current_steps": 3048, "total_steps": 3564, "loss": 0.8738001585006714, "lr": 3.016313331774762e-07, "epoch": 2.5656565656565657, "percentage": 85.52, "elapsed_time": "1:28:48", "remaining_time": "0:15:02"} | |
| {"current_steps": 3050, "total_steps": 3564, "loss": 0.3498271703720093, "lr": 3.008615374427329e-07, "epoch": 2.5673400673400675, "percentage": 85.58, "elapsed_time": "1:28:51", "remaining_time": "0:14:58"} | |
| {"current_steps": 3052, "total_steps": 3564, "loss": 0.9484968185424805, "lr": 3.000944950762373e-07, "epoch": 2.569023569023569, "percentage": 85.63, "elapsed_time": "1:28:55", "remaining_time": "0:14:55"} | |
| {"current_steps": 3054, "total_steps": 3564, "loss": 0.0691433697938919, "lr": 2.993302087207732e-07, "epoch": 2.570707070707071, "percentage": 85.69, "elapsed_time": "1:28:58", "remaining_time": "0:14:51"} | |
| {"current_steps": 3056, "total_steps": 3564, "loss": 0.6116932034492493, "lr": 2.985686810096285e-07, "epoch": 2.5723905723905722, "percentage": 85.75, "elapsed_time": "1:29:02", "remaining_time": "0:14:48"} | |
| {"current_steps": 3058, "total_steps": 3564, "loss": 0.3154261112213135, "lr": 2.978099145665867e-07, "epoch": 2.574074074074074, "percentage": 85.8, "elapsed_time": "1:29:05", "remaining_time": "0:14:44"} | |
| {"current_steps": 3060, "total_steps": 3564, "loss": 0.6580586433410645, "lr": 2.970539120059174e-07, "epoch": 2.5757575757575757, "percentage": 85.86, "elapsed_time": "1:29:09", "remaining_time": "0:14:41"} | |
| {"current_steps": 3062, "total_steps": 3564, "loss": 0.6125509142875671, "lr": 2.963006759323676e-07, "epoch": 2.5774410774410774, "percentage": 85.91, "elapsed_time": "1:29:13", "remaining_time": "0:14:37"} | |
| {"current_steps": 3064, "total_steps": 3564, "loss": 0.4061823785305023, "lr": 2.955502089411523e-07, "epoch": 2.579124579124579, "percentage": 85.97, "elapsed_time": "1:29:16", "remaining_time": "0:14:34"} | |
| {"current_steps": 3066, "total_steps": 3564, "loss": 0.5432108044624329, "lr": 2.9480251361794656e-07, "epoch": 2.580808080808081, "percentage": 86.03, "elapsed_time": "1:29:19", "remaining_time": "0:14:30"} | |
| {"current_steps": 3068, "total_steps": 3564, "loss": 0.2773892879486084, "lr": 2.940575925388746e-07, "epoch": 2.5824915824915826, "percentage": 86.08, "elapsed_time": "1:29:22", "remaining_time": "0:14:26"} | |
| {"current_steps": 3070, "total_steps": 3564, "loss": 0.08487945795059204, "lr": 2.933154482705035e-07, "epoch": 2.584175084175084, "percentage": 86.14, "elapsed_time": "1:29:26", "remaining_time": "0:14:23"} | |
| {"current_steps": 3072, "total_steps": 3564, "loss": 0.41717803478240967, "lr": 2.925760833698327e-07, "epoch": 2.5858585858585856, "percentage": 86.2, "elapsed_time": "1:29:29", "remaining_time": "0:14:19"} | |
| {"current_steps": 3074, "total_steps": 3564, "loss": 0.9503785371780396, "lr": 2.9183950038428475e-07, "epoch": 2.5875420875420874, "percentage": 86.25, "elapsed_time": "1:29:32", "remaining_time": "0:14:16"} | |
| {"current_steps": 3076, "total_steps": 3564, "loss": 0.3452813923358917, "lr": 2.9110570185169834e-07, "epoch": 2.589225589225589, "percentage": 86.31, "elapsed_time": "1:29:36", "remaining_time": "0:14:12"} | |
| {"current_steps": 3078, "total_steps": 3564, "loss": 0.8001734614372253, "lr": 2.903746903003184e-07, "epoch": 2.590909090909091, "percentage": 86.36, "elapsed_time": "1:29:40", "remaining_time": "0:14:09"} | |
| {"current_steps": 3080, "total_steps": 3564, "loss": 0.6741084456443787, "lr": 2.896464682487866e-07, "epoch": 2.5925925925925926, "percentage": 86.42, "elapsed_time": "1:29:43", "remaining_time": "0:14:06"} | |
| {"current_steps": 3082, "total_steps": 3564, "loss": 0.9191502332687378, "lr": 2.8892103820613487e-07, "epoch": 2.5942760942760943, "percentage": 86.48, "elapsed_time": "1:29:47", "remaining_time": "0:14:02"} | |
| {"current_steps": 3084, "total_steps": 3564, "loss": 0.5582960844039917, "lr": 2.88198402671775e-07, "epoch": 2.595959595959596, "percentage": 86.53, "elapsed_time": "1:29:51", "remaining_time": "0:13:59"} | |
| {"current_steps": 3086, "total_steps": 3564, "loss": 0.5779297947883606, "lr": 2.874785641354901e-07, "epoch": 2.5976430976430978, "percentage": 86.59, "elapsed_time": "1:29:54", "remaining_time": "0:13:55"} | |
| {"current_steps": 3088, "total_steps": 3564, "loss": 0.7671989798545837, "lr": 2.867615250774269e-07, "epoch": 2.5993265993265995, "percentage": 86.64, "elapsed_time": "1:29:57", "remaining_time": "0:13:52"} | |
| {"current_steps": 3090, "total_steps": 3564, "loss": 0.8642760515213013, "lr": 2.860472879680869e-07, "epoch": 2.601010101010101, "percentage": 86.7, "elapsed_time": "1:30:01", "remaining_time": "0:13:48"} | |
| {"current_steps": 3092, "total_steps": 3564, "loss": 0.6304323673248291, "lr": 2.8533585526831726e-07, "epoch": 2.602693602693603, "percentage": 86.76, "elapsed_time": "1:30:05", "remaining_time": "0:13:45"} | |
| {"current_steps": 3094, "total_steps": 3564, "loss": 0.4931812286376953, "lr": 2.8462722942930286e-07, "epoch": 2.6043771043771042, "percentage": 86.81, "elapsed_time": "1:30:09", "remaining_time": "0:13:41"} | |
| {"current_steps": 3096, "total_steps": 3564, "loss": 0.6241375207901001, "lr": 2.8392141289255806e-07, "epoch": 2.606060606060606, "percentage": 86.87, "elapsed_time": "1:30:12", "remaining_time": "0:13:38"} | |
| {"current_steps": 3098, "total_steps": 3564, "loss": 0.5527880191802979, "lr": 2.8321840808991775e-07, "epoch": 2.6077441077441077, "percentage": 86.92, "elapsed_time": "1:30:16", "remaining_time": "0:13:34"} | |
| {"current_steps": 3100, "total_steps": 3564, "loss": 0.6250026226043701, "lr": 2.8251821744352933e-07, "epoch": 2.6094276094276094, "percentage": 86.98, "elapsed_time": "1:30:19", "remaining_time": "0:13:31"} | |
| {"current_steps": 3102, "total_steps": 3564, "loss": 0.5582347512245178, "lr": 2.8182084336584423e-07, "epoch": 2.611111111111111, "percentage": 87.04, "elapsed_time": "1:30:22", "remaining_time": "0:13:27"} | |
| {"current_steps": 3104, "total_steps": 3564, "loss": 0.791733980178833, "lr": 2.8112628825960926e-07, "epoch": 2.612794612794613, "percentage": 87.09, "elapsed_time": "1:30:26", "remaining_time": "0:13:24"} | |
| {"current_steps": 3106, "total_steps": 3564, "loss": 0.7450399398803711, "lr": 2.804345545178594e-07, "epoch": 2.6144781144781146, "percentage": 87.15, "elapsed_time": "1:30:29", "remaining_time": "0:13:20"} | |
| {"current_steps": 3108, "total_steps": 3564, "loss": 0.17849119007587433, "lr": 2.7974564452390833e-07, "epoch": 2.616161616161616, "percentage": 87.21, "elapsed_time": "1:30:32", "remaining_time": "0:13:17"} | |
| {"current_steps": 3110, "total_steps": 3564, "loss": 0.7354204654693604, "lr": 2.790595606513406e-07, "epoch": 2.6178451178451176, "percentage": 87.26, "elapsed_time": "1:30:36", "remaining_time": "0:13:13"} | |
| {"current_steps": 3112, "total_steps": 3564, "loss": 0.41245055198669434, "lr": 2.78376305264004e-07, "epoch": 2.6195286195286194, "percentage": 87.32, "elapsed_time": "1:30:39", "remaining_time": "0:13:10"} | |
| {"current_steps": 3114, "total_steps": 3564, "loss": 0.37273505330085754, "lr": 2.776958807160011e-07, "epoch": 2.621212121212121, "percentage": 87.37, "elapsed_time": "1:30:43", "remaining_time": "0:13:06"} | |
| {"current_steps": 3116, "total_steps": 3564, "loss": 0.8599231243133545, "lr": 2.7701828935168026e-07, "epoch": 2.622895622895623, "percentage": 87.43, "elapsed_time": "1:30:47", "remaining_time": "0:13:03"} | |
| {"current_steps": 3118, "total_steps": 3564, "loss": 0.9832479953765869, "lr": 2.763435335056291e-07, "epoch": 2.6245791245791246, "percentage": 87.49, "elapsed_time": "1:30:50", "remaining_time": "0:12:59"} | |
| {"current_steps": 3120, "total_steps": 3564, "loss": 0.5217673778533936, "lr": 2.756716155026656e-07, "epoch": 2.6262626262626263, "percentage": 87.54, "elapsed_time": "1:30:54", "remaining_time": "0:12:56"} | |
| {"current_steps": 3122, "total_steps": 3564, "loss": 0.8622322082519531, "lr": 2.750025376578295e-07, "epoch": 2.627946127946128, "percentage": 87.6, "elapsed_time": "1:30:57", "remaining_time": "0:12:52"} | |
| {"current_steps": 3124, "total_steps": 3564, "loss": 0.8336771726608276, "lr": 2.743363022763758e-07, "epoch": 2.6296296296296298, "percentage": 87.65, "elapsed_time": "1:31:01", "remaining_time": "0:12:49"} | |
| {"current_steps": 3126, "total_steps": 3564, "loss": 0.5954484939575195, "lr": 2.7367291165376593e-07, "epoch": 2.6313131313131315, "percentage": 87.71, "elapsed_time": "1:31:05", "remaining_time": "0:12:45"} | |
| {"current_steps": 3128, "total_steps": 3564, "loss": 0.8022388219833374, "lr": 2.7301236807565925e-07, "epoch": 2.6329966329966332, "percentage": 87.77, "elapsed_time": "1:31:08", "remaining_time": "0:12:42"} | |
| {"current_steps": 3130, "total_steps": 3564, "loss": 0.5048923492431641, "lr": 2.7235467381790654e-07, "epoch": 2.634680134680135, "percentage": 87.82, "elapsed_time": "1:31:12", "remaining_time": "0:12:38"} | |
| {"current_steps": 3132, "total_steps": 3564, "loss": 0.2697800397872925, "lr": 2.716998311465415e-07, "epoch": 2.6363636363636362, "percentage": 87.88, "elapsed_time": "1:31:15", "remaining_time": "0:12:35"} | |
| {"current_steps": 3134, "total_steps": 3564, "loss": 0.8560886383056641, "lr": 2.710478423177722e-07, "epoch": 2.638047138047138, "percentage": 87.93, "elapsed_time": "1:31:19", "remaining_time": "0:12:31"} | |
| {"current_steps": 3136, "total_steps": 3564, "loss": 0.7351222038269043, "lr": 2.7039870957797464e-07, "epoch": 2.6397306397306397, "percentage": 87.99, "elapsed_time": "1:31:23", "remaining_time": "0:12:28"} | |
| {"current_steps": 3138, "total_steps": 3564, "loss": 0.41435521841049194, "lr": 2.697524351636844e-07, "epoch": 2.6414141414141414, "percentage": 88.05, "elapsed_time": "1:31:26", "remaining_time": "0:12:24"} | |
| {"current_steps": 3140, "total_steps": 3564, "loss": 0.9173501133918762, "lr": 2.691090213015886e-07, "epoch": 2.643097643097643, "percentage": 88.1, "elapsed_time": "1:31:30", "remaining_time": "0:12:21"} | |
| {"current_steps": 3142, "total_steps": 3564, "loss": 0.5904110670089722, "lr": 2.6846847020851884e-07, "epoch": 2.644781144781145, "percentage": 88.16, "elapsed_time": "1:31:33", "remaining_time": "0:12:17"} | |
| {"current_steps": 3144, "total_steps": 3564, "loss": 0.8097279071807861, "lr": 2.678307840914431e-07, "epoch": 2.6464646464646466, "percentage": 88.22, "elapsed_time": "1:31:37", "remaining_time": "0:12:14"} | |
| {"current_steps": 3146, "total_steps": 3564, "loss": 0.8938575983047485, "lr": 2.6719596514745826e-07, "epoch": 2.648148148148148, "percentage": 88.27, "elapsed_time": "1:31:40", "remaining_time": "0:12:10"} | |
| {"current_steps": 3148, "total_steps": 3564, "loss": 0.5425578355789185, "lr": 2.665640155637828e-07, "epoch": 2.6498316498316496, "percentage": 88.33, "elapsed_time": "1:31:44", "remaining_time": "0:12:07"} | |
| {"current_steps": 3150, "total_steps": 3564, "loss": 0.8360292911529541, "lr": 2.659349375177489e-07, "epoch": 2.6515151515151514, "percentage": 88.38, "elapsed_time": "1:31:47", "remaining_time": "0:12:03"} | |
| {"current_steps": 3152, "total_steps": 3564, "loss": 0.2029864341020584, "lr": 2.6530873317679515e-07, "epoch": 2.653198653198653, "percentage": 88.44, "elapsed_time": "1:31:50", "remaining_time": "0:12:00"} | |
| {"current_steps": 3154, "total_steps": 3564, "loss": 0.9556988477706909, "lr": 2.6468540469845895e-07, "epoch": 2.654882154882155, "percentage": 88.5, "elapsed_time": "1:31:53", "remaining_time": "0:11:56"} | |
| {"current_steps": 3156, "total_steps": 3564, "loss": 0.5114415884017944, "lr": 2.640649542303693e-07, "epoch": 2.6565656565656566, "percentage": 88.55, "elapsed_time": "1:31:57", "remaining_time": "0:11:53"} | |
| {"current_steps": 3158, "total_steps": 3564, "loss": 0.39493846893310547, "lr": 2.634473839102389e-07, "epoch": 2.6582491582491583, "percentage": 88.61, "elapsed_time": "1:32:00", "remaining_time": "0:11:49"} | |
| {"current_steps": 3160, "total_steps": 3564, "loss": 0.5446680784225464, "lr": 2.6283269586585737e-07, "epoch": 2.65993265993266, "percentage": 88.66, "elapsed_time": "1:32:04", "remaining_time": "0:11:46"} | |
| {"current_steps": 3162, "total_steps": 3564, "loss": 0.6248540282249451, "lr": 2.6222089221508404e-07, "epoch": 2.6616161616161618, "percentage": 88.72, "elapsed_time": "1:32:07", "remaining_time": "0:11:42"} | |
| {"current_steps": 3164, "total_steps": 3564, "loss": 0.8368432521820068, "lr": 2.6161197506583944e-07, "epoch": 2.6632996632996635, "percentage": 88.78, "elapsed_time": "1:32:11", "remaining_time": "0:11:39"} | |
| {"current_steps": 3166, "total_steps": 3564, "loss": 0.619489312171936, "lr": 2.610059465160995e-07, "epoch": 2.6649831649831652, "percentage": 88.83, "elapsed_time": "1:32:14", "remaining_time": "0:11:35"} | |
| {"current_steps": 3168, "total_steps": 3564, "loss": 0.7894487380981445, "lr": 2.6040280865388773e-07, "epoch": 2.6666666666666665, "percentage": 88.89, "elapsed_time": "1:32:18", "remaining_time": "0:11:32"} | |
| {"current_steps": 3170, "total_steps": 3564, "loss": 0.5782526135444641, "lr": 2.5980256355726744e-07, "epoch": 2.6683501683501682, "percentage": 88.95, "elapsed_time": "1:32:21", "remaining_time": "0:11:28"} | |
| {"current_steps": 3172, "total_steps": 3564, "loss": 1.0222315788269043, "lr": 2.5920521329433606e-07, "epoch": 2.67003367003367, "percentage": 89.0, "elapsed_time": "1:32:25", "remaining_time": "0:11:25"} | |
| {"current_steps": 3174, "total_steps": 3564, "loss": 0.9073632955551147, "lr": 2.586107599232164e-07, "epoch": 2.6717171717171717, "percentage": 89.06, "elapsed_time": "1:32:29", "remaining_time": "0:11:21"} | |
| {"current_steps": 3176, "total_steps": 3564, "loss": 0.46630191802978516, "lr": 2.5801920549205023e-07, "epoch": 2.6734006734006734, "percentage": 89.11, "elapsed_time": "1:32:32", "remaining_time": "0:11:18"} | |
| {"current_steps": 3178, "total_steps": 3564, "loss": 0.9780217409133911, "lr": 2.5743055203899167e-07, "epoch": 2.675084175084175, "percentage": 89.17, "elapsed_time": "1:32:36", "remaining_time": "0:11:14"} | |
| {"current_steps": 3180, "total_steps": 3564, "loss": 0.639081597328186, "lr": 2.568448015921996e-07, "epoch": 2.676767676767677, "percentage": 89.23, "elapsed_time": "1:32:39", "remaining_time": "0:11:11"} | |
| {"current_steps": 3182, "total_steps": 3564, "loss": 0.7984585762023926, "lr": 2.562619561698306e-07, "epoch": 2.678451178451178, "percentage": 89.28, "elapsed_time": "1:32:43", "remaining_time": "0:11:07"} | |
| {"current_steps": 3184, "total_steps": 3564, "loss": 0.9407286643981934, "lr": 2.556820177800324e-07, "epoch": 2.68013468013468, "percentage": 89.34, "elapsed_time": "1:32:46", "remaining_time": "0:11:04"} | |
| {"current_steps": 3186, "total_steps": 3564, "loss": 0.8115611672401428, "lr": 2.551049884209371e-07, "epoch": 2.6818181818181817, "percentage": 89.39, "elapsed_time": "1:32:50", "remaining_time": "0:11:00"} | |
| {"current_steps": 3188, "total_steps": 3564, "loss": 0.7339519262313843, "lr": 2.5453087008065307e-07, "epoch": 2.6835016835016834, "percentage": 89.45, "elapsed_time": "1:32:53", "remaining_time": "0:10:57"} | |
| {"current_steps": 3190, "total_steps": 3564, "loss": 0.49706321954727173, "lr": 2.5395966473725994e-07, "epoch": 2.685185185185185, "percentage": 89.51, "elapsed_time": "1:32:57", "remaining_time": "0:10:53"} | |
| {"current_steps": 3192, "total_steps": 3564, "loss": 0.6397048234939575, "lr": 2.5339137435880043e-07, "epoch": 2.686868686868687, "percentage": 89.56, "elapsed_time": "1:33:00", "remaining_time": "0:10:50"} | |
| {"current_steps": 3194, "total_steps": 3564, "loss": 0.7652658820152283, "lr": 2.5282600090327383e-07, "epoch": 2.6885521885521886, "percentage": 89.62, "elapsed_time": "1:33:04", "remaining_time": "0:10:46"} | |
| {"current_steps": 3196, "total_steps": 3564, "loss": 0.6125460863113403, "lr": 2.5226354631862966e-07, "epoch": 2.6902356902356903, "percentage": 89.67, "elapsed_time": "1:33:07", "remaining_time": "0:10:43"} | |
| {"current_steps": 3198, "total_steps": 3564, "loss": 0.7383702397346497, "lr": 2.517040125427608e-07, "epoch": 2.691919191919192, "percentage": 89.73, "elapsed_time": "1:33:11", "remaining_time": "0:10:39"} | |
| {"current_steps": 3200, "total_steps": 3564, "loss": 0.8494305610656738, "lr": 2.511474015034964e-07, "epoch": 2.6936026936026938, "percentage": 89.79, "elapsed_time": "1:33:15", "remaining_time": "0:10:36"} | |
| {"current_steps": 3202, "total_steps": 3564, "loss": 0.6800326108932495, "lr": 2.5059371511859557e-07, "epoch": 2.6952861952861955, "percentage": 89.84, "elapsed_time": "1:33:19", "remaining_time": "0:10:33"} | |
| {"current_steps": 3204, "total_steps": 3564, "loss": 0.6918296813964844, "lr": 2.50042955295741e-07, "epoch": 2.6969696969696972, "percentage": 89.9, "elapsed_time": "1:33:22", "remaining_time": "0:10:29"} | |
| {"current_steps": 3206, "total_steps": 3564, "loss": 0.6519820094108582, "lr": 2.494951239325321e-07, "epoch": 2.6986531986531985, "percentage": 89.96, "elapsed_time": "1:33:26", "remaining_time": "0:10:26"} | |
| {"current_steps": 3208, "total_steps": 3564, "loss": 0.5281827449798584, "lr": 2.489502229164781e-07, "epoch": 2.7003367003367003, "percentage": 90.01, "elapsed_time": "1:33:29", "remaining_time": "0:10:22"} | |
| {"current_steps": 3210, "total_steps": 3564, "loss": 0.8719410300254822, "lr": 2.4840825412499274e-07, "epoch": 2.702020202020202, "percentage": 90.07, "elapsed_time": "1:33:33", "remaining_time": "0:10:19"} | |
| {"current_steps": 3212, "total_steps": 3564, "loss": 0.5532783269882202, "lr": 2.478692194253861e-07, "epoch": 2.7037037037037037, "percentage": 90.12, "elapsed_time": "1:33:36", "remaining_time": "0:10:15"} | |
| {"current_steps": 3214, "total_steps": 3564, "loss": 0.5865626931190491, "lr": 2.473331206748597e-07, "epoch": 2.7053872053872055, "percentage": 90.18, "elapsed_time": "1:33:40", "remaining_time": "0:10:12"} | |
| {"current_steps": 3216, "total_steps": 3564, "loss": 0.2805863618850708, "lr": 2.467999597204996e-07, "epoch": 2.707070707070707, "percentage": 90.24, "elapsed_time": "1:33:43", "remaining_time": "0:10:08"} | |
| {"current_steps": 3218, "total_steps": 3564, "loss": 0.7335485219955444, "lr": 2.462697383992691e-07, "epoch": 2.708754208754209, "percentage": 90.29, "elapsed_time": "1:33:46", "remaining_time": "0:10:04"} | |
| {"current_steps": 3220, "total_steps": 3564, "loss": 0.3276599943637848, "lr": 2.457424585380041e-07, "epoch": 2.71043771043771, "percentage": 90.35, "elapsed_time": "1:33:50", "remaining_time": "0:10:01"} | |
| {"current_steps": 3222, "total_steps": 3564, "loss": 0.672775149345398, "lr": 2.4521812195340544e-07, "epoch": 2.712121212121212, "percentage": 90.4, "elapsed_time": "1:33:53", "remaining_time": "0:09:57"} | |
| {"current_steps": 3224, "total_steps": 3564, "loss": 0.40836215019226074, "lr": 2.4469673045203333e-07, "epoch": 2.7138047138047137, "percentage": 90.46, "elapsed_time": "1:33:56", "remaining_time": "0:09:54"} | |
| {"current_steps": 3226, "total_steps": 3564, "loss": 0.4133344888687134, "lr": 2.441782858303007e-07, "epoch": 2.7154882154882154, "percentage": 90.52, "elapsed_time": "1:34:00", "remaining_time": "0:09:50"} | |
| {"current_steps": 3228, "total_steps": 3564, "loss": 0.7267272472381592, "lr": 2.436627898744678e-07, "epoch": 2.717171717171717, "percentage": 90.57, "elapsed_time": "1:34:03", "remaining_time": "0:09:47"} | |
| {"current_steps": 3230, "total_steps": 3564, "loss": 0.42516928911209106, "lr": 2.4315024436063464e-07, "epoch": 2.718855218855219, "percentage": 90.63, "elapsed_time": "1:34:06", "remaining_time": "0:09:43"} | |
| {"current_steps": 3232, "total_steps": 3564, "loss": 0.768959641456604, "lr": 2.4264065105473637e-07, "epoch": 2.7205387205387206, "percentage": 90.68, "elapsed_time": "1:34:10", "remaining_time": "0:09:40"} | |
| {"current_steps": 3234, "total_steps": 3564, "loss": 0.6403470039367676, "lr": 2.4213401171253656e-07, "epoch": 2.7222222222222223, "percentage": 90.74, "elapsed_time": "1:34:13", "remaining_time": "0:09:36"} | |
| {"current_steps": 3236, "total_steps": 3564, "loss": 0.7732399106025696, "lr": 2.416303280796206e-07, "epoch": 2.723905723905724, "percentage": 90.8, "elapsed_time": "1:34:16", "remaining_time": "0:09:33"} | |
| {"current_steps": 3238, "total_steps": 3564, "loss": 0.7329007387161255, "lr": 2.411296018913907e-07, "epoch": 2.725589225589226, "percentage": 90.85, "elapsed_time": "1:34:20", "remaining_time": "0:09:29"} | |
| {"current_steps": 3240, "total_steps": 3564, "loss": 0.7464162111282349, "lr": 2.406318348730592e-07, "epoch": 2.7272727272727275, "percentage": 90.91, "elapsed_time": "1:34:24", "remaining_time": "0:09:26"} | |
| {"current_steps": 3242, "total_steps": 3564, "loss": 0.7636083364486694, "lr": 2.401370287396428e-07, "epoch": 2.728956228956229, "percentage": 90.97, "elapsed_time": "1:34:28", "remaining_time": "0:09:22"} | |
| {"current_steps": 3244, "total_steps": 3564, "loss": 0.599960207939148, "lr": 2.396451851959571e-07, "epoch": 2.7306397306397305, "percentage": 91.02, "elapsed_time": "1:34:31", "remaining_time": "0:09:19"} | |
| {"current_steps": 3246, "total_steps": 3564, "loss": 0.7824025750160217, "lr": 2.391563059366099e-07, "epoch": 2.7323232323232323, "percentage": 91.08, "elapsed_time": "1:34:35", "remaining_time": "0:09:16"} | |
| {"current_steps": 3248, "total_steps": 3564, "loss": 0.8408564329147339, "lr": 2.3867039264599587e-07, "epoch": 2.734006734006734, "percentage": 91.13, "elapsed_time": "1:34:39", "remaining_time": "0:09:12"} | |
| {"current_steps": 3250, "total_steps": 3564, "loss": 0.6503514051437378, "lr": 2.3818744699829105e-07, "epoch": 2.7356902356902357, "percentage": 91.19, "elapsed_time": "1:34:41", "remaining_time": "0:09:08"} | |
| {"current_steps": 3252, "total_steps": 3564, "loss": 0.3846713900566101, "lr": 2.3770747065744594e-07, "epoch": 2.7373737373737375, "percentage": 91.25, "elapsed_time": "1:34:44", "remaining_time": "0:09:05"} | |
| {"current_steps": 3254, "total_steps": 3564, "loss": 0.5147488713264465, "lr": 2.3723046527718137e-07, "epoch": 2.739057239057239, "percentage": 91.3, "elapsed_time": "1:34:48", "remaining_time": "0:09:01"} | |
| {"current_steps": 3256, "total_steps": 3564, "loss": 0.5139864087104797, "lr": 2.367564325009815e-07, "epoch": 2.7407407407407405, "percentage": 91.36, "elapsed_time": "1:34:52", "remaining_time": "0:08:58"} | |
| {"current_steps": 3258, "total_steps": 3564, "loss": 0.5290718078613281, "lr": 2.362853739620885e-07, "epoch": 2.742424242424242, "percentage": 91.41, "elapsed_time": "1:34:55", "remaining_time": "0:08:54"} | |
| {"current_steps": 3260, "total_steps": 3564, "loss": 0.3965787887573242, "lr": 2.3581729128349745e-07, "epoch": 2.744107744107744, "percentage": 91.47, "elapsed_time": "1:34:58", "remaining_time": "0:08:51"} | |
| {"current_steps": 3262, "total_steps": 3564, "loss": 0.6484100222587585, "lr": 2.3535218607795013e-07, "epoch": 2.7457912457912457, "percentage": 91.53, "elapsed_time": "1:35:02", "remaining_time": "0:08:47"} | |
| {"current_steps": 3264, "total_steps": 3564, "loss": 0.8430534601211548, "lr": 2.3489005994792948e-07, "epoch": 2.7474747474747474, "percentage": 91.58, "elapsed_time": "1:35:05", "remaining_time": "0:08:44"} | |
| {"current_steps": 3266, "total_steps": 3564, "loss": 0.957166314125061, "lr": 2.3443091448565454e-07, "epoch": 2.749158249158249, "percentage": 91.64, "elapsed_time": "1:35:08", "remaining_time": "0:08:40"} | |
| {"current_steps": 3268, "total_steps": 3564, "loss": 0.3728073835372925, "lr": 2.339747512730749e-07, "epoch": 2.750841750841751, "percentage": 91.69, "elapsed_time": "1:35:11", "remaining_time": "0:08:37"} | |
| {"current_steps": 3270, "total_steps": 3564, "loss": 0.9523381590843201, "lr": 2.3352157188186424e-07, "epoch": 2.7525252525252526, "percentage": 91.75, "elapsed_time": "1:35:15", "remaining_time": "0:08:33"} | |
| {"current_steps": 3272, "total_steps": 3564, "loss": 0.4420832395553589, "lr": 2.3307137787341667e-07, "epoch": 2.7542087542087543, "percentage": 91.81, "elapsed_time": "1:35:18", "remaining_time": "0:08:30"} | |
| {"current_steps": 3274, "total_steps": 3564, "loss": 0.660933792591095, "lr": 2.3262417079883986e-07, "epoch": 2.755892255892256, "percentage": 91.86, "elapsed_time": "1:35:22", "remaining_time": "0:08:26"} | |
| {"current_steps": 3276, "total_steps": 3564, "loss": 0.3062414228916168, "lr": 2.3217995219895016e-07, "epoch": 2.757575757575758, "percentage": 91.92, "elapsed_time": "1:35:26", "remaining_time": "0:08:23"} | |
| {"current_steps": 3278, "total_steps": 3564, "loss": 0.021941782906651497, "lr": 2.317387236042678e-07, "epoch": 2.7592592592592595, "percentage": 91.98, "elapsed_time": "1:35:29", "remaining_time": "0:08:19"} | |
| {"current_steps": 3280, "total_steps": 3564, "loss": 1.040034532546997, "lr": 2.313004865350109e-07, "epoch": 2.760942760942761, "percentage": 92.03, "elapsed_time": "1:35:33", "remaining_time": "0:08:16"} | |
| {"current_steps": 3282, "total_steps": 3564, "loss": 1.0358326435089111, "lr": 2.3086524250109045e-07, "epoch": 2.7626262626262625, "percentage": 92.09, "elapsed_time": "1:35:37", "remaining_time": "0:08:12"} | |
| {"current_steps": 3284, "total_steps": 3564, "loss": 0.23045207560062408, "lr": 2.3043299300210528e-07, "epoch": 2.7643097643097643, "percentage": 92.14, "elapsed_time": "1:35:40", "remaining_time": "0:08:09"} | |
| {"current_steps": 3286, "total_steps": 3564, "loss": 0.7953276038169861, "lr": 2.30003739527337e-07, "epoch": 2.765993265993266, "percentage": 92.2, "elapsed_time": "1:35:44", "remaining_time": "0:08:05"} | |
| {"current_steps": 3288, "total_steps": 3564, "loss": 0.7808912396430969, "lr": 2.2957748355574408e-07, "epoch": 2.7676767676767677, "percentage": 92.26, "elapsed_time": "1:35:47", "remaining_time": "0:08:02"} | |
| {"current_steps": 3290, "total_steps": 3564, "loss": 0.2024976909160614, "lr": 2.2915422655595795e-07, "epoch": 2.7693602693602695, "percentage": 92.31, "elapsed_time": "1:35:51", "remaining_time": "0:07:58"} | |
| {"current_steps": 3292, "total_steps": 3564, "loss": 0.9757770299911499, "lr": 2.287339699862771e-07, "epoch": 2.771043771043771, "percentage": 92.37, "elapsed_time": "1:35:54", "remaining_time": "0:07:55"} | |
| {"current_steps": 3294, "total_steps": 3564, "loss": 0.8145531415939331, "lr": 2.2831671529466205e-07, "epoch": 2.7727272727272725, "percentage": 92.42, "elapsed_time": "1:35:58", "remaining_time": "0:07:51"} | |
| {"current_steps": 3296, "total_steps": 3564, "loss": 0.8364596366882324, "lr": 2.2790246391873086e-07, "epoch": 2.774410774410774, "percentage": 92.48, "elapsed_time": "1:36:01", "remaining_time": "0:07:48"} | |
| {"current_steps": 3298, "total_steps": 3564, "loss": 0.2111830711364746, "lr": 2.2749121728575393e-07, "epoch": 2.776094276094276, "percentage": 92.54, "elapsed_time": "1:36:05", "remaining_time": "0:07:44"} | |
| {"current_steps": 3300, "total_steps": 3564, "loss": 0.4531656801700592, "lr": 2.2708297681264874e-07, "epoch": 2.7777777777777777, "percentage": 92.59, "elapsed_time": "1:36:08", "remaining_time": "0:07:41"} | |
| {"current_steps": 3302, "total_steps": 3564, "loss": 0.486369788646698, "lr": 2.2667774390597562e-07, "epoch": 2.7794612794612794, "percentage": 92.65, "elapsed_time": "1:36:12", "remaining_time": "0:07:38"} | |
| {"current_steps": 3304, "total_steps": 3564, "loss": 0.4338839054107666, "lr": 2.2627551996193247e-07, "epoch": 2.781144781144781, "percentage": 92.7, "elapsed_time": "1:36:16", "remaining_time": "0:07:34"} | |
| {"current_steps": 3306, "total_steps": 3564, "loss": 0.7146729230880737, "lr": 2.2587630636634985e-07, "epoch": 2.782828282828283, "percentage": 92.76, "elapsed_time": "1:36:19", "remaining_time": "0:07:31"} | |
| {"current_steps": 3308, "total_steps": 3564, "loss": 0.426150381565094, "lr": 2.2548010449468676e-07, "epoch": 2.7845117845117846, "percentage": 92.82, "elapsed_time": "1:36:23", "remaining_time": "0:07:27"} | |
| {"current_steps": 3310, "total_steps": 3564, "loss": 0.6131501793861389, "lr": 2.2508691571202528e-07, "epoch": 2.7861952861952863, "percentage": 92.87, "elapsed_time": "1:36:27", "remaining_time": "0:07:24"} | |
| {"current_steps": 3312, "total_steps": 3564, "loss": 0.4474066197872162, "lr": 2.2469674137306627e-07, "epoch": 2.787878787878788, "percentage": 92.93, "elapsed_time": "1:36:30", "remaining_time": "0:07:20"} | |
| {"current_steps": 3314, "total_steps": 3564, "loss": 0.676105260848999, "lr": 2.2430958282212414e-07, "epoch": 2.78956228956229, "percentage": 92.99, "elapsed_time": "1:36:34", "remaining_time": "0:07:17"} | |
| {"current_steps": 3316, "total_steps": 3564, "loss": 0.9383071660995483, "lr": 2.239254413931236e-07, "epoch": 2.791245791245791, "percentage": 93.04, "elapsed_time": "1:36:38", "remaining_time": "0:07:13"} | |
| {"current_steps": 3318, "total_steps": 3564, "loss": 0.7455552220344543, "lr": 2.2354431840959307e-07, "epoch": 2.792929292929293, "percentage": 93.1, "elapsed_time": "1:36:41", "remaining_time": "0:07:10"} | |
| {"current_steps": 3320, "total_steps": 3564, "loss": 0.28741055727005005, "lr": 2.2316621518466167e-07, "epoch": 2.7946127946127945, "percentage": 93.15, "elapsed_time": "1:36:45", "remaining_time": "0:07:06"} | |
| {"current_steps": 3322, "total_steps": 3564, "loss": 0.6114668250083923, "lr": 2.227911330210542e-07, "epoch": 2.7962962962962963, "percentage": 93.21, "elapsed_time": "1:36:49", "remaining_time": "0:07:03"} | |
| {"current_steps": 3324, "total_steps": 3564, "loss": 0.6540449857711792, "lr": 2.2241907321108638e-07, "epoch": 2.797979797979798, "percentage": 93.27, "elapsed_time": "1:36:52", "remaining_time": "0:06:59"} | |
| {"current_steps": 3326, "total_steps": 3564, "loss": 0.30680525302886963, "lr": 2.22050037036661e-07, "epoch": 2.7996632996632997, "percentage": 93.32, "elapsed_time": "1:36:55", "remaining_time": "0:06:56"} | |
| {"current_steps": 3328, "total_steps": 3564, "loss": 0.7153966426849365, "lr": 2.216840257692628e-07, "epoch": 2.8013468013468015, "percentage": 93.38, "elapsed_time": "1:36:59", "remaining_time": "0:06:52"} | |
| {"current_steps": 3330, "total_steps": 3564, "loss": 0.7619553804397583, "lr": 2.213210406699547e-07, "epoch": 2.8030303030303028, "percentage": 93.43, "elapsed_time": "1:37:03", "remaining_time": "0:06:49"} | |
| {"current_steps": 3332, "total_steps": 3564, "loss": 0.5717604160308838, "lr": 2.209610829893729e-07, "epoch": 2.8047138047138045, "percentage": 93.49, "elapsed_time": "1:37:06", "remaining_time": "0:06:45"} | |
| {"current_steps": 3334, "total_steps": 3564, "loss": 0.5182145833969116, "lr": 2.2060415396772337e-07, "epoch": 2.8063973063973062, "percentage": 93.55, "elapsed_time": "1:37:09", "remaining_time": "0:06:42"} | |
| {"current_steps": 3336, "total_steps": 3564, "loss": 0.5500608682632446, "lr": 2.2025025483477654e-07, "epoch": 2.808080808080808, "percentage": 93.6, "elapsed_time": "1:37:13", "remaining_time": "0:06:38"} | |
| {"current_steps": 3338, "total_steps": 3564, "loss": 0.2802525758743286, "lr": 2.1989938680986382e-07, "epoch": 2.8097643097643097, "percentage": 93.66, "elapsed_time": "1:37:15", "remaining_time": "0:06:35"} | |
| {"current_steps": 3340, "total_steps": 3564, "loss": 0.6136119365692139, "lr": 2.1955155110187344e-07, "epoch": 2.8114478114478114, "percentage": 93.71, "elapsed_time": "1:37:18", "remaining_time": "0:06:31"} | |
| {"current_steps": 3342, "total_steps": 3564, "loss": 0.7545953989028931, "lr": 2.1920674890924545e-07, "epoch": 2.813131313131313, "percentage": 93.77, "elapsed_time": "1:37:22", "remaining_time": "0:06:28"} | |
| {"current_steps": 3344, "total_steps": 3564, "loss": 0.33089566230773926, "lr": 2.1886498141996858e-07, "epoch": 2.814814814814815, "percentage": 93.83, "elapsed_time": "1:37:25", "remaining_time": "0:06:24"} | |
| {"current_steps": 3346, "total_steps": 3564, "loss": 0.820242166519165, "lr": 2.185262498115759e-07, "epoch": 2.8164983164983166, "percentage": 93.88, "elapsed_time": "1:37:29", "remaining_time": "0:06:21"} | |
| {"current_steps": 3348, "total_steps": 3564, "loss": 0.4794435501098633, "lr": 2.1819055525113995e-07, "epoch": 2.8181818181818183, "percentage": 93.94, "elapsed_time": "1:37:32", "remaining_time": "0:06:17"} | |
| {"current_steps": 3350, "total_steps": 3564, "loss": 0.8766056299209595, "lr": 2.178578988952698e-07, "epoch": 2.81986531986532, "percentage": 94.0, "elapsed_time": "1:37:35", "remaining_time": "0:06:14"} | |
| {"current_steps": 3352, "total_steps": 3564, "loss": 0.8210408687591553, "lr": 2.1752828189010677e-07, "epoch": 2.821548821548822, "percentage": 94.05, "elapsed_time": "1:37:39", "remaining_time": "0:06:10"} | |
| {"current_steps": 3354, "total_steps": 3564, "loss": 0.7889919281005859, "lr": 2.1720170537132003e-07, "epoch": 2.823232323232323, "percentage": 94.11, "elapsed_time": "1:37:43", "remaining_time": "0:06:07"} | |
| {"current_steps": 3356, "total_steps": 3564, "loss": 0.7373786568641663, "lr": 2.16878170464103e-07, "epoch": 2.824915824915825, "percentage": 94.16, "elapsed_time": "1:37:46", "remaining_time": "0:06:03"} | |
| {"current_steps": 3358, "total_steps": 3564, "loss": 0.4632776975631714, "lr": 2.1655767828316967e-07, "epoch": 2.8265993265993266, "percentage": 94.22, "elapsed_time": "1:37:50", "remaining_time": "0:06:00"} | |
| {"current_steps": 3360, "total_steps": 3564, "loss": 0.47924166917800903, "lr": 2.1624022993275042e-07, "epoch": 2.8282828282828283, "percentage": 94.28, "elapsed_time": "1:37:54", "remaining_time": "0:05:56"} | |
| {"current_steps": 3362, "total_steps": 3564, "loss": 0.5661218166351318, "lr": 2.1592582650658838e-07, "epoch": 2.82996632996633, "percentage": 94.33, "elapsed_time": "1:37:57", "remaining_time": "0:05:53"} | |
| {"current_steps": 3364, "total_steps": 3564, "loss": 0.5744220018386841, "lr": 2.1561446908793575e-07, "epoch": 2.8316498316498318, "percentage": 94.39, "elapsed_time": "1:38:00", "remaining_time": "0:05:49"} | |
| {"current_steps": 3366, "total_steps": 3564, "loss": 0.4627985954284668, "lr": 2.1530615874954978e-07, "epoch": 2.8333333333333335, "percentage": 94.44, "elapsed_time": "1:38:03", "remaining_time": "0:05:46"} | |
| {"current_steps": 3368, "total_steps": 3564, "loss": 0.4576794505119324, "lr": 2.1500089655368913e-07, "epoch": 2.8350168350168348, "percentage": 94.5, "elapsed_time": "1:38:06", "remaining_time": "0:05:42"} | |
| {"current_steps": 3370, "total_steps": 3564, "loss": 0.8104113340377808, "lr": 2.146986835521108e-07, "epoch": 2.8367003367003365, "percentage": 94.56, "elapsed_time": "1:38:10", "remaining_time": "0:05:39"} | |
| {"current_steps": 3372, "total_steps": 3564, "loss": 0.6803615093231201, "lr": 2.143995207860655e-07, "epoch": 2.8383838383838382, "percentage": 94.61, "elapsed_time": "1:38:14", "remaining_time": "0:05:35"} | |
| {"current_steps": 3374, "total_steps": 3564, "loss": 0.2819385230541229, "lr": 2.1410340928629483e-07, "epoch": 2.84006734006734, "percentage": 94.67, "elapsed_time": "1:38:17", "remaining_time": "0:05:32"} | |
| {"current_steps": 3376, "total_steps": 3564, "loss": 0.8866885900497437, "lr": 2.138103500730278e-07, "epoch": 2.8417508417508417, "percentage": 94.73, "elapsed_time": "1:38:21", "remaining_time": "0:05:28"} | |
| {"current_steps": 3378, "total_steps": 3564, "loss": 0.7249988317489624, "lr": 2.1352034415597635e-07, "epoch": 2.8434343434343434, "percentage": 94.78, "elapsed_time": "1:38:25", "remaining_time": "0:05:25"} | |
| {"current_steps": 3380, "total_steps": 3564, "loss": 0.5438086986541748, "lr": 2.1323339253433309e-07, "epoch": 2.845117845117845, "percentage": 94.84, "elapsed_time": "1:38:28", "remaining_time": "0:05:21"} | |
| {"current_steps": 3382, "total_steps": 3564, "loss": 0.5575168132781982, "lr": 2.1294949619676717e-07, "epoch": 2.846801346801347, "percentage": 94.89, "elapsed_time": "1:38:32", "remaining_time": "0:05:18"} | |
| {"current_steps": 3384, "total_steps": 3564, "loss": 0.5616028308868408, "lr": 2.1266865612142064e-07, "epoch": 2.8484848484848486, "percentage": 94.95, "elapsed_time": "1:38:36", "remaining_time": "0:05:14"} | |
| {"current_steps": 3386, "total_steps": 3564, "loss": 0.7617322206497192, "lr": 2.1239087327590582e-07, "epoch": 2.8501683501683504, "percentage": 95.01, "elapsed_time": "1:38:39", "remaining_time": "0:05:11"} | |
| {"current_steps": 3388, "total_steps": 3564, "loss": 0.7200487852096558, "lr": 2.121161486173017e-07, "epoch": 2.851851851851852, "percentage": 95.06, "elapsed_time": "1:38:42", "remaining_time": "0:05:07"} | |
| {"current_steps": 3390, "total_steps": 3564, "loss": 0.4146542549133301, "lr": 2.1184448309215015e-07, "epoch": 2.8535353535353534, "percentage": 95.12, "elapsed_time": "1:38:46", "remaining_time": "0:05:04"} | |
| {"current_steps": 3392, "total_steps": 3564, "loss": 0.46166175603866577, "lr": 2.1157587763645322e-07, "epoch": 2.855218855218855, "percentage": 95.17, "elapsed_time": "1:38:50", "remaining_time": "0:05:00"} | |
| {"current_steps": 3394, "total_steps": 3564, "loss": 0.930475652217865, "lr": 2.113103331756698e-07, "epoch": 2.856902356902357, "percentage": 95.23, "elapsed_time": "1:38:53", "remaining_time": "0:04:57"} | |
| {"current_steps": 3396, "total_steps": 3564, "loss": 0.9054207801818848, "lr": 2.110478506247122e-07, "epoch": 2.8585858585858586, "percentage": 95.29, "elapsed_time": "1:38:56", "remaining_time": "0:04:53"} | |
| {"current_steps": 3398, "total_steps": 3564, "loss": 0.4588157534599304, "lr": 2.1078843088794325e-07, "epoch": 2.8602693602693603, "percentage": 95.34, "elapsed_time": "1:39:00", "remaining_time": "0:04:50"} | |
| {"current_steps": 3400, "total_steps": 3564, "loss": 0.3445073962211609, "lr": 2.105320748591732e-07, "epoch": 2.861952861952862, "percentage": 95.4, "elapsed_time": "1:39:03", "remaining_time": "0:04:46"} | |
| {"current_steps": 3402, "total_steps": 3564, "loss": 0.4542715847492218, "lr": 2.1027878342165624e-07, "epoch": 2.8636363636363638, "percentage": 95.45, "elapsed_time": "1:39:07", "remaining_time": "0:04:43"} | |
| {"current_steps": 3404, "total_steps": 3564, "loss": 0.38249820470809937, "lr": 2.1002855744808815e-07, "epoch": 2.865319865319865, "percentage": 95.51, "elapsed_time": "1:39:11", "remaining_time": "0:04:39"} | |
| {"current_steps": 3406, "total_steps": 3564, "loss": 0.7736653089523315, "lr": 2.0978139780060257e-07, "epoch": 2.8670033670033668, "percentage": 95.57, "elapsed_time": "1:39:14", "remaining_time": "0:04:36"} | |
| {"current_steps": 3408, "total_steps": 3564, "loss": 0.30026775598526, "lr": 2.0953730533076862e-07, "epoch": 2.8686868686868685, "percentage": 95.62, "elapsed_time": "1:39:17", "remaining_time": "0:04:32"} | |
| {"current_steps": 3410, "total_steps": 3564, "loss": 0.7915642261505127, "lr": 2.0929628087958734e-07, "epoch": 2.8703703703703702, "percentage": 95.68, "elapsed_time": "1:39:21", "remaining_time": "0:04:29"} | |
| {"current_steps": 3412, "total_steps": 3564, "loss": 0.4548564851284027, "lr": 2.0905832527748953e-07, "epoch": 2.872053872053872, "percentage": 95.74, "elapsed_time": "1:39:23", "remaining_time": "0:04:25"} | |
| {"current_steps": 3414, "total_steps": 3564, "loss": 0.6330816745758057, "lr": 2.0882343934433236e-07, "epoch": 2.8737373737373737, "percentage": 95.79, "elapsed_time": "1:39:27", "remaining_time": "0:04:22"} | |
| {"current_steps": 3416, "total_steps": 3564, "loss": 0.17160841822624207, "lr": 2.085916238893966e-07, "epoch": 2.8754208754208754, "percentage": 95.85, "elapsed_time": "1:39:30", "remaining_time": "0:04:18"} | |
| {"current_steps": 3418, "total_steps": 3564, "loss": 0.6133572459220886, "lr": 2.0836287971138418e-07, "epoch": 2.877104377104377, "percentage": 95.9, "elapsed_time": "1:39:34", "remaining_time": "0:04:15"} | |
| {"current_steps": 3420, "total_steps": 3564, "loss": 0.37677788734436035, "lr": 2.0813720759841492e-07, "epoch": 2.878787878787879, "percentage": 95.96, "elapsed_time": "1:39:37", "remaining_time": "0:04:11"} | |
| {"current_steps": 3422, "total_steps": 3564, "loss": 0.6834679841995239, "lr": 2.0791460832802423e-07, "epoch": 2.8804713804713806, "percentage": 96.02, "elapsed_time": "1:39:41", "remaining_time": "0:04:08"} | |
| {"current_steps": 3424, "total_steps": 3564, "loss": 0.5820834636688232, "lr": 2.0769508266716027e-07, "epoch": 2.8821548821548824, "percentage": 96.07, "elapsed_time": "1:39:44", "remaining_time": "0:04:04"} | |
| {"current_steps": 3426, "total_steps": 3564, "loss": 0.6087404489517212, "lr": 2.0747863137218126e-07, "epoch": 2.883838383838384, "percentage": 96.13, "elapsed_time": "1:39:48", "remaining_time": "0:04:01"} | |
| {"current_steps": 3428, "total_steps": 3564, "loss": 0.5436590909957886, "lr": 2.0726525518885308e-07, "epoch": 2.8855218855218854, "percentage": 96.18, "elapsed_time": "1:39:51", "remaining_time": "0:03:57"} | |
| {"current_steps": 3430, "total_steps": 3564, "loss": 0.28521019220352173, "lr": 2.0705495485234653e-07, "epoch": 2.887205387205387, "percentage": 96.24, "elapsed_time": "1:39:55", "remaining_time": "0:03:54"} | |
| {"current_steps": 3432, "total_steps": 3564, "loss": 0.5188443660736084, "lr": 2.0684773108723455e-07, "epoch": 2.888888888888889, "percentage": 96.3, "elapsed_time": "1:39:59", "remaining_time": "0:03:50"} | |
| {"current_steps": 3434, "total_steps": 3564, "loss": 0.2710973620414734, "lr": 2.0664358460749018e-07, "epoch": 2.8905723905723906, "percentage": 96.35, "elapsed_time": "1:40:02", "remaining_time": "0:03:47"} | |
| {"current_steps": 3436, "total_steps": 3564, "loss": 0.9403241872787476, "lr": 2.064425161164842e-07, "epoch": 2.8922558922558923, "percentage": 96.41, "elapsed_time": "1:40:05", "remaining_time": "0:03:43"} | |
| {"current_steps": 3438, "total_steps": 3564, "loss": 0.8685269355773926, "lr": 2.0624452630698195e-07, "epoch": 2.893939393939394, "percentage": 96.46, "elapsed_time": "1:40:08", "remaining_time": "0:03:40"} | |
| {"current_steps": 3440, "total_steps": 3564, "loss": 0.7080799341201782, "lr": 2.0604961586114163e-07, "epoch": 2.8956228956228958, "percentage": 96.52, "elapsed_time": "1:40:12", "remaining_time": "0:03:36"} | |
| {"current_steps": 3442, "total_steps": 3564, "loss": 0.9225847721099854, "lr": 2.0585778545051195e-07, "epoch": 2.897306397306397, "percentage": 96.58, "elapsed_time": "1:40:16", "remaining_time": "0:03:33"} | |
| {"current_steps": 3444, "total_steps": 3564, "loss": 0.26514777541160583, "lr": 2.0566903573602913e-07, "epoch": 2.898989898989899, "percentage": 96.63, "elapsed_time": "1:40:19", "remaining_time": "0:03:29"} | |
| {"current_steps": 3446, "total_steps": 3564, "loss": 0.5182454586029053, "lr": 2.0548336736801548e-07, "epoch": 2.9006734006734005, "percentage": 96.69, "elapsed_time": "1:40:23", "remaining_time": "0:03:26"} | |
| {"current_steps": 3448, "total_steps": 3564, "loss": 1.0010104179382324, "lr": 2.0530078098617668e-07, "epoch": 2.9023569023569022, "percentage": 96.75, "elapsed_time": "1:40:27", "remaining_time": "0:03:22"} | |
| {"current_steps": 3450, "total_steps": 3564, "loss": 0.23654749989509583, "lr": 2.0512127721959954e-07, "epoch": 2.904040404040404, "percentage": 96.8, "elapsed_time": "1:40:30", "remaining_time": "0:03:19"} | |
| {"current_steps": 3452, "total_steps": 3564, "loss": 0.6079249382019043, "lr": 2.0494485668675003e-07, "epoch": 2.9057239057239057, "percentage": 96.86, "elapsed_time": "1:40:34", "remaining_time": "0:03:15"} | |
| {"current_steps": 3454, "total_steps": 3564, "loss": 0.5366786122322083, "lr": 2.0477151999547137e-07, "epoch": 2.9074074074074074, "percentage": 96.91, "elapsed_time": "1:40:38", "remaining_time": "0:03:12"} | |
| {"current_steps": 3456, "total_steps": 3564, "loss": 0.9563678503036499, "lr": 2.0460126774298115e-07, "epoch": 2.909090909090909, "percentage": 96.97, "elapsed_time": "1:40:41", "remaining_time": "0:03:08"} | |
| {"current_steps": 3458, "total_steps": 3564, "loss": 0.7329115867614746, "lr": 2.044341005158701e-07, "epoch": 2.910774410774411, "percentage": 97.03, "elapsed_time": "1:40:45", "remaining_time": "0:03:05"} | |
| {"current_steps": 3460, "total_steps": 3564, "loss": 0.9082905054092407, "lr": 2.042700188900996e-07, "epoch": 2.9124579124579126, "percentage": 97.08, "elapsed_time": "1:40:49", "remaining_time": "0:03:01"} | |
| {"current_steps": 3462, "total_steps": 3564, "loss": 1.0648142099380493, "lr": 2.0410902343099998e-07, "epoch": 2.9141414141414144, "percentage": 97.14, "elapsed_time": "1:40:52", "remaining_time": "0:02:58"} | |
| {"current_steps": 3464, "total_steps": 3564, "loss": 0.6280519962310791, "lr": 2.039511146932683e-07, "epoch": 2.915824915824916, "percentage": 97.19, "elapsed_time": "1:40:55", "remaining_time": "0:02:54"} | |
| {"current_steps": 3466, "total_steps": 3564, "loss": 0.9411839246749878, "lr": 2.0379629322096658e-07, "epoch": 2.9175084175084174, "percentage": 97.25, "elapsed_time": "1:40:59", "remaining_time": "0:02:51"} | |
| {"current_steps": 3468, "total_steps": 3564, "loss": 0.5461298823356628, "lr": 2.036445595475199e-07, "epoch": 2.919191919191919, "percentage": 97.31, "elapsed_time": "1:41:02", "remaining_time": "0:02:47"} | |
| {"current_steps": 3470, "total_steps": 3564, "loss": 0.0855223536491394, "lr": 2.0349591419571473e-07, "epoch": 2.920875420875421, "percentage": 97.36, "elapsed_time": "1:41:05", "remaining_time": "0:02:44"} | |
| {"current_steps": 3472, "total_steps": 3564, "loss": 0.6720945835113525, "lr": 2.0335035767769674e-07, "epoch": 2.9225589225589226, "percentage": 97.42, "elapsed_time": "1:41:08", "remaining_time": "0:02:40"} | |
| {"current_steps": 3474, "total_steps": 3564, "loss": 0.6181377172470093, "lr": 2.032078904949694e-07, "epoch": 2.9242424242424243, "percentage": 97.47, "elapsed_time": "1:41:11", "remaining_time": "0:02:37"} | |
| {"current_steps": 3476, "total_steps": 3564, "loss": 0.25879359245300293, "lr": 2.0306851313839217e-07, "epoch": 2.925925925925926, "percentage": 97.53, "elapsed_time": "1:41:15", "remaining_time": "0:02:33"} | |
| {"current_steps": 3478, "total_steps": 3564, "loss": 0.7951024770736694, "lr": 2.0293222608817862e-07, "epoch": 2.9276094276094278, "percentage": 97.59, "elapsed_time": "1:41:19", "remaining_time": "0:02:30"} | |
| {"current_steps": 3480, "total_steps": 3564, "loss": 0.4090489447116852, "lr": 2.0279902981389491e-07, "epoch": 2.929292929292929, "percentage": 97.64, "elapsed_time": "1:41:22", "remaining_time": "0:02:26"} | |
| {"current_steps": 3482, "total_steps": 3564, "loss": 0.7058537602424622, "lr": 2.026689247744584e-07, "epoch": 2.930976430976431, "percentage": 97.7, "elapsed_time": "1:41:26", "remaining_time": "0:02:23"} | |
| {"current_steps": 3484, "total_steps": 3564, "loss": 0.4949754476547241, "lr": 2.0254191141813563e-07, "epoch": 2.9326599326599325, "percentage": 97.76, "elapsed_time": "1:41:29", "remaining_time": "0:02:19"} | |
| {"current_steps": 3486, "total_steps": 3564, "loss": 0.6103169322013855, "lr": 2.0241799018254102e-07, "epoch": 2.9343434343434343, "percentage": 97.81, "elapsed_time": "1:41:33", "remaining_time": "0:02:16"} | |
| {"current_steps": 3488, "total_steps": 3564, "loss": 0.5724541544914246, "lr": 2.0229716149463543e-07, "epoch": 2.936026936026936, "percentage": 97.87, "elapsed_time": "1:41:36", "remaining_time": "0:02:12"} | |
| {"current_steps": 3490, "total_steps": 3564, "loss": 0.5570365190505981, "lr": 2.0217942577072447e-07, "epoch": 2.9377104377104377, "percentage": 97.92, "elapsed_time": "1:41:39", "remaining_time": "0:02:09"} | |
| {"current_steps": 3492, "total_steps": 3564, "loss": 0.8093217611312866, "lr": 2.0206478341645734e-07, "epoch": 2.9393939393939394, "percentage": 97.98, "elapsed_time": "1:41:42", "remaining_time": "0:02:05"} | |
| {"current_steps": 3494, "total_steps": 3564, "loss": 0.40408650040626526, "lr": 2.0195323482682508e-07, "epoch": 2.941077441077441, "percentage": 98.04, "elapsed_time": "1:41:46", "remaining_time": "0:02:02"} | |
| {"current_steps": 3496, "total_steps": 3564, "loss": 0.6976212859153748, "lr": 2.0184478038615948e-07, "epoch": 2.942760942760943, "percentage": 98.09, "elapsed_time": "1:41:49", "remaining_time": "0:01:58"} | |
| {"current_steps": 3498, "total_steps": 3564, "loss": 0.30283308029174805, "lr": 2.0173942046813191e-07, "epoch": 2.9444444444444446, "percentage": 98.15, "elapsed_time": "1:41:53", "remaining_time": "0:01:55"} | |
| {"current_steps": 3500, "total_steps": 3564, "loss": 0.6129805445671082, "lr": 2.016371554357515e-07, "epoch": 2.9461279461279464, "percentage": 98.2, "elapsed_time": "1:41:57", "remaining_time": "0:01:51"} | |
| {"current_steps": 3502, "total_steps": 3564, "loss": 0.6700767278671265, "lr": 2.015379856413643e-07, "epoch": 2.9478114478114477, "percentage": 98.26, "elapsed_time": "1:42:00", "remaining_time": "0:01:48"} | |
| {"current_steps": 3504, "total_steps": 3564, "loss": 0.32376813888549805, "lr": 2.01441911426652e-07, "epoch": 2.9494949494949494, "percentage": 98.32, "elapsed_time": "1:42:04", "remaining_time": "0:01:44"} | |
| {"current_steps": 3506, "total_steps": 3564, "loss": 0.6684743762016296, "lr": 2.013489331226307e-07, "epoch": 2.951178451178451, "percentage": 98.37, "elapsed_time": "1:42:07", "remaining_time": "0:01:41"} | |
| {"current_steps": 3508, "total_steps": 3564, "loss": 0.846743106842041, "lr": 2.0125905104964978e-07, "epoch": 2.952861952861953, "percentage": 98.43, "elapsed_time": "1:42:10", "remaining_time": "0:01:37"} | |
| {"current_steps": 3510, "total_steps": 3564, "loss": 0.6087542772293091, "lr": 2.0117226551739068e-07, "epoch": 2.9545454545454546, "percentage": 98.48, "elapsed_time": "1:42:14", "remaining_time": "0:01:34"} | |
| {"current_steps": 3512, "total_steps": 3564, "loss": 0.8167439103126526, "lr": 2.0108857682486629e-07, "epoch": 2.9562289562289563, "percentage": 98.54, "elapsed_time": "1:42:18", "remaining_time": "0:01:30"} | |
| {"current_steps": 3514, "total_steps": 3564, "loss": 0.304475873708725, "lr": 2.0100798526041927e-07, "epoch": 2.957912457912458, "percentage": 98.6, "elapsed_time": "1:42:21", "remaining_time": "0:01:27"} | |
| {"current_steps": 3516, "total_steps": 3564, "loss": 0.8450760841369629, "lr": 2.009304911017215e-07, "epoch": 2.9595959595959593, "percentage": 98.65, "elapsed_time": "1:42:25", "remaining_time": "0:01:23"} | |
| {"current_steps": 3518, "total_steps": 3564, "loss": 0.8154351711273193, "lr": 2.0085609461577295e-07, "epoch": 2.961279461279461, "percentage": 98.71, "elapsed_time": "1:42:28", "remaining_time": "0:01:20"} | |
| {"current_steps": 3520, "total_steps": 3564, "loss": 0.35378673672676086, "lr": 2.0078479605890064e-07, "epoch": 2.962962962962963, "percentage": 98.77, "elapsed_time": "1:42:32", "remaining_time": "0:01:16"} | |
| {"current_steps": 3522, "total_steps": 3564, "loss": 0.6887914538383484, "lr": 2.007165956767584e-07, "epoch": 2.9646464646464645, "percentage": 98.82, "elapsed_time": "1:42:35", "remaining_time": "0:01:13"} | |
| {"current_steps": 3524, "total_steps": 3564, "loss": 0.22204965353012085, "lr": 2.00651493704325e-07, "epoch": 2.9663299663299663, "percentage": 98.88, "elapsed_time": "1:42:39", "remaining_time": "0:01:09"} | |
| {"current_steps": 3526, "total_steps": 3564, "loss": 0.8485254645347595, "lr": 2.0058949036590426e-07, "epoch": 2.968013468013468, "percentage": 98.93, "elapsed_time": "1:42:42", "remaining_time": "0:01:06"} | |
| {"current_steps": 3528, "total_steps": 3564, "loss": 0.7592622637748718, "lr": 2.0053058587512378e-07, "epoch": 2.9696969696969697, "percentage": 98.99, "elapsed_time": "1:42:46", "remaining_time": "0:01:02"} | |
| {"current_steps": 3530, "total_steps": 3564, "loss": 0.7468944191932678, "lr": 2.0047478043493418e-07, "epoch": 2.9713804713804715, "percentage": 99.05, "elapsed_time": "1:42:49", "remaining_time": "0:00:59"} | |
| {"current_steps": 3532, "total_steps": 3564, "loss": 0.6274712681770325, "lr": 2.004220742376088e-07, "epoch": 2.973063973063973, "percentage": 99.1, "elapsed_time": "1:42:53", "remaining_time": "0:00:55"} | |
| {"current_steps": 3534, "total_steps": 3564, "loss": 0.19880472123622894, "lr": 2.0037246746474277e-07, "epoch": 2.974747474747475, "percentage": 99.16, "elapsed_time": "1:42:56", "remaining_time": "0:00:52"} | |
| {"current_steps": 3536, "total_steps": 3564, "loss": 0.8517122268676758, "lr": 2.0032596028725204e-07, "epoch": 2.9764309764309766, "percentage": 99.21, "elapsed_time": "1:42:59", "remaining_time": "0:00:48"} | |
| {"current_steps": 3538, "total_steps": 3564, "loss": 0.4260925352573395, "lr": 2.0028255286537355e-07, "epoch": 2.9781144781144784, "percentage": 99.27, "elapsed_time": "1:43:03", "remaining_time": "0:00:45"} | |
| {"current_steps": 3540, "total_steps": 3564, "loss": 0.9670834541320801, "lr": 2.0024224534866408e-07, "epoch": 2.9797979797979797, "percentage": 99.33, "elapsed_time": "1:43:07", "remaining_time": "0:00:41"} | |
| {"current_steps": 3542, "total_steps": 3564, "loss": 0.8684190511703491, "lr": 2.0020503787599998e-07, "epoch": 2.9814814814814814, "percentage": 99.38, "elapsed_time": "1:43:10", "remaining_time": "0:00:38"} | |
| {"current_steps": 3544, "total_steps": 3564, "loss": 0.4294402599334717, "lr": 2.001709305755767e-07, "epoch": 2.983164983164983, "percentage": 99.44, "elapsed_time": "1:43:14", "remaining_time": "0:00:34"} | |
| {"current_steps": 3546, "total_steps": 3564, "loss": 0.8262860178947449, "lr": 2.0013992356490827e-07, "epoch": 2.984848484848485, "percentage": 99.49, "elapsed_time": "1:43:17", "remaining_time": "0:00:31"} | |
| {"current_steps": 3548, "total_steps": 3564, "loss": 0.39053958654403687, "lr": 2.0011201695082687e-07, "epoch": 2.9865319865319866, "percentage": 99.55, "elapsed_time": "1:43:21", "remaining_time": "0:00:27"} | |
| {"current_steps": 3550, "total_steps": 3564, "loss": 0.2766346037387848, "lr": 2.0008721082948243e-07, "epoch": 2.9882154882154883, "percentage": 99.61, "elapsed_time": "1:43:24", "remaining_time": "0:00:24"} | |
| {"current_steps": 3552, "total_steps": 3564, "loss": 0.5050246715545654, "lr": 2.0006550528634258e-07, "epoch": 2.98989898989899, "percentage": 99.66, "elapsed_time": "1:43:28", "remaining_time": "0:00:20"} | |
| {"current_steps": 3554, "total_steps": 3564, "loss": 0.8541325926780701, "lr": 2.00046900396192e-07, "epoch": 2.9915824915824913, "percentage": 99.72, "elapsed_time": "1:43:31", "remaining_time": "0:00:17"} | |
| {"current_steps": 3556, "total_steps": 3564, "loss": 0.7546226978302002, "lr": 2.0003139622313241e-07, "epoch": 2.993265993265993, "percentage": 99.78, "elapsed_time": "1:43:35", "remaining_time": "0:00:13"} | |
| {"current_steps": 3558, "total_steps": 3564, "loss": 0.6056807041168213, "lr": 2.0001899282058216e-07, "epoch": 2.994949494949495, "percentage": 99.83, "elapsed_time": "1:43:38", "remaining_time": "0:00:10"} | |
| {"current_steps": 3560, "total_steps": 3564, "loss": 0.3962956964969635, "lr": 2.000096902312762e-07, "epoch": 2.9966329966329965, "percentage": 99.89, "elapsed_time": "1:43:42", "remaining_time": "0:00:06"} | |
| {"current_steps": 3562, "total_steps": 3564, "loss": 0.5580795407295227, "lr": 2.0000348848726586e-07, "epoch": 2.9983164983164983, "percentage": 99.94, "elapsed_time": "1:43:46", "remaining_time": "0:00:03"} | |
| {"current_steps": 3564, "total_steps": 3564, "loss": 0.46740537881851196, "lr": 2.0000038760991877e-07, "epoch": 3.0, "percentage": 100.0, "elapsed_time": "1:43:49", "remaining_time": "0:00:00"} | |
| {"current_steps": 3564, "total_steps": 3564, "epoch": 3.0, "percentage": 100.0, "elapsed_time": "1:43:49", "remaining_time": "0:00:00"} | |