Text Generation
Transformers
Safetensors
qwen2
llama-factory
full
Generated from Trainer
conversational
text-generation-inference
Instructions to use salmannyu/step_cot with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use salmannyu/step_cot with Transformers:
    # Use a pipeline as a high-level helper
    from transformers import pipeline

    pipe = pipeline("text-generation", model="salmannyu/step_cot")
    messages = [
        {"role": "user", "content": "Who are you?"},
    ]
    pipe(messages)

    # Load model directly (qwen2 is a text-only causal LM)
    from transformers import AutoTokenizer, AutoModelForCausalLM

    tokenizer = AutoTokenizer.from_pretrained("salmannyu/step_cot")
    model = AutoModelForCausalLM.from_pretrained("salmannyu/step_cot")
    messages = [
        {"role": "user", "content": "Who are you?"},
    ]
    # Apply the chat template and move the tensors to the model's device
    inputs = tokenizer.apply_chat_template(
        messages,
        add_generation_prompt=True,
        tokenize=True,
        return_dict=True,
        return_tensors="pt",
    ).to(model.device)

    outputs = model.generate(**inputs, max_new_tokens=40)
    # Decode only the newly generated tokens, skipping the prompt
    print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use salmannyu/step_cot with vLLM:
Install from pip and serve model
    # Install vLLM from pip:
    pip install vllm

    # Start the vLLM server:
    vllm serve "salmannyu/step_cot"

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:8000/v1/chat/completions" \
        -H "Content-Type: application/json" \
        --data '{
            "model": "salmannyu/step_cot",
            "messages": [
                {
                    "role": "user",
                    "content": "What is the capital of France?"
                }
            ]
        }'
Use Docker
docker model run hf.co/salmannyu/step_cot
- SGLang
How to use salmannyu/step_cot with SGLang:
Install from pip and serve model
    # Install SGLang from pip:
    pip install sglang

    # Start the SGLang server:
    python3 -m sglang.launch_server \
        --model-path "salmannyu/step_cot" \
        --host 0.0.0.0 \
        --port 30000

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/chat/completions" \
        -H "Content-Type: application/json" \
        --data '{
            "model": "salmannyu/step_cot",
            "messages": [
                {
                    "role": "user",
                    "content": "What is the capital of France?"
                }
            ]
        }'
Use Docker images
    docker run --gpus all \
        --shm-size 32g \
        -p 30000:30000 \
        -v ~/.cache/huggingface:/root/.cache/huggingface \
        --env "HF_TOKEN=<secret>" \
        --ipc=host \
        lmsysorg/sglang:latest \
        python3 -m sglang.launch_server \
            --model-path "salmannyu/step_cot" \
            --host 0.0.0.0 \
            --port 30000

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/chat/completions" \
        -H "Content-Type: application/json" \
        --data '{
            "model": "salmannyu/step_cot",
            "messages": [
                {
                    "role": "user",
                    "content": "What is the capital of France?"
                }
            ]
        }'
- Docker Model Runner
How to use salmannyu/step_cot with Docker Model Runner:
docker model run hf.co/salmannyu/step_cot
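
The vLLM and SGLang servers above both expose an OpenAI-compatible `/v1/chat/completions` endpoint, so the curl calls can also be issued from Python. The following is a minimal sketch using only the standard library. The helper names `build_chat_request` and `ask` are hypothetical, and the default base URL assumes the vLLM port shown above (8000); pass `http://localhost:30000` for SGLang.

```python
import json
import urllib.request

MODEL_ID = "salmannyu/step_cot"


def build_chat_request(prompt, base_url="http://localhost:8000", model=MODEL_ID):
    """Build the same POST request that the curl examples send."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(prompt, base_url="http://localhost:8000"):
    """Send the request to a running server and return the reply text."""
    with urllib.request.urlopen(build_chat_request(prompt, base_url)) as resp:
        body = json.load(resp)
    # OpenAI-compatible servers return the reply under choices[0].message.content.
    return body["choices"][0]["message"]["content"]


# Example (requires a server started with one of the commands above):
#     print(ask("What is the capital of France?"))
```

Keeping request construction separate from the network call makes the payload easy to inspect before any server is running.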
| {"current_steps": 10, "total_steps": 5907, "loss": 0.755, "lr": 7.614213197969544e-08, "epoch": 0.005078720162519045, "percentage": 0.17, "elapsed_time": "0:00:33", "remaining_time": "5:25:45"} | |
| {"current_steps": 20, "total_steps": 5907, "loss": 0.7436, "lr": 1.607445008460237e-07, "epoch": 0.01015744032503809, "percentage": 0.34, "elapsed_time": "0:01:25", "remaining_time": "6:57:31"} | |
| {"current_steps": 30, "total_steps": 5907, "loss": 0.7286, "lr": 2.4534686971235194e-07, "epoch": 0.015236160487557136, "percentage": 0.51, "elapsed_time": "0:02:08", "remaining_time": "7:00:50"} | |
| {"current_steps": 40, "total_steps": 5907, "loss": 0.6637, "lr": 3.2994923857868026e-07, "epoch": 0.02031488065007618, "percentage": 0.68, "elapsed_time": "0:02:40", "remaining_time": "6:31:17"} | |
| {"current_steps": 50, "total_steps": 5907, "loss": 0.5849, "lr": 4.1455160744500853e-07, "epoch": 0.025393600812595226, "percentage": 0.85, "elapsed_time": "0:03:26", "remaining_time": "6:42:41"} | |
| {"current_steps": 60, "total_steps": 5907, "loss": 0.5074, "lr": 4.991539763113367e-07, "epoch": 0.03047232097511427, "percentage": 1.02, "elapsed_time": "0:04:12", "remaining_time": "6:50:09"} | |
| {"current_steps": 70, "total_steps": 5907, "loss": 0.4381, "lr": 5.83756345177665e-07, "epoch": 0.03555104113763331, "percentage": 1.19, "elapsed_time": "0:04:46", "remaining_time": "6:38:10"} | |
| {"current_steps": 80, "total_steps": 5907, "loss": 0.3912, "lr": 6.683587140439933e-07, "epoch": 0.04062976130015236, "percentage": 1.35, "elapsed_time": "0:05:20", "remaining_time": "6:29:39"} | |
| {"current_steps": 90, "total_steps": 5907, "loss": 0.3717, "lr": 7.529610829103214e-07, "epoch": 0.0457084814626714, "percentage": 1.52, "elapsed_time": "0:05:53", "remaining_time": "6:21:04"} | |
| {"current_steps": 100, "total_steps": 5907, "loss": 0.3763, "lr": 8.375634517766498e-07, "epoch": 0.05078720162519045, "percentage": 1.69, "elapsed_time": "0:06:53", "remaining_time": "6:40:14"} | |
| {"current_steps": 110, "total_steps": 5907, "loss": 0.3648, "lr": 9.22165820642978e-07, "epoch": 0.055865921787709494, "percentage": 1.86, "elapsed_time": "0:07:47", "remaining_time": "6:50:57"} | |
| {"current_steps": 120, "total_steps": 5907, "loss": 0.3593, "lr": 1.0067681895093063e-06, "epoch": 0.06094464195022854, "percentage": 2.03, "elapsed_time": "0:08:18", "remaining_time": "6:40:36"} | |
| {"current_steps": 130, "total_steps": 5907, "loss": 0.3619, "lr": 1.0913705583756345e-06, "epoch": 0.06602336211274759, "percentage": 2.2, "elapsed_time": "0:08:48", "remaining_time": "6:31:44"} | |
| {"current_steps": 140, "total_steps": 5907, "loss": 0.3492, "lr": 1.1759729272419628e-06, "epoch": 0.07110208227526663, "percentage": 2.37, "elapsed_time": "0:09:36", "remaining_time": "6:35:44"} | |
| {"current_steps": 150, "total_steps": 5907, "loss": 0.359, "lr": 1.2605752961082913e-06, "epoch": 0.07618080243778567, "percentage": 2.54, "elapsed_time": "0:10:17", "remaining_time": "6:34:50"} | |
| {"current_steps": 160, "total_steps": 5907, "loss": 0.3396, "lr": 1.3451776649746193e-06, "epoch": 0.08125952260030472, "percentage": 2.71, "elapsed_time": "0:10:50", "remaining_time": "6:29:40"} | |
| {"current_steps": 170, "total_steps": 5907, "loss": 0.3412, "lr": 1.4297800338409476e-06, "epoch": 0.08633824276282377, "percentage": 2.88, "elapsed_time": "0:11:43", "remaining_time": "6:35:56"} | |
| {"current_steps": 180, "total_steps": 5907, "loss": 0.3436, "lr": 1.5143824027072759e-06, "epoch": 0.0914169629253428, "percentage": 3.05, "elapsed_time": "0:12:15", "remaining_time": "6:29:46"} | |
| {"current_steps": 190, "total_steps": 5907, "loss": 0.3399, "lr": 1.5989847715736043e-06, "epoch": 0.09649568308786186, "percentage": 3.22, "elapsed_time": "0:12:49", "remaining_time": "6:25:39"} | |
| {"current_steps": 200, "total_steps": 5907, "loss": 0.3465, "lr": 1.6835871404399324e-06, "epoch": 0.1015744032503809, "percentage": 3.39, "elapsed_time": "0:13:23", "remaining_time": "6:22:20"} | |
| {"current_steps": 210, "total_steps": 5907, "loss": 0.3422, "lr": 1.7681895093062607e-06, "epoch": 0.10665312341289995, "percentage": 3.56, "elapsed_time": "0:14:01", "remaining_time": "6:20:32"} | |
| {"current_steps": 220, "total_steps": 5907, "loss": 0.3336, "lr": 1.852791878172589e-06, "epoch": 0.11173184357541899, "percentage": 3.72, "elapsed_time": "0:14:52", "remaining_time": "6:24:30"} | |
| {"current_steps": 230, "total_steps": 5907, "loss": 0.3293, "lr": 1.937394247038917e-06, "epoch": 0.11681056373793804, "percentage": 3.89, "elapsed_time": "0:15:43", "remaining_time": "6:27:59"} | |
| {"current_steps": 240, "total_steps": 5907, "loss": 0.3376, "lr": 2.0219966159052453e-06, "epoch": 0.12188928390045708, "percentage": 4.06, "elapsed_time": "0:16:13", "remaining_time": "6:23:15"} | |
| {"current_steps": 250, "total_steps": 5907, "loss": 0.342, "lr": 2.1065989847715737e-06, "epoch": 0.12696800406297612, "percentage": 4.23, "elapsed_time": "0:16:58", "remaining_time": "6:24:02"} | |
| {"current_steps": 260, "total_steps": 5907, "loss": 0.3322, "lr": 2.1912013536379022e-06, "epoch": 0.13204672422549518, "percentage": 4.4, "elapsed_time": "0:17:28", "remaining_time": "6:19:23"} | |
| {"current_steps": 270, "total_steps": 5907, "loss": 0.3311, "lr": 2.2758037225042303e-06, "epoch": 0.13712544438801422, "percentage": 4.57, "elapsed_time": "0:18:00", "remaining_time": "6:15:57"} | |
| {"current_steps": 280, "total_steps": 5907, "loss": 0.3426, "lr": 2.3604060913705588e-06, "epoch": 0.14220416455053325, "percentage": 4.74, "elapsed_time": "0:18:33", "remaining_time": "6:13:03"} | |
| {"current_steps": 290, "total_steps": 5907, "loss": 0.321, "lr": 2.445008460236887e-06, "epoch": 0.14728288471305231, "percentage": 4.91, "elapsed_time": "0:19:17", "remaining_time": "6:13:40"} | |
| {"current_steps": 300, "total_steps": 5907, "loss": 0.328, "lr": 2.5296108291032153e-06, "epoch": 0.15236160487557135, "percentage": 5.08, "elapsed_time": "0:19:59", "remaining_time": "6:13:47"} | |
| {"current_steps": 310, "total_steps": 5907, "loss": 0.3299, "lr": 2.6142131979695434e-06, "epoch": 0.1574403250380904, "percentage": 5.25, "elapsed_time": "0:20:31", "remaining_time": "6:10:38"} | |
| {"current_steps": 320, "total_steps": 5907, "loss": 0.3192, "lr": 2.698815566835872e-06, "epoch": 0.16251904520060945, "percentage": 5.42, "elapsed_time": "0:21:08", "remaining_time": "6:09:00"} | |
| {"current_steps": 330, "total_steps": 5907, "loss": 0.3211, "lr": 2.7834179357022e-06, "epoch": 0.16759776536312848, "percentage": 5.59, "elapsed_time": "0:21:42", "remaining_time": "6:06:46"} | |
| {"current_steps": 340, "total_steps": 5907, "loss": 0.3299, "lr": 2.8680203045685284e-06, "epoch": 0.17267648552564754, "percentage": 5.76, "elapsed_time": "0:22:39", "remaining_time": "6:11:05"} | |
| {"current_steps": 350, "total_steps": 5907, "loss": 0.3286, "lr": 2.952622673434856e-06, "epoch": 0.17775520568816658, "percentage": 5.93, "elapsed_time": "0:24:05", "remaining_time": "6:22:23"} | |
| {"current_steps": 360, "total_steps": 5907, "loss": 0.3174, "lr": 3.0372250423011845e-06, "epoch": 0.1828339258506856, "percentage": 6.09, "elapsed_time": "0:24:37", "remaining_time": "6:19:29"} | |
| {"current_steps": 370, "total_steps": 5907, "loss": 0.327, "lr": 3.121827411167513e-06, "epoch": 0.18791264601320468, "percentage": 6.26, "elapsed_time": "0:25:15", "remaining_time": "6:17:56"} | |
| {"current_steps": 380, "total_steps": 5907, "loss": 0.3266, "lr": 3.206429780033841e-06, "epoch": 0.1929913661757237, "percentage": 6.43, "elapsed_time": "0:26:13", "remaining_time": "6:21:21"} | |
| {"current_steps": 390, "total_steps": 5907, "loss": 0.3288, "lr": 3.2910321489001695e-06, "epoch": 0.19807008633824277, "percentage": 6.6, "elapsed_time": "0:26:43", "remaining_time": "6:17:58"} | |
| {"current_steps": 400, "total_steps": 5907, "loss": 0.3318, "lr": 3.375634517766498e-06, "epoch": 0.2031488065007618, "percentage": 6.77, "elapsed_time": "0:27:14", "remaining_time": "6:14:56"} | |
| {"current_steps": 410, "total_steps": 5907, "loss": 0.3153, "lr": 3.460236886632826e-06, "epoch": 0.20822752666328084, "percentage": 6.94, "elapsed_time": "0:28:03", "remaining_time": "6:16:07"} | |
| {"current_steps": 420, "total_steps": 5907, "loss": 0.3249, "lr": 3.5448392554991545e-06, "epoch": 0.2133062468257999, "percentage": 7.11, "elapsed_time": "0:28:48", "remaining_time": "6:16:17"} | |
| {"current_steps": 430, "total_steps": 5907, "loss": 0.3218, "lr": 3.629441624365482e-06, "epoch": 0.21838496698831894, "percentage": 7.28, "elapsed_time": "0:29:36", "remaining_time": "6:17:05"} | |
| {"current_steps": 440, "total_steps": 5907, "loss": 0.3338, "lr": 3.7140439932318106e-06, "epoch": 0.22346368715083798, "percentage": 7.45, "elapsed_time": "0:30:21", "remaining_time": "6:17:13"} | |
| {"current_steps": 450, "total_steps": 5907, "loss": 0.3251, "lr": 3.798646362098139e-06, "epoch": 0.22854240731335704, "percentage": 7.62, "elapsed_time": "0:31:08", "remaining_time": "6:17:44"} | |
| {"current_steps": 460, "total_steps": 5907, "loss": 0.3251, "lr": 3.883248730964467e-06, "epoch": 0.23362112747587607, "percentage": 7.79, "elapsed_time": "0:31:45", "remaining_time": "6:16:07"} | |
| {"current_steps": 470, "total_steps": 5907, "loss": 0.3234, "lr": 3.967851099830796e-06, "epoch": 0.23869984763839514, "percentage": 7.96, "elapsed_time": "0:32:25", "remaining_time": "6:15:00"} | |
| {"current_steps": 480, "total_steps": 5907, "loss": 0.3255, "lr": 4.052453468697124e-06, "epoch": 0.24377856780091417, "percentage": 8.13, "elapsed_time": "0:33:14", "remaining_time": "6:15:53"} | |
| {"current_steps": 490, "total_steps": 5907, "loss": 0.3214, "lr": 4.137055837563453e-06, "epoch": 0.2488572879634332, "percentage": 8.3, "elapsed_time": "0:34:00", "remaining_time": "6:16:01"} | |
| {"current_steps": 500, "total_steps": 5907, "loss": 0.3221, "lr": 4.22165820642978e-06, "epoch": 0.25393600812595224, "percentage": 8.46, "elapsed_time": "0:35:02", "remaining_time": "6:18:57"} | |
| {"current_steps": 500, "total_steps": 5907, "eval_loss": 0.3276064395904541, "epoch": 0.25393600812595224, "percentage": 8.46, "elapsed_time": "0:37:04", "remaining_time": "6:40:53"} | |
| {"current_steps": 510, "total_steps": 5907, "loss": 0.3102, "lr": 4.306260575296109e-06, "epoch": 0.25901472828847133, "percentage": 8.63, "elapsed_time": "0:40:21", "remaining_time": "7:07:03"} | |
| {"current_steps": 520, "total_steps": 5907, "loss": 0.3044, "lr": 4.390862944162436e-06, "epoch": 0.26409344845099036, "percentage": 8.8, "elapsed_time": "0:41:09", "remaining_time": "7:06:21"} | |
| {"current_steps": 530, "total_steps": 5907, "loss": 0.3178, "lr": 4.475465313028765e-06, "epoch": 0.2691721686135094, "percentage": 8.97, "elapsed_time": "0:41:41", "remaining_time": "7:02:59"} | |
| {"current_steps": 540, "total_steps": 5907, "loss": 0.3224, "lr": 4.560067681895093e-06, "epoch": 0.27425088877602843, "percentage": 9.14, "elapsed_time": "0:42:11", "remaining_time": "6:59:16"} | |
| {"current_steps": 550, "total_steps": 5907, "loss": 0.3232, "lr": 4.644670050761422e-06, "epoch": 0.27932960893854747, "percentage": 9.31, "elapsed_time": "0:43:08", "remaining_time": "7:00:12"} | |
| {"current_steps": 560, "total_steps": 5907, "loss": 0.3224, "lr": 4.72927241962775e-06, "epoch": 0.2844083291010665, "percentage": 9.48, "elapsed_time": "0:43:55", "remaining_time": "6:59:22"} | |
| {"current_steps": 570, "total_steps": 5907, "loss": 0.3204, "lr": 4.813874788494079e-06, "epoch": 0.2894870492635856, "percentage": 9.65, "elapsed_time": "0:44:29", "remaining_time": "6:56:35"} | |
| {"current_steps": 580, "total_steps": 5907, "loss": 0.3249, "lr": 4.898477157360406e-06, "epoch": 0.29456576942610463, "percentage": 9.82, "elapsed_time": "0:44:59", "remaining_time": "6:53:15"} | |
| {"current_steps": 590, "total_steps": 5907, "loss": 0.3185, "lr": 4.983079526226735e-06, "epoch": 0.29964448958862366, "percentage": 9.99, "elapsed_time": "0:45:31", "remaining_time": "6:50:17"} | |
| {"current_steps": 600, "total_steps": 5907, "loss": 0.3197, "lr": 4.999972060477541e-06, "epoch": 0.3047232097511427, "percentage": 10.16, "elapsed_time": "0:46:20", "remaining_time": "6:49:51"} | |
| {"current_steps": 610, "total_steps": 5907, "loss": 0.316, "lr": 4.999858557237848e-06, "epoch": 0.30980192991366173, "percentage": 10.33, "elapsed_time": "0:46:56", "remaining_time": "6:47:36"} | |
| {"current_steps": 620, "total_steps": 5907, "loss": 0.3256, "lr": 4.999657748021748e-06, "epoch": 0.3148806500761808, "percentage": 10.5, "elapsed_time": "0:47:41", "remaining_time": "6:46:37"} | |
| {"current_steps": 630, "total_steps": 5907, "loss": 0.3253, "lr": 4.999369639842375e-06, "epoch": 0.31995937023869986, "percentage": 10.67, "elapsed_time": "0:48:10", "remaining_time": "6:43:35"} | |
| {"current_steps": 640, "total_steps": 5907, "loss": 0.3142, "lr": 4.998994242761724e-06, "epoch": 0.3250380904012189, "percentage": 10.83, "elapsed_time": "0:48:56", "remaining_time": "6:42:47"} | |
| {"current_steps": 650, "total_steps": 5907, "loss": 0.3117, "lr": 4.998531569890301e-06, "epoch": 0.33011681056373793, "percentage": 11.0, "elapsed_time": "0:49:42", "remaining_time": "6:41:59"} | |
| {"current_steps": 660, "total_steps": 5907, "loss": 0.3089, "lr": 4.997981637386663e-06, "epoch": 0.33519553072625696, "percentage": 11.17, "elapsed_time": "0:50:27", "remaining_time": "6:41:05"} | |
| {"current_steps": 670, "total_steps": 5907, "loss": 0.3121, "lr": 4.997344464456854e-06, "epoch": 0.34027425088877605, "percentage": 11.34, "elapsed_time": "0:51:02", "remaining_time": "6:38:59"} | |
| {"current_steps": 680, "total_steps": 5907, "loss": 0.3268, "lr": 4.9966200733537345e-06, "epoch": 0.3453529710512951, "percentage": 11.51, "elapsed_time": "0:51:33", "remaining_time": "6:36:15"} | |
| {"current_steps": 690, "total_steps": 5907, "loss": 0.3214, "lr": 4.995808489376206e-06, "epoch": 0.3504316912138141, "percentage": 11.68, "elapsed_time": "0:52:23", "remaining_time": "6:36:04"} | |
| {"current_steps": 700, "total_steps": 5907, "loss": 0.3148, "lr": 4.9949097408683235e-06, "epoch": 0.35551041137633316, "percentage": 11.85, "elapsed_time": "0:53:27", "remaining_time": "6:37:40"} | |
| {"current_steps": 710, "total_steps": 5907, "loss": 0.3108, "lr": 4.99392385921831e-06, "epoch": 0.3605891315388522, "percentage": 12.02, "elapsed_time": "0:54:00", "remaining_time": "6:35:22"} | |
| {"current_steps": 720, "total_steps": 5907, "loss": 0.3189, "lr": 4.992850878857458e-06, "epoch": 0.3656678517013712, "percentage": 12.19, "elapsed_time": "0:54:32", "remaining_time": "6:32:52"} | |
| {"current_steps": 730, "total_steps": 5907, "loss": 0.3157, "lr": 4.991690837258926e-06, "epoch": 0.3707465718638903, "percentage": 12.36, "elapsed_time": "0:55:34", "remaining_time": "6:34:04"} | |
| {"current_steps": 740, "total_steps": 5907, "loss": 0.3295, "lr": 4.990443774936432e-06, "epoch": 0.37582529202640935, "percentage": 12.53, "elapsed_time": "0:56:20", "remaining_time": "6:33:21"} | |
| {"current_steps": 750, "total_steps": 5907, "loss": 0.3223, "lr": 4.989109735442838e-06, "epoch": 0.3809040121889284, "percentage": 12.7, "elapsed_time": "0:56:57", "remaining_time": "6:31:41"} | |
| {"current_steps": 760, "total_steps": 5907, "loss": 0.3068, "lr": 4.987688765368628e-06, "epoch": 0.3859827323514474, "percentage": 12.87, "elapsed_time": "0:57:32", "remaining_time": "6:29:40"} | |
| {"current_steps": 770, "total_steps": 5907, "loss": 0.3145, "lr": 4.986180914340281e-06, "epoch": 0.39106145251396646, "percentage": 13.04, "elapsed_time": "0:58:01", "remaining_time": "6:27:09"} | |
| {"current_steps": 780, "total_steps": 5907, "loss": 0.3155, "lr": 4.9845862350185405e-06, "epoch": 0.39614017267648555, "percentage": 13.2, "elapsed_time": "0:58:33", "remaining_time": "6:24:55"} | |
| {"current_steps": 790, "total_steps": 5907, "loss": 0.3274, "lr": 4.98290478309657e-06, "epoch": 0.4012188928390046, "percentage": 13.37, "elapsed_time": "0:59:07", "remaining_time": "6:22:55"} | |
| {"current_steps": 800, "total_steps": 5907, "loss": 0.314, "lr": 4.981136617298012e-06, "epoch": 0.4062976130015236, "percentage": 13.54, "elapsed_time": "0:59:40", "remaining_time": "6:20:57"} | |
| {"current_steps": 810, "total_steps": 5907, "loss": 0.3182, "lr": 4.97928179937494e-06, "epoch": 0.41137633316404265, "percentage": 13.71, "elapsed_time": "1:00:13", "remaining_time": "6:18:55"} | |
| {"current_steps": 820, "total_steps": 5907, "loss": 0.3152, "lr": 4.977340394105692e-06, "epoch": 0.4164550533265617, "percentage": 13.88, "elapsed_time": "1:00:56", "remaining_time": "6:18:01"} | |
| {"current_steps": 830, "total_steps": 5907, "loss": 0.3084, "lr": 4.975312469292618e-06, "epoch": 0.4215337734890808, "percentage": 14.05, "elapsed_time": "1:01:38", "remaining_time": "6:17:03"} | |
| {"current_steps": 840, "total_steps": 5907, "loss": 0.3054, "lr": 4.973198095759708e-06, "epoch": 0.4266124936515998, "percentage": 14.22, "elapsed_time": "1:02:43", "remaining_time": "6:18:21"} | |
| {"current_steps": 850, "total_steps": 5907, "loss": 0.3067, "lr": 4.970997347350117e-06, "epoch": 0.43169121381411885, "percentage": 14.39, "elapsed_time": "1:03:31", "remaining_time": "6:17:54"} | |
| {"current_steps": 860, "total_steps": 5907, "loss": 0.3112, "lr": 4.96871030092359e-06, "epoch": 0.4367699339766379, "percentage": 14.56, "elapsed_time": "1:04:02", "remaining_time": "6:15:50"} | |
| {"current_steps": 870, "total_steps": 5907, "loss": 0.3075, "lr": 4.966337036353775e-06, "epoch": 0.4418486541391569, "percentage": 14.73, "elapsed_time": "1:04:49", "remaining_time": "6:15:20"} | |
| {"current_steps": 880, "total_steps": 5907, "loss": 0.3166, "lr": 4.963877636525431e-06, "epoch": 0.44692737430167595, "percentage": 14.9, "elapsed_time": "1:05:25", "remaining_time": "6:13:44"} | |
| {"current_steps": 890, "total_steps": 5907, "loss": 0.3221, "lr": 4.961332187331541e-06, "epoch": 0.45200609446419504, "percentage": 15.07, "elapsed_time": "1:05:58", "remaining_time": "6:11:52"} | |
| {"current_steps": 900, "total_steps": 5907, "loss": 0.3168, "lr": 4.958700777670306e-06, "epoch": 0.4570848146267141, "percentage": 15.24, "elapsed_time": "1:06:28", "remaining_time": "6:09:51"} | |
| {"current_steps": 910, "total_steps": 5907, "loss": 0.3168, "lr": 4.955983499442039e-06, "epoch": 0.4621635347892331, "percentage": 15.41, "elapsed_time": "1:07:15", "remaining_time": "6:09:17"} | |
| {"current_steps": 920, "total_steps": 5907, "loss": 0.3175, "lr": 4.953180447545965e-06, "epoch": 0.46724225495175215, "percentage": 15.57, "elapsed_time": "1:07:50", "remaining_time": "6:07:45"} | |
| {"current_steps": 930, "total_steps": 5907, "loss": 0.3217, "lr": 4.950291719876891e-06, "epoch": 0.4723209751142712, "percentage": 15.74, "elapsed_time": "1:08:24", "remaining_time": "6:06:08"} | |
| {"current_steps": 940, "total_steps": 5907, "loss": 0.2997, "lr": 4.947317417321803e-06, "epoch": 0.47739969527679027, "percentage": 15.91, "elapsed_time": "1:09:03", "remaining_time": "6:04:54"} | |
| {"current_steps": 950, "total_steps": 5907, "loss": 0.3148, "lr": 4.944257643756333e-06, "epoch": 0.4824784154393093, "percentage": 16.08, "elapsed_time": "1:10:11", "remaining_time": "6:06:12"} | |
| {"current_steps": 960, "total_steps": 5907, "loss": 0.3132, "lr": 4.941112506041135e-06, "epoch": 0.48755713560182834, "percentage": 16.25, "elapsed_time": "1:10:43", "remaining_time": "6:04:26"} | |
| {"current_steps": 970, "total_steps": 5907, "loss": 0.3141, "lr": 4.93788211401815e-06, "epoch": 0.4926358557643474, "percentage": 16.42, "elapsed_time": "1:11:39", "remaining_time": "6:04:45"} | |
| {"current_steps": 980, "total_steps": 5907, "loss": 0.3175, "lr": 4.9345665805067735e-06, "epoch": 0.4977145759268664, "percentage": 16.59, "elapsed_time": "1:12:26", "remaining_time": "6:04:11"} | |
| {"current_steps": 990, "total_steps": 5907, "loss": 0.3055, "lr": 4.931166021299914e-06, "epoch": 0.5027932960893855, "percentage": 16.76, "elapsed_time": "1:13:00", "remaining_time": "6:02:38"} | |
| {"current_steps": 1000, "total_steps": 5907, "loss": 0.3095, "lr": 4.927680555159946e-06, "epoch": 0.5078720162519045, "percentage": 16.93, "elapsed_time": "1:13:54", "remaining_time": "6:02:40"} | |
| {"current_steps": 1000, "total_steps": 5907, "eval_loss": 0.3185386657714844, "epoch": 0.5078720162519045, "percentage": 16.93, "elapsed_time": "1:15:56", "remaining_time": "6:12:38"} | |
| {"current_steps": 1010, "total_steps": 5907, "loss": 0.3165, "lr": 4.924110303814567e-06, "epoch": 0.5129507364144236, "percentage": 17.1, "elapsed_time": "1:20:12", "remaining_time": "6:28:51"} | |
| {"current_steps": 1020, "total_steps": 5907, "loss": 0.303, "lr": 4.920455391952543e-06, "epoch": 0.5180294565769427, "percentage": 17.27, "elapsed_time": "1:20:57", "remaining_time": "6:27:51"} | |
| {"current_steps": 1030, "total_steps": 5907, "loss": 0.3015, "lr": 4.916715947219356e-06, "epoch": 0.5231081767394616, "percentage": 17.44, "elapsed_time": "1:21:57", "remaining_time": "6:28:02"} | |
| {"current_steps": 1040, "total_steps": 5907, "loss": 0.3054, "lr": 4.912892100212744e-06, "epoch": 0.5281868969019807, "percentage": 17.61, "elapsed_time": "1:22:33", "remaining_time": "6:26:21"} | |
| {"current_steps": 1050, "total_steps": 5907, "loss": 0.2985, "lr": 4.908983984478141e-06, "epoch": 0.5332656170644997, "percentage": 17.78, "elapsed_time": "1:23:22", "remaining_time": "6:25:42"} | |
| {"current_steps": 1060, "total_steps": 5907, "loss": 0.3138, "lr": 4.9049917365040135e-06, "epoch": 0.5383443372270188, "percentage": 17.94, "elapsed_time": "1:23:56", "remaining_time": "6:23:49"} | |
| {"current_steps": 1070, "total_steps": 5907, "loss": 0.3073, "lr": 4.900915495717092e-06, "epoch": 0.5434230573895379, "percentage": 18.11, "elapsed_time": "1:24:29", "remaining_time": "6:21:57"} | |
| {"current_steps": 1080, "total_steps": 5907, "loss": 0.3047, "lr": 4.896755404477505e-06, "epoch": 0.5485017775520569, "percentage": 18.28, "elapsed_time": "1:24:58", "remaining_time": "6:19:47"} | |
| {"current_steps": 1090, "total_steps": 5907, "loss": 0.3178, "lr": 4.892511608073804e-06, "epoch": 0.553580497714576, "percentage": 18.45, "elapsed_time": "1:25:45", "remaining_time": "6:18:57"} | |
| {"current_steps": 1100, "total_steps": 5907, "loss": 0.314, "lr": 4.888184254717886e-06, "epoch": 0.5586592178770949, "percentage": 18.62, "elapsed_time": "1:26:24", "remaining_time": "6:17:35"} | |
| {"current_steps": 1110, "total_steps": 5907, "loss": 0.3187, "lr": 4.88377349553983e-06, "epoch": 0.563737938039614, "percentage": 18.79, "elapsed_time": "1:26:55", "remaining_time": "6:15:39"} | |
| {"current_steps": 1120, "total_steps": 5907, "loss": 0.3152, "lr": 4.879279484582603e-06, "epoch": 0.568816658202133, "percentage": 18.96, "elapsed_time": "1:27:34", "remaining_time": "6:14:16"} | |
| {"current_steps": 1130, "total_steps": 5907, "loss": 0.3064, "lr": 4.874702378796694e-06, "epoch": 0.5738953783646521, "percentage": 19.13, "elapsed_time": "1:28:04", "remaining_time": "6:12:19"} | |
| {"current_steps": 1140, "total_steps": 5907, "loss": 0.3083, "lr": 4.870042338034618e-06, "epoch": 0.5789740985271712, "percentage": 19.3, "elapsed_time": "1:28:36", "remaining_time": "6:10:32"} | |
| {"current_steps": 1150, "total_steps": 5907, "loss": 0.3011, "lr": 4.8652995250453515e-06, "epoch": 0.5840528186896902, "percentage": 19.47, "elapsed_time": "1:29:09", "remaining_time": "6:08:48"} | |
| {"current_steps": 1160, "total_steps": 5907, "loss": 0.3105, "lr": 4.86047410546863e-06, "epoch": 0.5891315388522093, "percentage": 19.64, "elapsed_time": "1:29:41", "remaining_time": "6:07:00"} | |
| {"current_steps": 1170, "total_steps": 5907, "loss": 0.3118, "lr": 4.855566247829177e-06, "epoch": 0.5942102590147282, "percentage": 19.81, "elapsed_time": "1:30:14", "remaining_time": "6:05:22"} | |
| {"current_steps": 1180, "total_steps": 5907, "loss": 0.3057, "lr": 4.85057612353081e-06, "epoch": 0.5992889791772473, "percentage": 19.98, "elapsed_time": "1:30:46", "remaining_time": "6:03:38"} | |
| {"current_steps": 1190, "total_steps": 5907, "loss": 0.3081, "lr": 4.845503906850461e-06, "epoch": 0.6043676993397664, "percentage": 20.15, "elapsed_time": "1:31:18", "remaining_time": "6:01:54"} | |
| {"current_steps": 1200, "total_steps": 5907, "loss": 0.3101, "lr": 4.840349774932081e-06, "epoch": 0.6094464195022854, "percentage": 20.31, "elapsed_time": "1:31:49", "remaining_time": "6:00:11"} | |
| {"current_steps": 1210, "total_steps": 5907, "loss": 0.3162, "lr": 4.835113907780464e-06, "epoch": 0.6145251396648045, "percentage": 20.48, "elapsed_time": "1:32:19", "remaining_time": "5:58:24"} | |
| {"current_steps": 1220, "total_steps": 5907, "loss": 0.3104, "lr": 4.829796488254954e-06, "epoch": 0.6196038598273235, "percentage": 20.65, "elapsed_time": "1:32:53", "remaining_time": "5:56:54"} | |
| {"current_steps": 1230, "total_steps": 5907, "loss": 0.3225, "lr": 4.824397702063058e-06, "epoch": 0.6246825799898426, "percentage": 20.82, "elapsed_time": "1:33:25", "remaining_time": "5:55:14"} | |
| {"current_steps": 1240, "total_steps": 5907, "loss": 0.3172, "lr": 4.8189177377539635e-06, "epoch": 0.6297613001523616, "percentage": 20.99, "elapsed_time": "1:34:13", "remaining_time": "5:54:38"} | |
| {"current_steps": 1250, "total_steps": 5907, "loss": 0.3091, "lr": 4.8133567867119525e-06, "epoch": 0.6348400203148806, "percentage": 21.16, "elapsed_time": "1:34:50", "remaining_time": "5:53:19"} | |
| {"current_steps": 1260, "total_steps": 5907, "loss": 0.3119, "lr": 4.8077150431497175e-06, "epoch": 0.6399187404773997, "percentage": 21.33, "elapsed_time": "1:36:04", "remaining_time": "5:54:20"} | |
| {"current_steps": 1270, "total_steps": 5907, "loss": 0.3121, "lr": 4.801992704101578e-06, "epoch": 0.6449974606399187, "percentage": 21.5, "elapsed_time": "1:36:33", "remaining_time": "5:52:32"} | |
| {"current_steps": 1280, "total_steps": 5907, "loss": 0.3042, "lr": 4.796189969416601e-06, "epoch": 0.6500761808024378, "percentage": 21.67, "elapsed_time": "1:37:17", "remaining_time": "5:51:43"} | |
| {"current_steps": 1290, "total_steps": 5907, "loss": 0.3031, "lr": 4.790307041751617e-06, "epoch": 0.6551549009649569, "percentage": 21.84, "elapsed_time": "1:37:48", "remaining_time": "5:50:04"} | |
| {"current_steps": 1300, "total_steps": 5907, "loss": 0.305, "lr": 4.78434412656415e-06, "epoch": 0.6602336211274759, "percentage": 22.01, "elapsed_time": "1:38:41", "remaining_time": "5:49:43"} | |
| {"current_steps": 1310, "total_steps": 5907, "loss": 0.3066, "lr": 4.778301432105234e-06, "epoch": 0.665312341289995, "percentage": 22.18, "elapsed_time": "1:39:25", "remaining_time": "5:48:54"} | |
| {"current_steps": 1320, "total_steps": 5907, "loss": 0.3023, "lr": 4.772179169412146e-06, "epoch": 0.6703910614525139, "percentage": 22.35, "elapsed_time": "1:40:02", "remaining_time": "5:47:38"} | |
| {"current_steps": 1330, "total_steps": 5907, "loss": 0.3093, "lr": 4.765977552301031e-06, "epoch": 0.675469781615033, "percentage": 22.52, "elapsed_time": "1:41:00", "remaining_time": "5:47:37"} | |
| {"current_steps": 1340, "total_steps": 5907, "loss": 0.3084, "lr": 4.759696797359438e-06, "epoch": 0.6805485017775521, "percentage": 22.68, "elapsed_time": "1:41:35", "remaining_time": "5:46:14"} | |
| {"current_steps": 1350, "total_steps": 5907, "loss": 0.3057, "lr": 4.753337123938754e-06, "epoch": 0.6856272219400711, "percentage": 22.85, "elapsed_time": "1:42:32", "remaining_time": "5:46:08"} | |
| {"current_steps": 1360, "total_steps": 5907, "loss": 0.3034, "lr": 4.746898754146545e-06, "epoch": 0.6907059421025902, "percentage": 23.02, "elapsed_time": "1:43:14", "remaining_time": "5:45:09"} | |
| {"current_steps": 1370, "total_steps": 5907, "loss": 0.2972, "lr": 4.740381912838797e-06, "epoch": 0.6957846622651092, "percentage": 23.19, "elapsed_time": "1:43:59", "remaining_time": "5:44:23"} | |
| {"current_steps": 1380, "total_steps": 5907, "loss": 0.3075, "lr": 4.733786827612064e-06, "epoch": 0.7008633824276282, "percentage": 23.36, "elapsed_time": "1:44:59", "remaining_time": "5:44:23"} | |
| {"current_steps": 1390, "total_steps": 5907, "loss": 0.3057, "lr": 4.72711372879552e-06, "epoch": 0.7059421025901473, "percentage": 23.53, "elapsed_time": "1:45:29", "remaining_time": "5:42:47"} | |
| {"current_steps": 1400, "total_steps": 5907, "loss": 0.3118, "lr": 4.720362849442912e-06, "epoch": 0.7110208227526663, "percentage": 23.7, "elapsed_time": "1:46:09", "remaining_time": "5:41:44"} | |
| {"current_steps": 1410, "total_steps": 5907, "loss": 0.3049, "lr": 4.713534425324426e-06, "epoch": 0.7160995429151854, "percentage": 23.87, "elapsed_time": "1:46:40", "remaining_time": "5:40:12"} | |
| {"current_steps": 1420, "total_steps": 5907, "loss": 0.3107, "lr": 4.706628694918448e-06, "epoch": 0.7211782630777044, "percentage": 24.04, "elapsed_time": "1:47:12", "remaining_time": "5:38:46"} | |
| {"current_steps": 1430, "total_steps": 5907, "loss": 0.3074, "lr": 4.699645899403238e-06, "epoch": 0.7262569832402235, "percentage": 24.21, "elapsed_time": "1:47:57", "remaining_time": "5:37:58"} | |
| {"current_steps": 1440, "total_steps": 5907, "loss": 0.3082, "lr": 4.692586282648504e-06, "epoch": 0.7313357034027425, "percentage": 24.38, "elapsed_time": "1:48:29", "remaining_time": "5:36:32"} | |
| {"current_steps": 1450, "total_steps": 5907, "loss": 0.3127, "lr": 4.685450091206893e-06, "epoch": 0.7364144235652615, "percentage": 24.55, "elapsed_time": "1:49:02", "remaining_time": "5:35:11"} | |
| {"current_steps": 1460, "total_steps": 5907, "loss": 0.3018, "lr": 4.678237574305364e-06, "epoch": 0.7414931437277806, "percentage": 24.72, "elapsed_time": "1:49:50", "remaining_time": "5:34:33"} | |
| {"current_steps": 1470, "total_steps": 5907, "loss": 0.3024, "lr": 4.670948983836505e-06, "epoch": 0.7465718638902996, "percentage": 24.89, "elapsed_time": "1:50:24", "remaining_time": "5:33:14"} | |
| {"current_steps": 1480, "total_steps": 5907, "loss": 0.3167, "lr": 4.66358457434972e-06, "epoch": 0.7516505840528187, "percentage": 25.06, "elapsed_time": "1:51:17", "remaining_time": "5:32:54"} | |
| {"current_steps": 1490, "total_steps": 5907, "loss": 0.3064, "lr": 4.6561446030423435e-06, "epoch": 0.7567293042153377, "percentage": 25.22, "elapsed_time": "1:52:25", "remaining_time": "5:33:16"} | |
| {"current_steps": 1500, "total_steps": 5907, "loss": 0.308, "lr": 4.648629329750662e-06, "epoch": 0.7618080243778568, "percentage": 25.39, "elapsed_time": "1:53:01", "remaining_time": "5:32:04"} | |
| {"current_steps": 1500, "total_steps": 5907, "eval_loss": 0.3137528896331787, "epoch": 0.7618080243778568, "percentage": 25.39, "elapsed_time": "1:55:03", "remaining_time": "5:38:02"} | |
| {"current_steps": 1510, "total_steps": 5907, "loss": 0.3086, "lr": 4.641039016940832e-06, "epoch": 0.7668867445403759, "percentage": 25.56, "elapsed_time": "1:58:24", "remaining_time": "5:44:49"} | |
| {"current_steps": 1520, "total_steps": 5907, "loss": 0.3, "lr": 4.6333739296997205e-06, "epoch": 0.7719654647028948, "percentage": 25.73, "elapsed_time": "1:59:38", "remaining_time": "5:45:17"} | |
| {"current_steps": 1530, "total_steps": 5907, "loss": 0.3134, "lr": 4.625634335725644e-06, "epoch": 0.7770441848654139, "percentage": 25.9, "elapsed_time": "2:00:27", "remaining_time": "5:44:37"} | |
| {"current_steps": 1540, "total_steps": 5907, "loss": 0.3076, "lr": 4.617820505319018e-06, "epoch": 0.7821229050279329, "percentage": 26.07, "elapsed_time": "2:00:59", "remaining_time": "5:43:06"} | |
| {"current_steps": 1550, "total_steps": 5907, "loss": 0.3141, "lr": 4.609932711372921e-06, "epoch": 0.787201625190452, "percentage": 26.24, "elapsed_time": "2:01:32", "remaining_time": "5:41:39"} | |
| {"current_steps": 1560, "total_steps": 5907, "loss": 0.3053, "lr": 4.601971229363558e-06, "epoch": 0.7922803453529711, "percentage": 26.41, "elapsed_time": "2:02:04", "remaining_time": "5:40:09"} | |
| {"current_steps": 1570, "total_steps": 5907, "loss": 0.3123, "lr": 4.593936337340645e-06, "epoch": 0.7973590655154901, "percentage": 26.58, "elapsed_time": "2:03:09", "remaining_time": "5:40:13"} | |
| {"current_steps": 1580, "total_steps": 5907, "loss": 0.2999, "lr": 4.5858283159176955e-06, "epoch": 0.8024377856780092, "percentage": 26.75, "elapsed_time": "2:03:43", "remaining_time": "5:38:48"} | |
| {"current_steps": 1590, "total_steps": 5907, "loss": 0.3077, "lr": 4.57764744826222e-06, "epoch": 0.8075165058405281, "percentage": 26.92, "elapsed_time": "2:04:38", "remaining_time": "5:38:24"} | |
| {"current_steps": 1600, "total_steps": 5907, "loss": 0.3104, "lr": 4.569394020085841e-06, "epoch": 0.8125952260030472, "percentage": 27.09, "elapsed_time": "2:05:16", "remaining_time": "5:37:12"} | |
| {"current_steps": 1610, "total_steps": 5907, "loss": 0.2998, "lr": 4.561068319634307e-06, "epoch": 0.8176739461655663, "percentage": 27.26, "elapsed_time": "2:06:06", "remaining_time": "5:36:33"} | |
| {"current_steps": 1620, "total_steps": 5907, "loss": 0.3011, "lr": 4.552670637677432e-06, "epoch": 0.8227526663280853, "percentage": 27.43, "elapsed_time": "2:06:40", "remaining_time": "5:35:12"} | |
| {"current_steps": 1630, "total_steps": 5907, "loss": 0.3042, "lr": 4.544201267498939e-06, "epoch": 0.8278313864906044, "percentage": 27.59, "elapsed_time": "2:07:15", "remaining_time": "5:33:55"} | |
| {"current_steps": 1640, "total_steps": 5907, "loss": 0.3079, "lr": 4.535660504886215e-06, "epoch": 0.8329101066531234, "percentage": 27.76, "elapsed_time": "2:07:48", "remaining_time": "5:32:32"} | |
| {"current_steps": 1650, "total_steps": 5907, "loss": 0.3002, "lr": 4.527048648119986e-06, "epoch": 0.8379888268156425, "percentage": 27.93, "elapsed_time": "2:08:36", "remaining_time": "5:31:47"} | |
| {"current_steps": 1660, "total_steps": 5907, "loss": 0.3117, "lr": 4.5183659979638905e-06, "epoch": 0.8430675469781616, "percentage": 28.1, "elapsed_time": "2:09:07", "remaining_time": "5:30:20"} | |
| {"current_steps": 1670, "total_steps": 5907, "loss": 0.3079, "lr": 4.509612857653987e-06, "epoch": 0.8481462671406805, "percentage": 28.27, "elapsed_time": "2:09:37", "remaining_time": "5:28:51"} | |
| {"current_steps": 1680, "total_steps": 5907, "loss": 0.2998, "lr": 4.500789532888154e-06, "epoch": 0.8532249873031996, "percentage": 28.44, "elapsed_time": "2:10:21", "remaining_time": "5:27:59"} | |
| {"current_steps": 1690, "total_steps": 5907, "loss": 0.3003, "lr": 4.49189633181542e-06, "epoch": 0.8583037074657186, "percentage": 28.61, "elapsed_time": "2:10:56", "remaining_time": "5:26:43"} | |
| {"current_steps": 1700, "total_steps": 5907, "loss": 0.3118, "lr": 4.482933565025198e-06, "epoch": 0.8633824276282377, "percentage": 28.78, "elapsed_time": "2:11:42", "remaining_time": "5:25:57"} | |
| {"current_steps": 1710, "total_steps": 5907, "loss": 0.3056, "lr": 4.47390154553644e-06, "epoch": 0.8684611477907568, "percentage": 28.95, "elapsed_time": "2:12:29", "remaining_time": "5:25:10"} | |
| {"current_steps": 1720, "total_steps": 5907, "loss": 0.2969, "lr": 4.4648005887867064e-06, "epoch": 0.8735398679532758, "percentage": 29.12, "elapsed_time": "2:13:02", "remaining_time": "5:23:51"} | |
| {"current_steps": 1730, "total_steps": 5907, "loss": 0.3068, "lr": 4.455631012621143e-06, "epoch": 0.8786185881157949, "percentage": 29.29, "elapsed_time": "2:13:35", "remaining_time": "5:22:32"} | |
| {"current_steps": 1740, "total_steps": 5907, "loss": 0.304, "lr": 4.4463931372813914e-06, "epoch": 0.8836973082783138, "percentage": 29.46, "elapsed_time": "2:14:10", "remaining_time": "5:21:19"} | |
| {"current_steps": 1750, "total_steps": 5907, "loss": 0.298, "lr": 4.4370872853943936e-06, "epoch": 0.8887760284408329, "percentage": 29.63, "elapsed_time": "2:15:09", "remaining_time": "5:21:04"} | |
| {"current_steps": 1760, "total_steps": 5907, "loss": 0.2996, "lr": 4.427713781961132e-06, "epoch": 0.8938547486033519, "percentage": 29.8, "elapsed_time": "2:15:43", "remaining_time": "5:19:47"} | |
| {"current_steps": 1770, "total_steps": 5907, "loss": 0.2929, "lr": 4.4182729543452765e-06, "epoch": 0.898933468765871, "percentage": 29.96, "elapsed_time": "2:16:15", "remaining_time": "5:18:29"} | |
| {"current_steps": 1780, "total_steps": 5907, "loss": 0.3088, "lr": 4.408765132261749e-06, "epoch": 0.9040121889283901, "percentage": 30.13, "elapsed_time": "2:17:35", "remaining_time": "5:19:00"} | |
| {"current_steps": 1790, "total_steps": 5907, "loss": 0.3059, "lr": 4.399190647765213e-06, "epoch": 0.9090909090909091, "percentage": 30.3, "elapsed_time": "2:18:08", "remaining_time": "5:17:43"} | |
| {"current_steps": 1800, "total_steps": 5907, "loss": 0.3088, "lr": 4.389549835238473e-06, "epoch": 0.9141696292534282, "percentage": 30.47, "elapsed_time": "2:18:54", "remaining_time": "5:16:56"} | |
| {"current_steps": 1810, "total_steps": 5907, "loss": 0.3025, "lr": 4.379843031380801e-06, "epoch": 0.9192483494159471, "percentage": 30.64, "elapsed_time": "2:19:54", "remaining_time": "5:16:40"} | |
| {"current_steps": 1820, "total_steps": 5907, "loss": 0.3056, "lr": 4.370070575196172e-06, "epoch": 0.9243270695784662, "percentage": 30.81, "elapsed_time": "2:20:25", "remaining_time": "5:15:20"} | |
| {"current_steps": 1830, "total_steps": 5907, "loss": 0.3001, "lr": 4.360232807981426e-06, "epoch": 0.9294057897409853, "percentage": 30.98, "elapsed_time": "2:21:22", "remaining_time": "5:14:58"} | |
| {"current_steps": 1840, "total_steps": 5907, "loss": 0.3101, "lr": 4.350330073314351e-06, "epoch": 0.9344845099035043, "percentage": 31.15, "elapsed_time": "2:22:33", "remaining_time": "5:15:05"} | |
| {"current_steps": 1850, "total_steps": 5907, "loss": 0.3055, "lr": 4.340362717041682e-06, "epoch": 0.9395632300660234, "percentage": 31.32, "elapsed_time": "2:23:04", "remaining_time": "5:13:46"} | |
| {"current_steps": 1860, "total_steps": 5907, "loss": 0.3018, "lr": 4.3303310872670226e-06, "epoch": 0.9446419502285424, "percentage": 31.49, "elapsed_time": "2:23:38", "remaining_time": "5:12:31"} | |
| {"current_steps": 1870, "total_steps": 5907, "loss": 0.2943, "lr": 4.320235534338685e-06, "epoch": 0.9497206703910615, "percentage": 31.66, "elapsed_time": "2:24:09", "remaining_time": "5:11:13"} | |
| {"current_steps": 1880, "total_steps": 5907, "loss": 0.3026, "lr": 4.310076410837463e-06, "epoch": 0.9547993905535805, "percentage": 31.83, "elapsed_time": "2:24:54", "remaining_time": "5:10:22"} | |
| {"current_steps": 1890, "total_steps": 5907, "loss": 0.2926, "lr": 4.299854071564307e-06, "epoch": 0.9598781107160995, "percentage": 32.0, "elapsed_time": "2:25:29", "remaining_time": "5:09:14"} | |
| {"current_steps": 1900, "total_steps": 5907, "loss": 0.304, "lr": 4.289568873527941e-06, "epoch": 0.9649568308786186, "percentage": 32.17, "elapsed_time": "2:26:16", "remaining_time": "5:08:28"} | |
| {"current_steps": 1910, "total_steps": 5907, "loss": 0.297, "lr": 4.279221175932389e-06, "epoch": 0.9700355510411376, "percentage": 32.33, "elapsed_time": "2:27:19", "remaining_time": "5:08:17"} | |
| {"current_steps": 1920, "total_steps": 5907, "loss": 0.2986, "lr": 4.268811340164436e-06, "epoch": 0.9751142712036567, "percentage": 32.5, "elapsed_time": "2:28:05", "remaining_time": "5:07:30"} | |
| {"current_steps": 1930, "total_steps": 5907, "loss": 0.2957, "lr": 4.258339729781e-06, "epoch": 0.9801929913661758, "percentage": 32.67, "elapsed_time": "2:28:55", "remaining_time": "5:06:51"} | |
| {"current_steps": 1940, "total_steps": 5907, "loss": 0.3023, "lr": 4.24780671049644e-06, "epoch": 0.9852717115286947, "percentage": 32.84, "elapsed_time": "2:29:25", "remaining_time": "5:05:33"} | |
| {"current_steps": 1950, "total_steps": 5907, "loss": 0.3078, "lr": 4.237212650169783e-06, "epoch": 0.9903504316912138, "percentage": 33.01, "elapsed_time": "2:30:32", "remaining_time": "5:05:28"} | |
| {"current_steps": 1960, "total_steps": 5907, "loss": 0.3002, "lr": 4.226557918791872e-06, "epoch": 0.9954291518537328, "percentage": 33.18, "elapsed_time": "2:31:17", "remaining_time": "5:04:39"} | |
| {"current_steps": 1970, "total_steps": 5907, "loss": 0.3125, "lr": 4.215842888472452e-06, "epoch": 1.000507872016252, "percentage": 33.35, "elapsed_time": "2:31:51", "remaining_time": "5:03:29"} | |
| {"current_steps": 1980, "total_steps": 5907, "loss": 0.2544, "lr": 4.205067933427169e-06, "epoch": 1.005586592178771, "percentage": 33.52, "elapsed_time": "2:32:33", "remaining_time": "5:02:34"} | |
| {"current_steps": 1990, "total_steps": 5907, "loss": 0.2535, "lr": 4.194233429964501e-06, "epoch": 1.01066531234129, "percentage": 33.69, "elapsed_time": "2:33:04", "remaining_time": "5:01:18"} | |
| {"current_steps": 2000, "total_steps": 5907, "loss": 0.2496, "lr": 4.183339756472617e-06, "epoch": 1.015744032503809, "percentage": 33.86, "elapsed_time": "2:33:40", "remaining_time": "5:00:12"} | |
| {"current_steps": 2000, "total_steps": 5907, "eval_loss": 0.31630080938339233, "epoch": 1.015744032503809, "percentage": 33.86, "elapsed_time": "2:35:42", "remaining_time": "5:04:10"} | |
| {"current_steps": 2010, "total_steps": 5907, "loss": 0.2465, "lr": 4.172387293406164e-06, "epoch": 1.020822752666328, "percentage": 34.03, "elapsed_time": "2:39:03", "remaining_time": "5:08:22"} | |
| {"current_steps": 2020, "total_steps": 5907, "loss": 0.2479, "lr": 4.161376423272974e-06, "epoch": 1.0259014728288471, "percentage": 34.2, "elapsed_time": "2:39:34", "remaining_time": "5:07:03"} | |
| {"current_steps": 2030, "total_steps": 5907, "loss": 0.25, "lr": 4.150307530620714e-06, "epoch": 1.0309801929913662, "percentage": 34.37, "elapsed_time": "2:40:10", "remaining_time": "5:05:55"} | |
| {"current_steps": 2040, "total_steps": 5907, "loss": 0.2518, "lr": 4.139181002023445e-06, "epoch": 1.0360589131538853, "percentage": 34.54, "elapsed_time": "2:40:55", "remaining_time": "5:05:03"} | |
| {"current_steps": 2050, "total_steps": 5907, "loss": 0.2521, "lr": 4.1279972260681286e-06, "epoch": 1.0411376333164042, "percentage": 34.7, "elapsed_time": "2:41:49", "remaining_time": "5:04:27"} | |
| {"current_steps": 2060, "total_steps": 5907, "loss": 0.2529, "lr": 4.1167565933410575e-06, "epoch": 1.0462163534789233, "percentage": 34.87, "elapsed_time": "2:42:35", "remaining_time": "5:03:38"} | |
| {"current_steps": 2070, "total_steps": 5907, "loss": 0.2508, "lr": 4.105459496414207e-06, "epoch": 1.0512950736414424, "percentage": 35.04, "elapsed_time": "2:43:47", "remaining_time": "5:03:36"} | |
| {"current_steps": 2080, "total_steps": 5907, "loss": 0.247, "lr": 4.094106329831531e-06, "epoch": 1.0563737938039615, "percentage": 35.21, "elapsed_time": "2:44:39", "remaining_time": "5:02:56"} | |
| {"current_steps": 2090, "total_steps": 5907, "loss": 0.2466, "lr": 4.08269749009518e-06, "epoch": 1.0614525139664805, "percentage": 35.38, "elapsed_time": "2:45:10", "remaining_time": "5:01:38"} | |
| {"current_steps": 2100, "total_steps": 5907, "loss": 0.2482, "lr": 4.0712333756516535e-06, "epoch": 1.0665312341289994, "percentage": 35.55, "elapsed_time": "2:45:56", "remaining_time": "5:00:48"} | |
| {"current_steps": 2110, "total_steps": 5907, "loss": 0.2578, "lr": 4.059714386877886e-06, "epoch": 1.0716099542915185, "percentage": 35.72, "elapsed_time": "2:46:30", "remaining_time": "4:59:37"} | |
| {"current_steps": 2120, "total_steps": 5907, "loss": 0.2568, "lr": 4.048140926067262e-06, "epoch": 1.0766886744540376, "percentage": 35.89, "elapsed_time": "2:47:07", "remaining_time": "4:58:32"} | |
| {"current_steps": 2130, "total_steps": 5907, "loss": 0.2496, "lr": 4.036513397415571e-06, "epoch": 1.0817673946165567, "percentage": 36.06, "elapsed_time": "2:48:12", "remaining_time": "4:58:16"} | |
| {"current_steps": 2140, "total_steps": 5907, "loss": 0.2489, "lr": 4.024832207006883e-06, "epoch": 1.0868461147790756, "percentage": 36.23, "elapsed_time": "2:48:42", "remaining_time": "4:56:58"} | |
| {"current_steps": 2150, "total_steps": 5907, "loss": 0.2503, "lr": 4.013097762799372e-06, "epoch": 1.0919248349415946, "percentage": 36.4, "elapsed_time": "2:49:14", "remaining_time": "4:55:44"} | |
| {"current_steps": 2160, "total_steps": 5907, "loss": 0.2489, "lr": 4.001310474611069e-06, "epoch": 1.0970035551041137, "percentage": 36.57, "elapsed_time": "2:49:51", "remaining_time": "4:54:38"} | |
| {"current_steps": 2170, "total_steps": 5907, "loss": 0.2516, "lr": 3.989470754105546e-06, "epoch": 1.1020822752666328, "percentage": 36.74, "elapsed_time": "2:50:24", "remaining_time": "4:53:28"} | |
| {"current_steps": 2180, "total_steps": 5907, "loss": 0.2455, "lr": 3.9775790147775425e-06, "epoch": 1.107160995429152, "percentage": 36.91, "elapsed_time": "2:50:56", "remaining_time": "4:52:14"} | |
| {"current_steps": 2190, "total_steps": 5907, "loss": 0.2539, "lr": 3.96563567193852e-06, "epoch": 1.112239715591671, "percentage": 37.07, "elapsed_time": "2:51:29", "remaining_time": "4:51:03"} | |
| {"current_steps": 2200, "total_steps": 5907, "loss": 0.2473, "lr": 3.953641142702161e-06, "epoch": 1.1173184357541899, "percentage": 37.24, "elapsed_time": "2:52:00", "remaining_time": "4:49:49"} | |
| {"current_steps": 2210, "total_steps": 5907, "loss": 0.2522, "lr": 3.941595845969799e-06, "epoch": 1.122397155916709, "percentage": 37.41, "elapsed_time": "2:52:33", "remaining_time": "4:48:39"} | |
| {"current_steps": 2220, "total_steps": 5907, "loss": 0.2469, "lr": 3.929500202415793e-06, "epoch": 1.127475876079228, "percentage": 37.58, "elapsed_time": "2:53:33", "remaining_time": "4:48:14"} | |
| {"current_steps": 2230, "total_steps": 5907, "loss": 0.2475, "lr": 3.917354634472831e-06, "epoch": 1.1325545962417471, "percentage": 37.75, "elapsed_time": "2:54:06", "remaining_time": "4:47:05"} | |
| {"current_steps": 2240, "total_steps": 5907, "loss": 0.2519, "lr": 3.9051595663171795e-06, "epoch": 1.137633316404266, "percentage": 37.92, "elapsed_time": "2:54:50", "remaining_time": "4:46:13"} | |
| {"current_steps": 2250, "total_steps": 5907, "loss": 0.2417, "lr": 3.892915423853866e-06, "epoch": 1.142712036566785, "percentage": 38.09, "elapsed_time": "2:55:25", "remaining_time": "4:45:06"} | |
| {"current_steps": 2260, "total_steps": 5907, "loss": 0.2489, "lr": 3.880622634701812e-06, "epoch": 1.1477907567293042, "percentage": 38.26, "elapsed_time": "2:56:04", "remaining_time": "4:44:08"} | |
| {"current_steps": 2270, "total_steps": 5907, "loss": 0.2548, "lr": 3.868281628178888e-06, "epoch": 1.1528694768918233, "percentage": 38.43, "elapsed_time": "2:56:37", "remaining_time": "4:42:59"} | |
| {"current_steps": 2280, "total_steps": 5907, "loss": 0.2467, "lr": 3.855892835286931e-06, "epoch": 1.1579481970543424, "percentage": 38.6, "elapsed_time": "2:57:12", "remaining_time": "4:41:54"} | |
| {"current_steps": 2290, "total_steps": 5907, "loss": 0.2486, "lr": 3.843456688696683e-06, "epoch": 1.1630269172168615, "percentage": 38.77, "elapsed_time": "2:58:00", "remaining_time": "4:41:08"} | |
| {"current_steps": 2300, "total_steps": 5907, "loss": 0.248, "lr": 3.830973622732686e-06, "epoch": 1.1681056373793803, "percentage": 38.94, "elapsed_time": "2:58:39", "remaining_time": "4:40:11"} | |
| {"current_steps": 2310, "total_steps": 5907, "loss": 0.2352, "lr": 3.818444073358108e-06, "epoch": 1.1731843575418994, "percentage": 39.11, "elapsed_time": "2:59:12", "remaining_time": "4:39:03"} | |
| {"current_steps": 2320, "total_steps": 5907, "loss": 0.2434, "lr": 3.8058684781595277e-06, "epoch": 1.1782630777044185, "percentage": 39.28, "elapsed_time": "2:59:42", "remaining_time": "4:37:51"} | |
| {"current_steps": 2330, "total_steps": 5907, "loss": 0.2488, "lr": 3.793247276331636e-06, "epoch": 1.1833417978669376, "percentage": 39.44, "elapsed_time": "3:00:12", "remaining_time": "4:36:39"} | |
| {"current_steps": 2340, "total_steps": 5907, "loss": 0.248, "lr": 3.780580908661915e-06, "epoch": 1.1884205180294565, "percentage": 39.61, "elapsed_time": "3:01:00", "remaining_time": "4:35:54"} | |
| {"current_steps": 2350, "total_steps": 5907, "loss": 0.2529, "lr": 3.7678698175152286e-06, "epoch": 1.1934992381919756, "percentage": 39.78, "elapsed_time": "3:01:46", "remaining_time": "4:35:08"} | |
| {"current_steps": 2360, "total_steps": 5907, "loss": 0.2526, "lr": 3.7551144468183824e-06, "epoch": 1.1985779583544947, "percentage": 39.95, "elapsed_time": "3:02:29", "remaining_time": "4:34:16"} | |
| {"current_steps": 2370, "total_steps": 5907, "loss": 0.2478, "lr": 3.7423152420446185e-06, "epoch": 1.2036566785170137, "percentage": 40.12, "elapsed_time": "3:03:02", "remaining_time": "4:33:09"} | |
| {"current_steps": 2380, "total_steps": 5907, "loss": 0.2434, "lr": 3.729472650198054e-06, "epoch": 1.2087353986795328, "percentage": 40.29, "elapsed_time": "3:03:52", "remaining_time": "4:32:30"} | |
| {"current_steps": 2390, "total_steps": 5907, "loss": 0.2401, "lr": 3.716587119798074e-06, "epoch": 1.213814118842052, "percentage": 40.46, "elapsed_time": "3:04:50", "remaining_time": "4:32:00"} | |
| {"current_steps": 2400, "total_steps": 5907, "loss": 0.2511, "lr": 3.703659100863664e-06, "epoch": 1.2188928390045708, "percentage": 40.63, "elapsed_time": "3:05:22", "remaining_time": "4:30:52"} | |
| {"current_steps": 2410, "total_steps": 5907, "loss": 0.2509, "lr": 3.690689044897695e-06, "epoch": 1.2239715591670899, "percentage": 40.8, "elapsed_time": "3:06:19", "remaining_time": "4:30:21"} | |
| {"current_steps": 2420, "total_steps": 5907, "loss": 0.2466, "lr": 3.6776774048711558e-06, "epoch": 1.229050279329609, "percentage": 40.97, "elapsed_time": "3:06:53", "remaining_time": "4:29:17"} | |
| {"current_steps": 2430, "total_steps": 5907, "loss": 0.244, "lr": 3.66462463520733e-06, "epoch": 1.234128999492128, "percentage": 41.14, "elapsed_time": "3:07:29", "remaining_time": "4:28:16"} | |
| {"current_steps": 2440, "total_steps": 5907, "loss": 0.2411, "lr": 3.6515311917659302e-06, "epoch": 1.239207719654647, "percentage": 41.31, "elapsed_time": "3:08:17", "remaining_time": "4:27:33"} | |
| {"current_steps": 2450, "total_steps": 5907, "loss": 0.2556, "lr": 3.6383975318271724e-06, "epoch": 1.244286439817166, "percentage": 41.48, "elapsed_time": "3:09:17", "remaining_time": "4:27:05"} | |
| {"current_steps": 2460, "total_steps": 5907, "loss": 0.243, "lr": 3.6252241140758103e-06, "epoch": 1.2493651599796851, "percentage": 41.65, "elapsed_time": "3:10:02", "remaining_time": "4:26:17"} | |
| {"current_steps": 2470, "total_steps": 5907, "loss": 0.2522, "lr": 3.6120113985851134e-06, "epoch": 1.2544438801422042, "percentage": 41.81, "elapsed_time": "3:10:32", "remaining_time": "4:25:08"} | |
| {"current_steps": 2480, "total_steps": 5907, "loss": 0.2475, "lr": 3.5987598468007993e-06, "epoch": 1.2595226003047233, "percentage": 41.98, "elapsed_time": "3:11:05", "remaining_time": "4:24:03"} | |
| {"current_steps": 2490, "total_steps": 5907, "loss": 0.2534, "lr": 3.585469921524919e-06, "epoch": 1.2646013204672424, "percentage": 42.15, "elapsed_time": "3:11:37", "remaining_time": "4:22:57"} | |
| {"current_steps": 2500, "total_steps": 5907, "loss": 0.248, "lr": 3.5721420868996943e-06, "epoch": 1.2696800406297613, "percentage": 42.32, "elapsed_time": "3:12:06", "remaining_time": "4:21:48"} | |
| {"current_steps": 2500, "total_steps": 5907, "eval_loss": 0.3149872124195099, "epoch": 1.2696800406297613, "percentage": 42.32, "elapsed_time": "3:14:08", "remaining_time": "4:24:34"} | |
| {"current_steps": 2510, "total_steps": 5907, "loss": 0.251, "lr": 3.5587768083913037e-06, "epoch": 1.2747587607922803, "percentage": 42.49, "elapsed_time": "3:17:40", "remaining_time": "4:27:31"} | |
| {"current_steps": 2520, "total_steps": 5907, "loss": 0.2514, "lr": 3.545374552773635e-06, "epoch": 1.2798374809547994, "percentage": 42.66, "elapsed_time": "3:18:13", "remaining_time": "4:26:25"} | |
| {"current_steps": 2530, "total_steps": 5907, "loss": 0.2451, "lr": 3.5319357881119733e-06, "epoch": 1.2849162011173183, "percentage": 42.83, "elapsed_time": "3:18:58", "remaining_time": "4:25:35"} | |
| {"current_steps": 2540, "total_steps": 5907, "loss": 0.248, "lr": 3.518460983746661e-06, "epoch": 1.2899949212798374, "percentage": 43.0, "elapsed_time": "3:19:41", "remaining_time": "4:24:43"} | |
| {"current_steps": 2550, "total_steps": 5907, "loss": 0.2414, "lr": 3.5049506102767037e-06, "epoch": 1.2950736414423565, "percentage": 43.17, "elapsed_time": "3:20:27", "remaining_time": "4:23:53"} | |
| {"current_steps": 2560, "total_steps": 5907, "loss": 0.2482, "lr": 3.4914051395433363e-06, "epoch": 1.3001523616048756, "percentage": 43.34, "elapsed_time": "3:21:01", "remaining_time": "4:22:49"} | |
| {"current_steps": 2570, "total_steps": 5907, "loss": 0.2501, "lr": 3.477825044613543e-06, "epoch": 1.3052310817673947, "percentage": 43.51, "elapsed_time": "3:21:33", "remaining_time": "4:21:43"} | |
| {"current_steps": 2580, "total_steps": 5907, "loss": 0.247, "lr": 3.464210799763536e-06, "epoch": 1.3103098019299138, "percentage": 43.68, "elapsed_time": "3:22:20", "remaining_time": "4:20:55"} | |
| {"current_steps": 2590, "total_steps": 5907, "loss": 0.2479, "lr": 3.450562880462191e-06, "epoch": 1.3153885220924328, "percentage": 43.85, "elapsed_time": "3:23:06", "remaining_time": "4:20:07"} | |
| {"current_steps": 2600, "total_steps": 5907, "loss": 0.2502, "lr": 3.436881763354444e-06, "epoch": 1.3204672422549517, "percentage": 44.02, "elapsed_time": "3:23:40", "remaining_time": "4:19:04"} | |
| {"current_steps": 2610, "total_steps": 5907, "loss": 0.2517, "lr": 3.4231679262446426e-06, "epoch": 1.3255459624174708, "percentage": 44.18, "elapsed_time": "3:24:16", "remaining_time": "4:18:03"} | |
| {"current_steps": 2620, "total_steps": 5907, "loss": 0.2448, "lr": 3.4094218480798608e-06, "epoch": 1.33062468257999, "percentage": 44.35, "elapsed_time": "3:25:44", "remaining_time": "4:18:06"} | |
| {"current_steps": 2630, "total_steps": 5907, "loss": 0.2431, "lr": 3.3956440089331687e-06, "epoch": 1.3357034027425088, "percentage": 44.52, "elapsed_time": "3:26:18", "remaining_time": "4:17:03"} | |
| {"current_steps": 2640, "total_steps": 5907, "loss": 0.2517, "lr": 3.3818348899868707e-06, "epoch": 1.3407821229050279, "percentage": 44.69, "elapsed_time": "3:26:52", "remaining_time": "4:15:59"} | |
| {"current_steps": 2650, "total_steps": 5907, "loss": 0.2478, "lr": 3.3679949735156974e-06, "epoch": 1.345860843067547, "percentage": 44.86, "elapsed_time": "3:27:28", "remaining_time": "4:15:00"} | |
| {"current_steps": 2660, "total_steps": 5907, "loss": 0.2451, "lr": 3.354124742869965e-06, "epoch": 1.350939563230066, "percentage": 45.03, "elapsed_time": "3:28:07", "remaining_time": "4:14:03"} | |
| {"current_steps": 2670, "total_steps": 5907, "loss": 0.2448, "lr": 3.3402246824586897e-06, "epoch": 1.3560182833925851, "percentage": 45.2, "elapsed_time": "3:28:54", "remaining_time": "4:13:16"} | |
| {"current_steps": 2680, "total_steps": 5907, "loss": 0.2503, "lr": 3.3262952777326775e-06, "epoch": 1.3610970035551042, "percentage": 45.37, "elapsed_time": "3:29:25", "remaining_time": "4:12:10"} | |
| {"current_steps": 2690, "total_steps": 5907, "loss": 0.2537, "lr": 3.3123370151675615e-06, "epoch": 1.366175723717623, "percentage": 45.54, "elapsed_time": "3:30:27", "remaining_time": "4:11:41"} | |
| {"current_steps": 2700, "total_steps": 5907, "loss": 0.2496, "lr": 3.2983503822468214e-06, "epoch": 1.3712544438801422, "percentage": 45.71, "elapsed_time": "3:31:02", "remaining_time": "4:10:39"} | |
| {"current_steps": 2710, "total_steps": 5907, "loss": 0.2489, "lr": 3.28433586744475e-06, "epoch": 1.3763331640426613, "percentage": 45.88, "elapsed_time": "3:31:33", "remaining_time": "4:09:34"} | |
| {"current_steps": 2720, "total_steps": 5907, "loss": 0.2448, "lr": 3.2702939602093988e-06, "epoch": 1.3814118842051804, "percentage": 46.05, "elapsed_time": "3:32:14", "remaining_time": "4:08:40"} | |
| {"current_steps": 2730, "total_steps": 5907, "loss": 0.2578, "lr": 3.2562251509454813e-06, "epoch": 1.3864906043676992, "percentage": 46.22, "elapsed_time": "3:33:13", "remaining_time": "4:08:07"} | |
| {"current_steps": 2740, "total_steps": 5907, "loss": 0.2489, "lr": 3.2421299309972485e-06, "epoch": 1.3915693245302183, "percentage": 46.39, "elapsed_time": "3:33:45", "remaining_time": "4:07:03"} | |
| {"current_steps": 2750, "total_steps": 5907, "loss": 0.248, "lr": 3.2280087926313288e-06, "epoch": 1.3966480446927374, "percentage": 46.55, "elapsed_time": "3:34:34", "remaining_time": "4:06:20"} | |
| {"current_steps": 2760, "total_steps": 5907, "loss": 0.2535, "lr": 3.2138622290195325e-06, "epoch": 1.4017267648552565, "percentage": 46.72, "elapsed_time": "3:35:20", "remaining_time": "4:05:32"} | |
| {"current_steps": 2770, "total_steps": 5907, "loss": 0.2434, "lr": 3.1996907342216318e-06, "epoch": 1.4068054850177756, "percentage": 46.89, "elapsed_time": "3:35:58", "remaining_time": "4:04:34"} | |
| {"current_steps": 2780, "total_steps": 5907, "loss": 0.2518, "lr": 3.1854948031681044e-06, "epoch": 1.4118842051802947, "percentage": 47.06, "elapsed_time": "3:36:35", "remaining_time": "4:03:37"} | |
| {"current_steps": 2790, "total_steps": 5907, "loss": 0.2433, "lr": 3.1712749316428487e-06, "epoch": 1.4169629253428135, "percentage": 47.23, "elapsed_time": "3:37:07", "remaining_time": "4:02:34"} | |
| {"current_steps": 2800, "total_steps": 5907, "loss": 0.2543, "lr": 3.157031616265871e-06, "epoch": 1.4220416455053326, "percentage": 47.4, "elapsed_time": "3:37:54", "remaining_time": "4:01:48"} | |
| {"current_steps": 2810, "total_steps": 5907, "loss": 0.247, "lr": 3.1427653544759352e-06, "epoch": 1.4271203656678517, "percentage": 47.57, "elapsed_time": "3:38:57", "remaining_time": "4:01:19"} | |
| {"current_steps": 2820, "total_steps": 5907, "loss": 0.2485, "lr": 3.1284766445131975e-06, "epoch": 1.4321990858303708, "percentage": 47.74, "elapsed_time": "3:39:30", "remaining_time": "4:00:17"} | |
| {"current_steps": 2830, "total_steps": 5907, "loss": 0.2469, "lr": 3.114165985401801e-06, "epoch": 1.4372778059928897, "percentage": 47.91, "elapsed_time": "3:40:07", "remaining_time": "3:59:20"} | |
| {"current_steps": 2840, "total_steps": 5907, "loss": 0.2492, "lr": 3.09983387693245e-06, "epoch": 1.4423565261554088, "percentage": 48.08, "elapsed_time": "3:40:43", "remaining_time": "3:58:21"} | |
| {"current_steps": 2850, "total_steps": 5907, "loss": 0.2417, "lr": 3.085480819644951e-06, "epoch": 1.4474352463179279, "percentage": 48.25, "elapsed_time": "3:41:17", "remaining_time": "3:57:22"} | |
| {"current_steps": 2860, "total_steps": 5907, "loss": 0.2533, "lr": 3.0711073148107395e-06, "epoch": 1.452513966480447, "percentage": 48.42, "elapsed_time": "3:42:06", "remaining_time": "3:56:37"} | |
| {"current_steps": 2870, "total_steps": 5907, "loss": 0.2483, "lr": 3.056713864415363e-06, "epoch": 1.457592686642966, "percentage": 48.59, "elapsed_time": "3:43:04", "remaining_time": "3:56:03"} | |
| {"current_steps": 2880, "total_steps": 5907, "loss": 0.2502, "lr": 3.0423009711409614e-06, "epoch": 1.4626714068054851, "percentage": 48.76, "elapsed_time": "3:43:35", "remaining_time": "3:55:00"} | |
| {"current_steps": 2890, "total_steps": 5907, "loss": 0.2491, "lr": 3.0278691383486992e-06, "epoch": 1.467750126968004, "percentage": 48.93, "elapsed_time": "3:44:06", "remaining_time": "3:53:57"} | |
| {"current_steps": 2900, "total_steps": 5907, "loss": 0.2461, "lr": 3.013418870061194e-06, "epoch": 1.472828847130523, "percentage": 49.09, "elapsed_time": "3:44:50", "remaining_time": "3:53:08"} | |
| {"current_steps": 2910, "total_steps": 5907, "loss": 0.2498, "lr": 2.9989506709449123e-06, "epoch": 1.4779075672930422, "percentage": 49.26, "elapsed_time": "3:45:24", "remaining_time": "3:52:09"} | |
| {"current_steps": 2920, "total_steps": 5907, "loss": 0.2473, "lr": 2.984465046292541e-06, "epoch": 1.4829862874555613, "percentage": 49.43, "elapsed_time": "3:46:28", "remaining_time": "3:51:40"} | |
| {"current_steps": 2930, "total_steps": 5907, "loss": 0.2443, "lr": 2.9699625020053457e-06, "epoch": 1.4880650076180801, "percentage": 49.6, "elapsed_time": "3:47:02", "remaining_time": "3:50:41"} | |
| {"current_steps": 2940, "total_steps": 5907, "loss": 0.2386, "lr": 2.9554435445754976e-06, "epoch": 1.4931437277805992, "percentage": 49.77, "elapsed_time": "3:47:42", "remaining_time": "3:49:48"} | |
| {"current_steps": 2950, "total_steps": 5907, "loss": 0.2503, "lr": 2.9409086810683858e-06, "epoch": 1.4982224479431183, "percentage": 49.94, "elapsed_time": "3:48:26", "remaining_time": "3:48:59"} | |
| {"current_steps": 2960, "total_steps": 5907, "loss": 0.2421, "lr": 2.926358419104911e-06, "epoch": 1.5033011681056374, "percentage": 50.11, "elapsed_time": "3:49:10", "remaining_time": "3:48:10"} | |
| {"current_steps": 2970, "total_steps": 5907, "loss": 0.2515, "lr": 2.9117932668437542e-06, "epoch": 1.5083798882681565, "percentage": 50.28, "elapsed_time": "3:49:42", "remaining_time": "3:47:09"} | |
| {"current_steps": 2980, "total_steps": 5907, "loss": 0.2486, "lr": 2.8972137329636324e-06, "epoch": 1.5134586084306756, "percentage": 50.45, "elapsed_time": "3:50:29", "remaining_time": "3:46:23"} | |
| {"current_steps": 2990, "total_steps": 5907, "loss": 0.2406, "lr": 2.8826203266455276e-06, "epoch": 1.5185373285931947, "percentage": 50.62, "elapsed_time": "3:51:09", "remaining_time": "3:45:30"} | |
| {"current_steps": 3000, "total_steps": 5907, "loss": 0.2526, "lr": 2.868013557554911e-06, "epoch": 1.5236160487557135, "percentage": 50.79, "elapsed_time": "3:51:45", "remaining_time": "3:44:34"} | |
| {"current_steps": 3000, "total_steps": 5907, "eval_loss": 0.3125091791152954, "epoch": 1.5236160487557135, "percentage": 50.79, "elapsed_time": "3:53:47", "remaining_time": "3:46:32"} | |
| {"current_steps": 3010, "total_steps": 5907, "loss": 0.2447, "lr": 2.8533939358239405e-06, "epoch": 1.5286947689182326, "percentage": 50.96, "elapsed_time": "3:56:41", "remaining_time": "3:47:48"} | |
| {"current_steps": 3020, "total_steps": 5907, "loss": 0.2508, "lr": 2.838761972033643e-06, "epoch": 1.5337734890807515, "percentage": 51.13, "elapsed_time": "3:57:28", "remaining_time": "3:47:01"} | |
| {"current_steps": 3030, "total_steps": 5907, "loss": 0.2481, "lr": 2.824118177196083e-06, "epoch": 1.5388522092432706, "percentage": 51.3, "elapsed_time": "3:58:13", "remaining_time": "3:46:11"} | |
| {"current_steps": 3040, "total_steps": 5907, "loss": 0.2525, "lr": 2.8094630627365193e-06, "epoch": 1.5439309294057897, "percentage": 51.46, "elapsed_time": "3:59:03", "remaining_time": "3:45:26"} | |
| {"current_steps": 3050, "total_steps": 5907, "loss": 0.2503, "lr": 2.7947971404755392e-06, "epoch": 1.5490096495683088, "percentage": 51.63, "elapsed_time": "4:00:16", "remaining_time": "3:45:03"} | |
| {"current_steps": 3060, "total_steps": 5907, "loss": 0.2508, "lr": 2.7801209226111874e-06, "epoch": 1.5540883697308279, "percentage": 51.8, "elapsed_time": "4:01:28", "remaining_time": "3:44:39"} | |
| {"current_steps": 3070, "total_steps": 5907, "loss": 0.2435, "lr": 2.765434921701075e-06, "epoch": 1.559167089893347, "percentage": 51.97, "elapsed_time": "4:02:02", "remaining_time": "3:43:40"} | |
| {"current_steps": 3080, "total_steps": 5907, "loss": 0.2468, "lr": 2.7507396506444805e-06, "epoch": 1.564245810055866, "percentage": 52.14, "elapsed_time": "4:02:45", "remaining_time": "3:42:49"} | |
| {"current_steps": 3090, "total_steps": 5907, "loss": 0.2473, "lr": 2.7360356226644342e-06, "epoch": 1.569324530218385, "percentage": 52.31, "elapsed_time": "4:03:29", "remaining_time": "3:41:58"} | |
| {"current_steps": 3100, "total_steps": 5907, "loss": 0.2468, "lr": 2.721323351289799e-06, "epoch": 1.574403250380904, "percentage": 52.48, "elapsed_time": "4:04:01", "remaining_time": "3:40:57"} | |
| {"current_steps": 3110, "total_steps": 5907, "loss": 0.243, "lr": 2.7066033503373323e-06, "epoch": 1.579481970543423, "percentage": 52.65, "elapsed_time": "4:04:32", "remaining_time": "3:39:55"} | |
| {"current_steps": 3120, "total_steps": 5907, "loss": 0.2566, "lr": 2.6918761338937427e-06, "epoch": 1.584560690705942, "percentage": 52.82, "elapsed_time": "4:05:03", "remaining_time": "3:38:54"} | |
| {"current_steps": 3130, "total_steps": 5907, "loss": 0.2518, "lr": 2.677142216297733e-06, "epoch": 1.589639410868461, "percentage": 52.99, "elapsed_time": "4:06:01", "remaining_time": "3:38:16"} | |
| {"current_steps": 3140, "total_steps": 5907, "loss": 0.2537, "lr": 2.6624021121220415e-06, "epoch": 1.5947181310309801, "percentage": 53.16, "elapsed_time": "4:06:33", "remaining_time": "3:37:16"} | |
| {"current_steps": 3150, "total_steps": 5907, "loss": 0.248, "lr": 2.647656336155469e-06, "epoch": 1.5997968511934992, "percentage": 53.33, "elapsed_time": "4:07:04", "remaining_time": "3:36:15"} | |
| {"current_steps": 3160, "total_steps": 5907, "loss": 0.2435, "lr": 2.6329054033848994e-06, "epoch": 1.6048755713560183, "percentage": 53.5, "elapsed_time": "4:07:37", "remaining_time": "3:35:15"} | |
| {"current_steps": 3170, "total_steps": 5907, "loss": 0.2473, "lr": 2.6181498289773145e-06, "epoch": 1.6099542915185374, "percentage": 53.67, "elapsed_time": "4:08:10", "remaining_time": "3:34:16"} | |
| {"current_steps": 3180, "total_steps": 5907, "loss": 0.2466, "lr": 2.603390128261802e-06, "epoch": 1.6150330116810565, "percentage": 53.83, "elapsed_time": "4:08:57", "remaining_time": "3:33:29"} | |
| {"current_steps": 3190, "total_steps": 5907, "loss": 0.2463, "lr": 2.5886268167115597e-06, "epoch": 1.6201117318435754, "percentage": 54.0, "elapsed_time": "4:09:46", "remaining_time": "3:32:44"} | |
| {"current_steps": 3200, "total_steps": 5907, "loss": 0.2414, "lr": 2.5738604099258908e-06, "epoch": 1.6251904520060945, "percentage": 54.17, "elapsed_time": "4:10:35", "remaining_time": "3:31:59"} | |
| {"current_steps": 3210, "total_steps": 5907, "loss": 0.2573, "lr": 2.559091423612196e-06, "epoch": 1.6302691721686136, "percentage": 54.34, "elapsed_time": "4:11:08", "remaining_time": "3:30:59"} | |
| {"current_steps": 3220, "total_steps": 5907, "loss": 0.2468, "lr": 2.5443203735679682e-06, "epoch": 1.6353478923311324, "percentage": 54.51, "elapsed_time": "4:11:39", "remaining_time": "3:29:59"} | |
| {"current_steps": 3230, "total_steps": 5907, "loss": 0.2475, "lr": 2.52954777566277e-06, "epoch": 1.6404266124936515, "percentage": 54.68, "elapsed_time": "4:12:12", "remaining_time": "3:29:01"} | |
| {"current_steps": 3240, "total_steps": 5907, "loss": 0.2528, "lr": 2.5147741458202266e-06, "epoch": 1.6455053326561706, "percentage": 54.85, "elapsed_time": "4:13:11", "remaining_time": "3:28:24"} | |
| {"current_steps": 3250, "total_steps": 5907, "loss": 0.252, "lr": 2.5e-06, "epoch": 1.6505840528186897, "percentage": 55.02, "elapsed_time": "4:13:49", "remaining_time": "3:27:30"} | |
| {"current_steps": 3260, "total_steps": 5907, "loss": 0.2529, "lr": 2.485225854179774e-06, "epoch": 1.6556627729812088, "percentage": 55.19, "elapsed_time": "4:14:27", "remaining_time": "3:26:36"} | |
| {"current_steps": 3270, "total_steps": 5907, "loss": 0.2548, "lr": 2.47045222433723e-06, "epoch": 1.6607414931437279, "percentage": 55.36, "elapsed_time": "4:15:24", "remaining_time": "3:25:58"} | |
| {"current_steps": 3280, "total_steps": 5907, "loss": 0.2472, "lr": 2.455679626432032e-06, "epoch": 1.665820213306247, "percentage": 55.53, "elapsed_time": "4:16:03", "remaining_time": "3:25:05"} | |
| {"current_steps": 3290, "total_steps": 5907, "loss": 0.2437, "lr": 2.4409085763878043e-06, "epoch": 1.6708989334687658, "percentage": 55.7, "elapsed_time": "4:16:37", "remaining_time": "3:24:07"} | |
| {"current_steps": 3300, "total_steps": 5907, "loss": 0.2418, "lr": 2.426139590074111e-06, "epoch": 1.675977653631285, "percentage": 55.87, "elapsed_time": "4:17:21", "remaining_time": "3:23:18"} | |
| {"current_steps": 3310, "total_steps": 5907, "loss": 0.239, "lr": 2.4113731832884407e-06, "epoch": 1.681056373793804, "percentage": 56.04, "elapsed_time": "4:17:57", "remaining_time": "3:22:23"} | |
| {"current_steps": 3320, "total_steps": 5907, "loss": 0.2474, "lr": 2.396609871738199e-06, "epoch": 1.6861350939563229, "percentage": 56.2, "elapsed_time": "4:18:30", "remaining_time": "3:21:26"} | |
| {"current_steps": 3330, "total_steps": 5907, "loss": 0.2502, "lr": 2.3818501710226867e-06, "epoch": 1.691213814118842, "percentage": 56.37, "elapsed_time": "4:19:34", "remaining_time": "3:20:53"} | |
| {"current_steps": 3340, "total_steps": 5907, "loss": 0.2571, "lr": 2.3670945966151014e-06, "epoch": 1.696292534281361, "percentage": 56.54, "elapsed_time": "4:20:07", "remaining_time": "3:19:55"} | |
| {"current_steps": 3350, "total_steps": 5907, "loss": 0.2527, "lr": 2.3523436638445312e-06, "epoch": 1.7013712544438802, "percentage": 56.71, "elapsed_time": "4:20:38", "remaining_time": "3:18:56"} | |
| {"current_steps": 3360, "total_steps": 5907, "loss": 0.2465, "lr": 2.3375978878779593e-06, "epoch": 1.7064499746063992, "percentage": 56.88, "elapsed_time": "4:21:11", "remaining_time": "3:17:59"} | |
| {"current_steps": 3370, "total_steps": 5907, "loss": 0.2515, "lr": 2.322857783702268e-06, "epoch": 1.7115286947689183, "percentage": 57.05, "elapsed_time": "4:21:44", "remaining_time": "3:17:02"} | |
| {"current_steps": 3380, "total_steps": 5907, "loss": 0.253, "lr": 2.3081238661062585e-06, "epoch": 1.7166074149314374, "percentage": 57.22, "elapsed_time": "4:22:14", "remaining_time": "3:16:03"} | |
| {"current_steps": 3390, "total_steps": 5907, "loss": 0.2549, "lr": 2.2933966496626677e-06, "epoch": 1.7216861350939563, "percentage": 57.39, "elapsed_time": "4:23:01", "remaining_time": "3:15:17"} | |
| {"current_steps": 3400, "total_steps": 5907, "loss": 0.2565, "lr": 2.2786766487102014e-06, "epoch": 1.7267648552564754, "percentage": 57.56, "elapsed_time": "4:23:58", "remaining_time": "3:14:38"} | |
| {"current_steps": 3410, "total_steps": 5907, "loss": 0.2405, "lr": 2.2639643773355666e-06, "epoch": 1.7318435754189943, "percentage": 57.73, "elapsed_time": "4:24:51", "remaining_time": "3:13:56"} | |
| {"current_steps": 3420, "total_steps": 5907, "loss": 0.2472, "lr": 2.2492603493555208e-06, "epoch": 1.7369222955815133, "percentage": 57.9, "elapsed_time": "4:25:23", "remaining_time": "3:12:59"} | |
| {"current_steps": 3430, "total_steps": 5907, "loss": 0.2538, "lr": 2.234565078298925e-06, "epoch": 1.7420010157440324, "percentage": 58.07, "elapsed_time": "4:26:08", "remaining_time": "3:12:12"} | |
| {"current_steps": 3440, "total_steps": 5907, "loss": 0.2542, "lr": 2.219879077388813e-06, "epoch": 1.7470797359065515, "percentage": 58.24, "elapsed_time": "4:26:39", "remaining_time": "3:11:13"} | |
| {"current_steps": 3450, "total_steps": 5907, "loss": 0.254, "lr": 2.2052028595244616e-06, "epoch": 1.7521584560690706, "percentage": 58.41, "elapsed_time": "4:27:10", "remaining_time": "3:10:16"} | |
| {"current_steps": 3460, "total_steps": 5907, "loss": 0.2431, "lr": 2.190536937263482e-06, "epoch": 1.7572371762315897, "percentage": 58.57, "elapsed_time": "4:27:53", "remaining_time": "3:09:27"} | |
| {"current_steps": 3470, "total_steps": 5907, "loss": 0.2467, "lr": 2.175881822803917e-06, "epoch": 1.7623158963941088, "percentage": 58.74, "elapsed_time": "4:28:30", "remaining_time": "3:08:34"} | |
| {"current_steps": 3480, "total_steps": 5907, "loss": 0.2414, "lr": 2.1612380279663576e-06, "epoch": 1.7673946165566279, "percentage": 58.91, "elapsed_time": "4:29:14", "remaining_time": "3:07:46"} | |
| {"current_steps": 3490, "total_steps": 5907, "loss": 0.2479, "lr": 2.14660606417606e-06, "epoch": 1.7724733367191468, "percentage": 59.08, "elapsed_time": "4:29:47", "remaining_time": "3:06:50"} | |
| {"current_steps": 3500, "total_steps": 5907, "loss": 0.254, "lr": 2.1319864424450894e-06, "epoch": 1.7775520568816658, "percentage": 59.25, "elapsed_time": "4:30:20", "remaining_time": "3:05:54"} | |
| {"current_steps": 3500, "total_steps": 5907, "eval_loss": 0.30940133333206177, "epoch": 1.7775520568816658, "percentage": 59.25, "elapsed_time": "4:32:22", "remaining_time": "3:07:18"} | |
| {"current_steps": 3510, "total_steps": 5907, "loss": 0.2461, "lr": 2.117379673354473e-06, "epoch": 1.7826307770441847, "percentage": 59.42, "elapsed_time": "4:35:29", "remaining_time": "3:08:08"} | |
| {"current_steps": 3520, "total_steps": 5907, "loss": 0.2475, "lr": 2.1027862670363685e-06, "epoch": 1.7877094972067038, "percentage": 59.59, "elapsed_time": "4:36:03", "remaining_time": "3:07:12"} | |
| {"current_steps": 3530, "total_steps": 5907, "loss": 0.2346, "lr": 2.088206733156246e-06, "epoch": 1.792788217369223, "percentage": 59.76, "elapsed_time": "4:36:49", "remaining_time": "3:06:24"} | |
| {"current_steps": 3540, "total_steps": 5907, "loss": 0.255, "lr": 2.0736415808950898e-06, "epoch": 1.797866937531742, "percentage": 59.93, "elapsed_time": "4:37:36", "remaining_time": "3:05:37"} | |
| {"current_steps": 3550, "total_steps": 5907, "loss": 0.2466, "lr": 2.059091318931615e-06, "epoch": 1.802945657694261, "percentage": 60.1, "elapsed_time": "4:38:33", "remaining_time": "3:04:56"} | |
| {"current_steps": 3560, "total_steps": 5907, "loss": 0.2431, "lr": 2.0445564554245033e-06, "epoch": 1.8080243778567802, "percentage": 60.27, "elapsed_time": "4:39:05", "remaining_time": "3:04:00"} | |
| {"current_steps": 3570, "total_steps": 5907, "loss": 0.2556, "lr": 2.030037497994655e-06, "epoch": 1.8131030980192993, "percentage": 60.44, "elapsed_time": "4:39:59", "remaining_time": "3:03:17"} | |
| {"current_steps": 3580, "total_steps": 5907, "loss": 0.2376, "lr": 2.0155349537074598e-06, "epoch": 1.8181818181818183, "percentage": 60.61, "elapsed_time": "4:40:47", "remaining_time": "3:02:30"} | |
| {"current_steps": 3590, "total_steps": 5907, "loss": 0.2416, "lr": 2.001049329055088e-06, "epoch": 1.8232605383443372, "percentage": 60.78, "elapsed_time": "4:41:18", "remaining_time": "3:01:33"} | |
| {"current_steps": 3600, "total_steps": 5907, "loss": 0.2447, "lr": 1.9865811299388062e-06, "epoch": 1.8283392585068563, "percentage": 60.94, "elapsed_time": "4:42:14", "remaining_time": "3:00:52"} | |
| {"current_steps": 3610, "total_steps": 5907, "loss": 0.2475, "lr": 1.972130861651302e-06, "epoch": 1.8334179786693752, "percentage": 61.11, "elapsed_time": "4:42:46", "remaining_time": "2:59:55"} | |
| {"current_steps": 3620, "total_steps": 5907, "loss": 0.25, "lr": 1.95769902885904e-06, "epoch": 1.8384966988318943, "percentage": 61.28, "elapsed_time": "4:43:56", "remaining_time": "2:59:23"} | |
| {"current_steps": 3630, "total_steps": 5907, "loss": 0.245, "lr": 1.943286135584637e-06, "epoch": 1.8435754189944134, "percentage": 61.45, "elapsed_time": "4:44:41", "remaining_time": "2:58:34"} | |
| {"current_steps": 3640, "total_steps": 5907, "loss": 0.2424, "lr": 1.9288926851892614e-06, "epoch": 1.8486541391569324, "percentage": 61.62, "elapsed_time": "4:45:35", "remaining_time": "2:57:51"} | |
| {"current_steps": 3650, "total_steps": 5907, "loss": 0.2367, "lr": 1.9145191803550493e-06, "epoch": 1.8537328593194515, "percentage": 61.79, "elapsed_time": "4:46:08", "remaining_time": "2:56:56"} | |
| {"current_steps": 3660, "total_steps": 5907, "loss": 0.2434, "lr": 1.9001661230675516e-06, "epoch": 1.8588115794819706, "percentage": 61.96, "elapsed_time": "4:46:43", "remaining_time": "2:56:01"} | |
| {"current_steps": 3670, "total_steps": 5907, "loss": 0.2475, "lr": 1.8858340145981994e-06, "epoch": 1.8638902996444897, "percentage": 62.13, "elapsed_time": "4:47:34", "remaining_time": "2:55:17"} | |
| {"current_steps": 3680, "total_steps": 5907, "loss": 0.2409, "lr": 1.8715233554868035e-06, "epoch": 1.8689690198070086, "percentage": 62.3, "elapsed_time": "4:48:34", "remaining_time": "2:54:38"} | |
| {"current_steps": 3690, "total_steps": 5907, "loss": 0.2415, "lr": 1.8572346455240656e-06, "epoch": 1.8740477399695277, "percentage": 62.47, "elapsed_time": "4:49:06", "remaining_time": "2:53:42"} | |
| {"current_steps": 3700, "total_steps": 5907, "loss": 0.2499, "lr": 1.8429683837341306e-06, "epoch": 1.8791264601320468, "percentage": 62.64, "elapsed_time": "4:49:50", "remaining_time": "2:52:52"} | |
| {"current_steps": 3710, "total_steps": 5907, "loss": 0.2499, "lr": 1.828725068357151e-06, "epoch": 1.8842051802945656, "percentage": 62.81, "elapsed_time": "4:50:34", "remaining_time": "2:52:04"} | |
| {"current_steps": 3720, "total_steps": 5907, "loss": 0.2431, "lr": 1.8145051968318966e-06, "epoch": 1.8892839004570847, "percentage": 62.98, "elapsed_time": "4:51:25", "remaining_time": "2:51:19"} | |
| {"current_steps": 3730, "total_steps": 5907, "loss": 0.2417, "lr": 1.800309265778369e-06, "epoch": 1.8943626206196038, "percentage": 63.15, "elapsed_time": "4:51:55", "remaining_time": "2:50:22"} | |
| {"current_steps": 3740, "total_steps": 5907, "loss": 0.2538, "lr": 1.7861377709804687e-06, "epoch": 1.899441340782123, "percentage": 63.31, "elapsed_time": "4:52:42", "remaining_time": "2:49:35"} | |
| {"current_steps": 3750, "total_steps": 5907, "loss": 0.2414, "lr": 1.7719912073686712e-06, "epoch": 1.904520060944642, "percentage": 63.48, "elapsed_time": "4:53:13", "remaining_time": "2:48:39"} | |
| {"current_steps": 3760, "total_steps": 5907, "loss": 0.2409, "lr": 1.7578700690027517e-06, "epoch": 1.909598781107161, "percentage": 63.65, "elapsed_time": "4:53:50", "remaining_time": "2:47:47"} | |
| {"current_steps": 3770, "total_steps": 5907, "loss": 0.2414, "lr": 1.7437748490545191e-06, "epoch": 1.9146775012696802, "percentage": 63.82, "elapsed_time": "4:54:26", "remaining_time": "2:46:54"} | |
| {"current_steps": 3780, "total_steps": 5907, "loss": 0.2424, "lr": 1.7297060397906023e-06, "epoch": 1.919756221432199, "percentage": 63.99, "elapsed_time": "4:55:11", "remaining_time": "2:46:06"} | |
| {"current_steps": 3790, "total_steps": 5907, "loss": 0.2521, "lr": 1.7156641325552503e-06, "epoch": 1.9248349415947181, "percentage": 64.16, "elapsed_time": "4:55:59", "remaining_time": "2:45:20"} | |
| {"current_steps": 3800, "total_steps": 5907, "loss": 0.2399, "lr": 1.7016496177531792e-06, "epoch": 1.9299136617572372, "percentage": 64.33, "elapsed_time": "4:56:58", "remaining_time": "2:44:39"} | |
| {"current_steps": 3810, "total_steps": 5907, "loss": 0.2336, "lr": 1.6876629848324391e-06, "epoch": 1.934992381919756, "percentage": 64.5, "elapsed_time": "4:57:29", "remaining_time": "2:43:44"} | |
| {"current_steps": 3820, "total_steps": 5907, "loss": 0.25, "lr": 1.6737047222673235e-06, "epoch": 1.9400711020822752, "percentage": 64.67, "elapsed_time": "4:58:01", "remaining_time": "2:42:49"} | |
| {"current_steps": 3830, "total_steps": 5907, "loss": 0.2516, "lr": 1.6597753175413103e-06, "epoch": 1.9451498222447943, "percentage": 64.84, "elapsed_time": "4:58:37", "remaining_time": "2:41:56"} | |
| {"current_steps": 3840, "total_steps": 5907, "loss": 0.2422, "lr": 1.6458752571300358e-06, "epoch": 1.9502285424073134, "percentage": 65.01, "elapsed_time": "4:59:25", "remaining_time": "2:41:10"} | |
| {"current_steps": 3850, "total_steps": 5907, "loss": 0.2467, "lr": 1.632005026484303e-06, "epoch": 1.9553072625698324, "percentage": 65.18, "elapsed_time": "5:00:24", "remaining_time": "2:40:30"} | |
| {"current_steps": 3860, "total_steps": 5907, "loss": 0.2478, "lr": 1.6181651100131302e-06, "epoch": 1.9603859827323515, "percentage": 65.35, "elapsed_time": "5:00:56", "remaining_time": "2:39:35"} | |
| {"current_steps": 3870, "total_steps": 5907, "loss": 0.2466, "lr": 1.6043559910668315e-06, "epoch": 1.9654647028948706, "percentage": 65.52, "elapsed_time": "5:01:44", "remaining_time": "2:38:49"} | |
| {"current_steps": 3880, "total_steps": 5907, "loss": 0.2401, "lr": 1.5905781519201398e-06, "epoch": 1.9705434230573895, "percentage": 65.68, "elapsed_time": "5:02:20", "remaining_time": "2:37:57"} | |
| {"current_steps": 3890, "total_steps": 5907, "loss": 0.2548, "lr": 1.576832073755358e-06, "epoch": 1.9756221432199086, "percentage": 65.85, "elapsed_time": "5:02:50", "remaining_time": "2:37:01"} | |
| {"current_steps": 3900, "total_steps": 5907, "loss": 0.2394, "lr": 1.5631182366455566e-06, "epoch": 1.9807008633824277, "percentage": 66.02, "elapsed_time": "5:03:36", "remaining_time": "2:36:14"} | |
| {"current_steps": 3910, "total_steps": 5907, "loss": 0.2396, "lr": 1.54943711953781e-06, "epoch": 1.9857795835449465, "percentage": 66.19, "elapsed_time": "5:04:24", "remaining_time": "2:35:28"} | |
| {"current_steps": 3920, "total_steps": 5907, "loss": 0.2435, "lr": 1.5357892002364649e-06, "epoch": 1.9908583037074656, "percentage": 66.36, "elapsed_time": "5:04:56", "remaining_time": "2:34:34"} | |
| {"current_steps": 3930, "total_steps": 5907, "loss": 0.2416, "lr": 1.5221749553864578e-06, "epoch": 1.9959370238699847, "percentage": 66.53, "elapsed_time": "5:05:33", "remaining_time": "2:33:42"} | |
| {"current_steps": 3940, "total_steps": 5907, "loss": 0.2302, "lr": 1.5085948604566647e-06, "epoch": 2.001015744032504, "percentage": 66.7, "elapsed_time": "5:06:30", "remaining_time": "2:33:01"} | |
| {"current_steps": 3950, "total_steps": 5907, "loss": 0.1941, "lr": 1.4950493897232967e-06, "epoch": 2.006094464195023, "percentage": 66.87, "elapsed_time": "5:07:15", "remaining_time": "2:32:13"} | |
| {"current_steps": 3960, "total_steps": 5907, "loss": 0.1885, "lr": 1.4815390162533397e-06, "epoch": 2.011173184357542, "percentage": 67.04, "elapsed_time": "5:08:34", "remaining_time": "2:31:43"} | |
| {"current_steps": 3970, "total_steps": 5907, "loss": 0.1911, "lr": 1.4680642118880275e-06, "epoch": 2.016251904520061, "percentage": 67.21, "elapsed_time": "5:09:06", "remaining_time": "2:30:49"} | |
| {"current_steps": 3980, "total_steps": 5907, "loss": 0.1874, "lr": 1.454625447226366e-06, "epoch": 2.02133062468258, "percentage": 67.38, "elapsed_time": "5:09:39", "remaining_time": "2:29:55"} | |
| {"current_steps": 3990, "total_steps": 5907, "loss": 0.1862, "lr": 1.441223191608696e-06, "epoch": 2.026409344845099, "percentage": 67.55, "elapsed_time": "5:10:23", "remaining_time": "2:29:07"} | |
| {"current_steps": 4000, "total_steps": 5907, "loss": 0.1867, "lr": 1.4278579131003067e-06, "epoch": 2.031488065007618, "percentage": 67.72, "elapsed_time": "5:11:12", "remaining_time": "2:28:21"} | |
| {"current_steps": 4000, "total_steps": 5907, "eval_loss": 0.33027777075767517, "epoch": 2.031488065007618, "percentage": 67.72, "elapsed_time": "5:13:13", "remaining_time": "2:29:19"} | |
| {"current_steps": 4010, "total_steps": 5907, "loss": 0.1844, "lr": 1.414530078475082e-06, "epoch": 2.036566785170137, "percentage": 67.89, "elapsed_time": "5:16:50", "remaining_time": "2:29:53"} | |
| {"current_steps": 4020, "total_steps": 5907, "loss": 0.1806, "lr": 1.4012401531992013e-06, "epoch": 2.041645505332656, "percentage": 68.05, "elapsed_time": "5:17:36", "remaining_time": "2:29:05"} | |
| {"current_steps": 4030, "total_steps": 5907, "loss": 0.1834, "lr": 1.3879886014148864e-06, "epoch": 2.046724225495175, "percentage": 68.22, "elapsed_time": "5:18:08", "remaining_time": "2:28:10"} | |
| {"current_steps": 4040, "total_steps": 5907, "loss": 0.1832, "lr": 1.3747758859241896e-06, "epoch": 2.0518029456576943, "percentage": 68.39, "elapsed_time": "5:18:39", "remaining_time": "2:27:15"} | |
| {"current_steps": 4050, "total_steps": 5907, "loss": 0.1751, "lr": 1.3616024681728278e-06, "epoch": 2.0568816658202134, "percentage": 68.56, "elapsed_time": "5:19:11", "remaining_time": "2:26:21"} | |
| {"current_steps": 4060, "total_steps": 5907, "loss": 0.1824, "lr": 1.3484688082340708e-06, "epoch": 2.0619603859827325, "percentage": 68.73, "elapsed_time": "5:19:55", "remaining_time": "2:25:32"} | |
| {"current_steps": 4070, "total_steps": 5907, "loss": 0.187, "lr": 1.3353753647926701e-06, "epoch": 2.0670391061452515, "percentage": 68.9, "elapsed_time": "5:20:41", "remaining_time": "2:24:44"} | |
| {"current_steps": 4080, "total_steps": 5907, "loss": 0.1772, "lr": 1.3223225951288449e-06, "epoch": 2.0721178263077706, "percentage": 69.07, "elapsed_time": "5:21:29", "remaining_time": "2:23:57"} | |
| {"current_steps": 4090, "total_steps": 5907, "loss": 0.182, "lr": 1.3093109551023058e-06, "epoch": 2.0771965464702893, "percentage": 69.24, "elapsed_time": "5:22:01", "remaining_time": "2:23:03"} | |
| {"current_steps": 4100, "total_steps": 5907, "loss": 0.1814, "lr": 1.2963408991363374e-06, "epoch": 2.0822752666328084, "percentage": 69.41, "elapsed_time": "5:22:34", "remaining_time": "2:22:10"} | |
| {"current_steps": 4110, "total_steps": 5907, "loss": 0.1816, "lr": 1.283412880201927e-06, "epoch": 2.0873539867953275, "percentage": 69.58, "elapsed_time": "5:23:06", "remaining_time": "2:21:16"} | |
| {"current_steps": 4120, "total_steps": 5907, "loss": 0.1826, "lr": 1.270527349801946e-06, "epoch": 2.0924327069578466, "percentage": 69.75, "elapsed_time": "5:23:42", "remaining_time": "2:20:24"} | |
| {"current_steps": 4130, "total_steps": 5907, "loss": 0.1872, "lr": 1.2576847579553826e-06, "epoch": 2.0975114271203656, "percentage": 69.92, "elapsed_time": "5:24:16", "remaining_time": "2:19:31"} | |
| {"current_steps": 4140, "total_steps": 5907, "loss": 0.1815, "lr": 1.2448855531816184e-06, "epoch": 2.1025901472828847, "percentage": 70.09, "elapsed_time": "5:25:11", "remaining_time": "2:18:47"} | |
| {"current_steps": 4150, "total_steps": 5907, "loss": 0.1841, "lr": 1.232130182484772e-06, "epoch": 2.107668867445404, "percentage": 70.26, "elapsed_time": "5:25:44", "remaining_time": "2:17:54"} | |
| {"current_steps": 4160, "total_steps": 5907, "loss": 0.1844, "lr": 1.2194190913380858e-06, "epoch": 2.112747587607923, "percentage": 70.42, "elapsed_time": "5:26:16", "remaining_time": "2:17:01"} | |
| {"current_steps": 4170, "total_steps": 5907, "loss": 0.1833, "lr": 1.206752723668364e-06, "epoch": 2.117826307770442, "percentage": 70.59, "elapsed_time": "5:27:19", "remaining_time": "2:16:20"} | |
| {"current_steps": 4180, "total_steps": 5907, "loss": 0.1845, "lr": 1.194131521840474e-06, "epoch": 2.122905027932961, "percentage": 70.76, "elapsed_time": "5:28:00", "remaining_time": "2:15:31"} | |
| {"current_steps": 4190, "total_steps": 5907, "loss": 0.1871, "lr": 1.181555926641892e-06, "epoch": 2.1279837480954797, "percentage": 70.93, "elapsed_time": "5:28:46", "remaining_time": "2:14:43"} | |
| {"current_steps": 4200, "total_steps": 5907, "loss": 0.1872, "lr": 1.1690263772673158e-06, "epoch": 2.133062468257999, "percentage": 71.1, "elapsed_time": "5:29:26", "remaining_time": "2:13:53"} | |
| {"current_steps": 4210, "total_steps": 5907, "loss": 0.1821, "lr": 1.1565433113033176e-06, "epoch": 2.138141188420518, "percentage": 71.27, "elapsed_time": "5:30:01", "remaining_time": "2:13:01"} | |
| {"current_steps": 4220, "total_steps": 5907, "loss": 0.1796, "lr": 1.14410716471307e-06, "epoch": 2.143219908583037, "percentage": 71.44, "elapsed_time": "5:30:34", "remaining_time": "2:12:09"} | |
| {"current_steps": 4230, "total_steps": 5907, "loss": 0.184, "lr": 1.131718371821112e-06, "epoch": 2.148298628745556, "percentage": 71.61, "elapsed_time": "5:31:20", "remaining_time": "2:11:21"} | |
| {"current_steps": 4240, "total_steps": 5907, "loss": 0.1851, "lr": 1.119377365298189e-06, "epoch": 2.153377348908075, "percentage": 71.78, "elapsed_time": "5:32:04", "remaining_time": "2:10:33"} | |
| {"current_steps": 4250, "total_steps": 5907, "loss": 0.1861, "lr": 1.1070845761461347e-06, "epoch": 2.1584560690705943, "percentage": 71.95, "elapsed_time": "5:32:47", "remaining_time": "2:09:44"} | |
| {"current_steps": 4260, "total_steps": 5907, "loss": 0.1816, "lr": 1.0948404336828222e-06, "epoch": 2.1635347892331134, "percentage": 72.12, "elapsed_time": "5:33:45", "remaining_time": "2:09:02"} | |
| {"current_steps": 4270, "total_steps": 5907, "loss": 0.1884, "lr": 1.0826453655271693e-06, "epoch": 2.1686135093956325, "percentage": 72.29, "elapsed_time": "5:34:18", "remaining_time": "2:08:09"} | |
| {"current_steps": 4280, "total_steps": 5907, "loss": 0.1851, "lr": 1.0704997975842075e-06, "epoch": 2.173692229558151, "percentage": 72.46, "elapsed_time": "5:34:50", "remaining_time": "2:07:17"} | |
| {"current_steps": 4290, "total_steps": 5907, "loss": 0.185, "lr": 1.0584041540302009e-06, "epoch": 2.17877094972067, "percentage": 72.63, "elapsed_time": "5:35:23", "remaining_time": "2:06:25"} | |
| {"current_steps": 4300, "total_steps": 5907, "loss": 0.1874, "lr": 1.0463588572978399e-06, "epoch": 2.1838496698831893, "percentage": 72.79, "elapsed_time": "5:36:08", "remaining_time": "2:05:37"} | |
| {"current_steps": 4310, "total_steps": 5907, "loss": 0.1812, "lr": 1.0343643280614798e-06, "epoch": 2.1889283900457084, "percentage": 72.96, "elapsed_time": "5:36:42", "remaining_time": "2:04:45"} | |
| {"current_steps": 4320, "total_steps": 5907, "loss": 0.1858, "lr": 1.0224209852224573e-06, "epoch": 2.1940071102082275, "percentage": 73.13, "elapsed_time": "5:37:14", "remaining_time": "2:03:53"} | |
| {"current_steps": 4330, "total_steps": 5907, "loss": 0.1782, "lr": 1.010529245894454e-06, "epoch": 2.1990858303707466, "percentage": 73.3, "elapsed_time": "5:37:48", "remaining_time": "2:03:01"} | |
| {"current_steps": 4340, "total_steps": 5907, "loss": 0.1865, "lr": 9.986895253889322e-07, "epoch": 2.2041645505332657, "percentage": 73.47, "elapsed_time": "5:38:50", "remaining_time": "2:02:20"} | |
| {"current_steps": 4350, "total_steps": 5907, "loss": 0.1836, "lr": 9.86902237200629e-07, "epoch": 2.2092432706957847, "percentage": 73.64, "elapsed_time": "5:39:21", "remaining_time": "2:01:28"} | |
| {"current_steps": 4360, "total_steps": 5907, "loss": 0.1879, "lr": 9.751677929931189e-07, "epoch": 2.214321990858304, "percentage": 73.81, "elapsed_time": "5:39:55", "remaining_time": "2:00:36"} | |
| {"current_steps": 4370, "total_steps": 5907, "loss": 0.1838, "lr": 9.634866025844306e-07, "epoch": 2.219400711020823, "percentage": 73.98, "elapsed_time": "5:40:57", "remaining_time": "1:59:55"} | |
| {"current_steps": 4380, "total_steps": 5907, "loss": 0.1846, "lr": 9.518590739327382e-07, "epoch": 2.224479431183342, "percentage": 74.15, "elapsed_time": "5:41:41", "remaining_time": "1:59:07"} | |
| {"current_steps": 4390, "total_steps": 5907, "loss": 0.1826, "lr": 9.402856131221144e-07, "epoch": 2.2295581513458607, "percentage": 74.32, "elapsed_time": "5:42:28", "remaining_time": "1:58:20"} | |
| {"current_steps": 4400, "total_steps": 5907, "loss": 0.1813, "lr": 9.287666243483473e-07, "epoch": 2.2346368715083798, "percentage": 74.49, "elapsed_time": "5:43:25", "remaining_time": "1:57:37"} | |
| {"current_steps": 4410, "total_steps": 5907, "loss": 0.1823, "lr": 9.17302509904821e-07, "epoch": 2.239715591670899, "percentage": 74.66, "elapsed_time": "5:44:01", "remaining_time": "1:56:46"} | |
| {"current_steps": 4420, "total_steps": 5907, "loss": 0.188, "lr": 9.058936701684698e-07, "epoch": 2.244794311833418, "percentage": 74.83, "elapsed_time": "5:44:34", "remaining_time": "1:55:55"} | |
| {"current_steps": 4430, "total_steps": 5907, "loss": 0.1864, "lr": 8.945405035857932e-07, "epoch": 2.249873031995937, "percentage": 75.0, "elapsed_time": "5:45:18", "remaining_time": "1:55:07"} | |
| {"current_steps": 4440, "total_steps": 5907, "loss": 0.189, "lr": 8.832434066589432e-07, "epoch": 2.254951752158456, "percentage": 75.17, "elapsed_time": "5:45:49", "remaining_time": "1:54:15"} | |
| {"current_steps": 4450, "total_steps": 5907, "loss": 0.1851, "lr": 8.720027739318724e-07, "epoch": 2.260030472320975, "percentage": 75.33, "elapsed_time": "5:46:42", "remaining_time": "1:53:31"} | |
| {"current_steps": 4460, "total_steps": 5907, "loss": 0.1854, "lr": 8.608189979765563e-07, "epoch": 2.2651091924834943, "percentage": 75.5, "elapsed_time": "5:47:13", "remaining_time": "1:52:39"} | |
| {"current_steps": 4470, "total_steps": 5907, "loss": 0.1869, "lr": 8.496924693792872e-07, "epoch": 2.2701879126460134, "percentage": 75.67, "elapsed_time": "5:48:01", "remaining_time": "1:51:52"} | |
| {"current_steps": 4480, "total_steps": 5907, "loss": 0.1873, "lr": 8.386235767270256e-07, "epoch": 2.275266632808532, "percentage": 75.84, "elapsed_time": "5:48:32", "remaining_time": "1:51:01"} | |
| {"current_steps": 4490, "total_steps": 5907, "loss": 0.1859, "lr": 8.27612706593837e-07, "epoch": 2.280345352971051, "percentage": 76.01, "elapsed_time": "5:49:05", "remaining_time": "1:50:10"} | |
| {"current_steps": 4500, "total_steps": 5907, "loss": 0.1863, "lr": 8.166602435273832e-07, "epoch": 2.28542407313357, "percentage": 76.18, "elapsed_time": "5:50:15", "remaining_time": "1:49:30"} | |
| {"current_steps": 4500, "total_steps": 5907, "eval_loss": 0.3316808044910431, "epoch": 2.28542407313357, "percentage": 76.18, "elapsed_time": "5:52:17", "remaining_time": "1:50:08"} | |
| {"current_steps": 4510, "total_steps": 5907, "loss": 0.1839, "lr": 8.057665700354999e-07, "epoch": 2.2905027932960893, "percentage": 76.35, "elapsed_time": "5:55:27", "remaining_time": "1:50:06"} | |
| {"current_steps": 4520, "total_steps": 5907, "loss": 0.1876, "lr": 7.949320665728319e-07, "epoch": 2.2955815134586084, "percentage": 76.52, "elapsed_time": "5:56:17", "remaining_time": "1:49:19"} | |
| {"current_steps": 4530, "total_steps": 5907, "loss": 0.1804, "lr": 7.841571115275487e-07, "epoch": 2.3006602336211275, "percentage": 76.69, "elapsed_time": "5:57:03", "remaining_time": "1:48:32"} | |
| {"current_steps": 4540, "total_steps": 5907, "loss": 0.1821, "lr": 7.734420812081283e-07, "epoch": 2.3057389537836466, "percentage": 76.86, "elapsed_time": "5:57:44", "remaining_time": "1:47:42"} | |
| {"current_steps": 4550, "total_steps": 5907, "loss": 0.1867, "lr": 7.62787349830218e-07, "epoch": 2.3108176739461657, "percentage": 77.03, "elapsed_time": "5:58:14", "remaining_time": "1:46:50"} | |
| {"current_steps": 4560, "total_steps": 5907, "loss": 0.1847, "lr": 7.521932895035605e-07, "epoch": 2.3158963941086848, "percentage": 77.2, "elapsed_time": "5:58:59", "remaining_time": "1:46:02"} | |
| {"current_steps": 4570, "total_steps": 5907, "loss": 0.1804, "lr": 7.416602702190004e-07, "epoch": 2.320975114271204, "percentage": 77.37, "elapsed_time": "5:59:32", "remaining_time": "1:45:11"} | |
| {"current_steps": 4580, "total_steps": 5907, "loss": 0.1832, "lr": 7.311886598355642e-07, "epoch": 2.326053834433723, "percentage": 77.54, "elapsed_time": "6:00:19", "remaining_time": "1:44:23"} | |
| {"current_steps": 4590, "total_steps": 5907, "loss": 0.1847, "lr": 7.207788240676108e-07, "epoch": 2.3311325545962416, "percentage": 77.7, "elapsed_time": "6:01:05", "remaining_time": "1:43:36"} | |
| {"current_steps": 4600, "total_steps": 5907, "loss": 0.1843, "lr": 7.104311264720598e-07, "epoch": 2.3362112747587607, "percentage": 77.87, "elapsed_time": "6:01:53", "remaining_time": "1:42:49"} | |
| {"current_steps": 4610, "total_steps": 5907, "loss": 0.1816, "lr": 7.001459284356938e-07, "epoch": 2.3412899949212798, "percentage": 78.04, "elapsed_time": "6:02:29", "remaining_time": "1:41:59"} | |
| {"current_steps": 4620, "total_steps": 5907, "loss": 0.1845, "lr": 6.899235891625372e-07, "epoch": 2.346368715083799, "percentage": 78.21, "elapsed_time": "6:03:18", "remaining_time": "1:41:12"} | |
| {"current_steps": 4630, "total_steps": 5907, "loss": 0.1828, "lr": 6.79764465661315e-07, "epoch": 2.351447435246318, "percentage": 78.38, "elapsed_time": "6:03:51", "remaining_time": "1:40:21"} | |
| {"current_steps": 4640, "total_steps": 5907, "loss": 0.184, "lr": 6.696689127329792e-07, "epoch": 2.356526155408837, "percentage": 78.55, "elapsed_time": "6:04:30", "remaining_time": "1:39:31"} | |
| {"current_steps": 4650, "total_steps": 5907, "loss": 0.1855, "lr": 6.596372829583184e-07, "epoch": 2.361604875571356, "percentage": 78.72, "elapsed_time": "6:05:03", "remaining_time": "1:38:40"} | |
| {"current_steps": 4660, "total_steps": 5907, "loss": 0.188, "lr": 6.496699266856493e-07, "epoch": 2.366683595733875, "percentage": 78.89, "elapsed_time": "6:05:33", "remaining_time": "1:37:49"} | |
| {"current_steps": 4670, "total_steps": 5907, "loss": 0.1808, "lr": 6.397671920185738e-07, "epoch": 2.3717623158963943, "percentage": 79.06, "elapsed_time": "6:06:32", "remaining_time": "1:37:05"} | |
| {"current_steps": 4680, "total_steps": 5907, "loss": 0.1828, "lr": 6.299294248038281e-07, "epoch": 2.376841036058913, "percentage": 79.23, "elapsed_time": "6:07:09", "remaining_time": "1:36:15"} | |
| {"current_steps": 4690, "total_steps": 5907, "loss": 0.1861, "lr": 6.201569686191988e-07, "epoch": 2.381919756221432, "percentage": 79.4, "elapsed_time": "6:08:07", "remaining_time": "1:35:31"} | |
| {"current_steps": 4700, "total_steps": 5907, "loss": 0.1919, "lr": 6.104501647615265e-07, "epoch": 2.386998476383951, "percentage": 79.57, "elapsed_time": "6:08:52", "remaining_time": "1:34:43"} | |
| {"current_steps": 4710, "total_steps": 5907, "loss": 0.1847, "lr": 6.00809352234788e-07, "epoch": 2.39207719654647, "percentage": 79.74, "elapsed_time": "6:09:24", "remaining_time": "1:33:52"} | |
| {"current_steps": 4720, "total_steps": 5907, "loss": 0.1865, "lr": 5.912348677382523e-07, "epoch": 2.3971559167089893, "percentage": 79.91, "elapsed_time": "6:10:12", "remaining_time": "1:33:06"} | |
| {"current_steps": 4730, "total_steps": 5907, "loss": 0.1873, "lr": 5.81727045654725e-07, "epoch": 2.4022346368715084, "percentage": 80.07, "elapsed_time": "6:11:12", "remaining_time": "1:32:22"} | |
| {"current_steps": 4740, "total_steps": 5907, "loss": 0.1823, "lr": 5.722862180388683e-07, "epoch": 2.4073133570340275, "percentage": 80.24, "elapsed_time": "6:12:00", "remaining_time": "1:31:35"} | |
| {"current_steps": 4750, "total_steps": 5907, "loss": 0.1804, "lr": 5.629127146056062e-07, "epoch": 2.4123920771965466, "percentage": 80.41, "elapsed_time": "6:12:32", "remaining_time": "1:30:44"} | |
| {"current_steps": 4760, "total_steps": 5907, "loss": 0.184, "lr": 5.536068627186089e-07, "epoch": 2.4174707973590657, "percentage": 80.58, "elapsed_time": "6:13:09", "remaining_time": "1:29:55"} | |
| {"current_steps": 4770, "total_steps": 5907, "loss": 0.1855, "lr": 5.443689873788572e-07, "epoch": 2.4225495175215848, "percentage": 80.75, "elapsed_time": "6:14:14", "remaining_time": "1:29:12"} | |
| {"current_steps": 4780, "total_steps": 5907, "loss": 0.1845, "lr": 5.351994112132944e-07, "epoch": 2.427628237684104, "percentage": 80.92, "elapsed_time": "6:14:51", "remaining_time": "1:28:22"} | |
| {"current_steps": 4790, "total_steps": 5907, "loss": 0.1869, "lr": 5.260984544635603e-07, "epoch": 2.4327069578466225, "percentage": 81.09, "elapsed_time": "6:15:24", "remaining_time": "1:27:32"} | |
| {"current_steps": 4800, "total_steps": 5907, "loss": 0.1836, "lr": 5.170664349748031e-07, "epoch": 2.4377856780091416, "percentage": 81.26, "elapsed_time": "6:16:11", "remaining_time": "1:26:45"} | |
| {"current_steps": 4810, "total_steps": 5907, "loss": 0.1896, "lr": 5.081036681845813e-07, "epoch": 2.4428643981716607, "percentage": 81.43, "elapsed_time": "6:17:10", "remaining_time": "1:26:01"} | |
| {"current_steps": 4820, "total_steps": 5907, "loss": 0.1869, "lr": 4.99210467111847e-07, "epoch": 2.4479431183341798, "percentage": 81.6, "elapsed_time": "6:17:45", "remaining_time": "1:25:11"} | |
| {"current_steps": 4830, "total_steps": 5907, "loss": 0.1845, "lr": 4.903871423460141e-07, "epoch": 2.453021838496699, "percentage": 81.77, "elapsed_time": "6:18:21", "remaining_time": "1:24:22"} | |
| {"current_steps": 4840, "total_steps": 5907, "loss": 0.1766, "lr": 4.816340020361096e-07, "epoch": 2.458100558659218, "percentage": 81.94, "elapsed_time": "6:19:08", "remaining_time": "1:23:35"} | |
| {"current_steps": 4850, "total_steps": 5907, "loss": 0.1829, "lr": 4.7295135188001465e-07, "epoch": 2.463179278821737, "percentage": 82.11, "elapsed_time": "6:19:39", "remaining_time": "1:22:44"} | |
| {"current_steps": 4860, "total_steps": 5907, "loss": 0.1861, "lr": 4.6433949511378417e-07, "epoch": 2.468257998984256, "percentage": 82.28, "elapsed_time": "6:20:12", "remaining_time": "1:21:54"} | |
| {"current_steps": 4870, "total_steps": 5907, "loss": 0.1865, "lr": 4.557987325010613e-07, "epoch": 2.4733367191467748, "percentage": 82.44, "elapsed_time": "6:20:45", "remaining_time": "1:21:04"} | |
| {"current_steps": 4880, "total_steps": 5907, "loss": 0.1812, "lr": 4.4732936232256855e-07, "epoch": 2.478415439309294, "percentage": 82.61, "elapsed_time": "6:21:41", "remaining_time": "1:20:19"} | |
| {"current_steps": 4890, "total_steps": 5907, "loss": 0.1918, "lr": 4.389316803656943e-07, "epoch": 2.483494159471813, "percentage": 82.78, "elapsed_time": "6:22:16", "remaining_time": "1:19:30"} | |
| {"current_steps": 4900, "total_steps": 5907, "loss": 0.178, "lr": 4.3060597991415987e-07, "epoch": 2.488572879634332, "percentage": 82.95, "elapsed_time": "6:23:31", "remaining_time": "1:18:49"} | |
| {"current_steps": 4910, "total_steps": 5907, "loss": 0.1864, "lr": 4.223525517377805e-07, "epoch": 2.493651599796851, "percentage": 83.12, "elapsed_time": "6:24:31", "remaining_time": "1:18:04"} | |
| {"current_steps": 4920, "total_steps": 5907, "loss": 0.1849, "lr": 4.1417168408230596e-07, "epoch": 2.4987303199593702, "percentage": 83.29, "elapsed_time": "6:25:25", "remaining_time": "1:17:19"} | |
| {"current_steps": 4930, "total_steps": 5907, "loss": 0.1816, "lr": 4.060636626593556e-07, "epoch": 2.5038090401218893, "percentage": 83.46, "elapsed_time": "6:25:55", "remaining_time": "1:16:28"} | |
| {"current_steps": 4940, "total_steps": 5907, "loss": 0.1833, "lr": 3.9802877063644193e-07, "epoch": 2.5088877602844084, "percentage": 83.63, "elapsed_time": "6:26:55", "remaining_time": "1:15:44"} | |
| {"current_steps": 4950, "total_steps": 5907, "loss": 0.1781, "lr": 3.9006728862707925e-07, "epoch": 2.5139664804469275, "percentage": 83.8, "elapsed_time": "6:27:42", "remaining_time": "1:14:57"} | |
| {"current_steps": 4960, "total_steps": 5907, "loss": 0.1813, "lr": 3.8217949468098205e-07, "epoch": 2.5190452006094466, "percentage": 83.97, "elapsed_time": "6:28:13", "remaining_time": "1:14:07"} | |
| {"current_steps": 4970, "total_steps": 5907, "loss": 0.1915, "lr": 3.7436566427435675e-07, "epoch": 2.5241239207719657, "percentage": 84.14, "elapsed_time": "6:28:47", "remaining_time": "1:13:17"} | |
| {"current_steps": 4980, "total_steps": 5907, "loss": 0.1843, "lr": 3.6662607030028e-07, "epoch": 2.5292026409344848, "percentage": 84.31, "elapsed_time": "6:29:21", "remaining_time": "1:12:28"} | |
| {"current_steps": 4990, "total_steps": 5907, "loss": 0.1877, "lr": 3.589609830591692e-07, "epoch": 2.5342813610970034, "percentage": 84.48, "elapsed_time": "6:29:53", "remaining_time": "1:11:38"} | |
| {"current_steps": 5000, "total_steps": 5907, "loss": 0.1825, "lr": 3.513706702493394e-07, "epoch": 2.5393600812595225, "percentage": 84.65, "elapsed_time": "6:30:25", "remaining_time": "1:10:49"} | |
| {"current_steps": 5000, "total_steps": 5907, "eval_loss": 0.3309638500213623, "epoch": 2.5393600812595225, "percentage": 84.65, "elapsed_time": "6:32:27", "remaining_time": "1:11:11"} | |
| {"current_steps": 5010, "total_steps": 5907, "loss": 0.1831, "lr": 3.438553969576569e-07, "epoch": 2.5444388014220416, "percentage": 84.81, "elapsed_time": "6:36:19", "remaining_time": "1:10:57"} | |
| {"current_steps": 5020, "total_steps": 5907, "loss": 0.184, "lr": 3.364154256502808e-07, "epoch": 2.5495175215845607, "percentage": 84.98, "elapsed_time": "6:37:07", "remaining_time": "1:10:10"} | |
| {"current_steps": 5030, "total_steps": 5907, "loss": 0.1854, "lr": 3.2905101616349497e-07, "epoch": 2.5545962417470798, "percentage": 85.15, "elapsed_time": "6:37:50", "remaining_time": "1:09:21"} | |
| {"current_steps": 5040, "total_steps": 5907, "loss": 0.1832, "lr": 3.217624256946361e-07, "epoch": 2.559674961909599, "percentage": 85.32, "elapsed_time": "6:38:47", "remaining_time": "1:08:36"} | |
| {"current_steps": 5050, "total_steps": 5907, "loss": 0.1827, "lr": 3.1454990879310866e-07, "epoch": 2.564753682072118, "percentage": 85.49, "elapsed_time": "6:40:02", "remaining_time": "1:07:53"} | |
| {"current_steps": 5060, "total_steps": 5907, "loss": 0.1825, "lr": 3.0741371735149544e-07, "epoch": 2.5698324022346366, "percentage": 85.66, "elapsed_time": "6:40:35", "remaining_time": "1:07:03"} | |
| {"current_steps": 5070, "total_steps": 5907, "loss": 0.1785, "lr": 3.003541005967628e-07, "epoch": 2.5749111223971557, "percentage": 85.83, "elapsed_time": "6:41:18", "remaining_time": "1:06:15"} | |
| {"current_steps": 5080, "total_steps": 5907, "loss": 0.1784, "lr": 2.9337130508155287e-07, "epoch": 2.579989842559675, "percentage": 86.0, "elapsed_time": "6:42:08", "remaining_time": "1:05:28"} | |
| {"current_steps": 5090, "total_steps": 5907, "loss": 0.1831, "lr": 2.8646557467557514e-07, "epoch": 2.585068562722194, "percentage": 86.17, "elapsed_time": "6:42:41", "remaining_time": "1:04:38"} | |
| {"current_steps": 5100, "total_steps": 5907, "loss": 0.1865, "lr": 2.796371505570888e-07, "epoch": 2.590147282884713, "percentage": 86.34, "elapsed_time": "6:43:16", "remaining_time": "1:03:48"} | |
| {"current_steps": 5110, "total_steps": 5907, "loss": 0.1794, "lr": 2.728862712044811e-07, "epoch": 2.595226003047232, "percentage": 86.51, "elapsed_time": "6:44:03", "remaining_time": "1:03:01"} | |
| {"current_steps": 5120, "total_steps": 5907, "loss": 0.1859, "lr": 2.662131723879366e-07, "epoch": 2.600304723209751, "percentage": 86.68, "elapsed_time": "6:44:37", "remaining_time": "1:02:11"} | |
| {"current_steps": 5130, "total_steps": 5907, "loss": 0.1797, "lr": 2.5961808716120364e-07, "epoch": 2.6053834433722702, "percentage": 86.85, "elapsed_time": "6:45:09", "remaining_time": "1:01:21"} | |
| {"current_steps": 5140, "total_steps": 5907, "loss": 0.1812, "lr": 2.531012458534551e-07, "epoch": 2.6104621635347893, "percentage": 87.02, "elapsed_time": "6:45:57", "remaining_time": "1:00:34"} | |
| {"current_steps": 5150, "total_steps": 5907, "loss": 0.1874, "lr": 2.466628760612463e-07, "epoch": 2.6155408836973084, "percentage": 87.18, "elapsed_time": "6:46:29", "remaining_time": "0:59:45"} | |
| {"current_steps": 5160, "total_steps": 5907, "loss": 0.1774, "lr": 2.40303202640563e-07, "epoch": 2.6206196038598275, "percentage": 87.35, "elapsed_time": "6:47:11", "remaining_time": "0:58:56"} | |
| {"current_steps": 5170, "total_steps": 5907, "loss": 0.1843, "lr": 2.3402244769896998e-07, "epoch": 2.6256983240223466, "percentage": 87.52, "elapsed_time": "6:47:58", "remaining_time": "0:58:09"} | |
| {"current_steps": 5180, "total_steps": 5907, "loss": 0.1829, "lr": 2.2782083058785458e-07, "epoch": 2.6307770441848657, "percentage": 87.69, "elapsed_time": "6:48:29", "remaining_time": "0:57:19"} | |
| {"current_steps": 5190, "total_steps": 5907, "loss": 0.1821, "lr": 2.216985678947664e-07, "epoch": 2.6358557643473843, "percentage": 87.86, "elapsed_time": "6:49:03", "remaining_time": "0:56:30"} | |
| {"current_steps": 5200, "total_steps": 5907, "loss": 0.1774, "lr": 2.156558734358505e-07, "epoch": 2.6409344845099034, "percentage": 88.03, "elapsed_time": "6:49:38", "remaining_time": "0:55:41"} | |
| {"current_steps": 5210, "total_steps": 5907, "loss": 0.1852, "lr": 2.0969295824838336e-07, "epoch": 2.6460132046724225, "percentage": 88.2, "elapsed_time": "6:50:18", "remaining_time": "0:54:53"} | |
| {"current_steps": 5220, "total_steps": 5907, "loss": 0.1832, "lr": 2.0381003058339982e-07, "epoch": 2.6510919248349416, "percentage": 88.37, "elapsed_time": "6:51:46", "remaining_time": "0:54:11"} | |
| {"current_steps": 5230, "total_steps": 5907, "loss": 0.1763, "lr": 1.9800729589842222e-07, "epoch": 2.6561706449974607, "percentage": 88.54, "elapsed_time": "6:52:45", "remaining_time": "0:53:25"} | |
| {"current_steps": 5240, "total_steps": 5907, "loss": 0.1815, "lr": 1.92284956850283e-07, "epoch": 2.66124936515998, "percentage": 88.71, "elapsed_time": "6:53:21", "remaining_time": "0:52:37"} | |
| {"current_steps": 5250, "total_steps": 5907, "loss": 0.1834, "lr": 1.866432132880483e-07, "epoch": 2.666328085322499, "percentage": 88.88, "elapsed_time": "6:54:07", "remaining_time": "0:51:49"} | |
| {"current_steps": 5260, "total_steps": 5907, "loss": 0.1804, "lr": 1.8108226224603732e-07, "epoch": 2.6714068054850175, "percentage": 89.05, "elapsed_time": "6:54:39", "remaining_time": "0:51:00"} | |
| {"current_steps": 5270, "total_steps": 5907, "loss": 0.1828, "lr": 1.7560229793694288e-07, "epoch": 2.6764855256475366, "percentage": 89.22, "elapsed_time": "6:55:12", "remaining_time": "0:50:11"} | |
| {"current_steps": 5280, "total_steps": 5907, "loss": 0.1901, "lr": 1.702035117450468e-07, "epoch": 2.6815642458100557, "percentage": 89.39, "elapsed_time": "6:55:44", "remaining_time": "0:49:22"} | |
| {"current_steps": 5290, "total_steps": 5907, "loss": 0.1797, "lr": 1.6488609221953612e-07, "epoch": 2.686642965972575, "percentage": 89.55, "elapsed_time": "6:56:16", "remaining_time": "0:48:33"} | |
| {"current_steps": 5300, "total_steps": 5907, "loss": 0.1799, "lr": 1.596502250679194e-07, "epoch": 2.691721686135094, "percentage": 89.72, "elapsed_time": "6:57:08", "remaining_time": "0:47:46"} | |
| {"current_steps": 5310, "total_steps": 5907, "loss": 0.1849, "lr": 1.5449609314954012e-07, "epoch": 2.696800406297613, "percentage": 89.89, "elapsed_time": "6:57:45", "remaining_time": "0:46:58"} | |
| {"current_steps": 5320, "total_steps": 5907, "loss": 0.181, "lr": 1.494238764691902e-07, "epoch": 2.701879126460132, "percentage": 90.06, "elapsed_time": "6:58:29", "remaining_time": "0:46:10"} | |
| {"current_steps": 5330, "total_steps": 5907, "loss": 0.1806, "lr": 1.444337521708236e-07, "epoch": 2.706957846622651, "percentage": 90.23, "elapsed_time": "6:59:10", "remaining_time": "0:45:22"} | |
| {"current_steps": 5340, "total_steps": 5907, "loss": 0.1803, "lr": 1.3952589453137017e-07, "epoch": 2.7120365667851702, "percentage": 90.4, "elapsed_time": "6:59:43", "remaining_time": "0:44:33"} | |
| {"current_steps": 5350, "total_steps": 5907, "loss": 0.1779, "lr": 1.3470047495464905e-07, "epoch": 2.7171152869476893, "percentage": 90.57, "elapsed_time": "7:00:21", "remaining_time": "0:43:45"} | |
| {"current_steps": 5360, "total_steps": 5907, "loss": 0.179, "lr": 1.2995766196538194e-07, "epoch": 2.7221940071102084, "percentage": 90.74, "elapsed_time": "7:01:21", "remaining_time": "0:43:00"} | |
| {"current_steps": 5370, "total_steps": 5907, "loss": 0.1862, "lr": 1.252976212033072e-07, "epoch": 2.7272727272727275, "percentage": 90.91, "elapsed_time": "7:01:57", "remaining_time": "0:42:11"} | |
| {"current_steps": 5380, "total_steps": 5907, "loss": 0.171, "lr": 1.2072051541739682e-07, "epoch": 2.732351447435246, "percentage": 91.08, "elapsed_time": "7:02:47", "remaining_time": "0:41:24"} | |
| {"current_steps": 5390, "total_steps": 5907, "loss": 0.1785, "lr": 1.1622650446017042e-07, "epoch": 2.7374301675977653, "percentage": 91.25, "elapsed_time": "7:03:21", "remaining_time": "0:40:36"} | |
| {"current_steps": 5400, "total_steps": 5907, "loss": 0.1764, "lr": 1.118157452821142e-07, "epoch": 2.7425088877602843, "percentage": 91.42, "elapsed_time": "7:04:17", "remaining_time": "0:39:50"} | |
| {"current_steps": 5410, "total_steps": 5907, "loss": 0.1809, "lr": 1.0748839192619764e-07, "epoch": 2.7475876079228034, "percentage": 91.59, "elapsed_time": "7:04:49", "remaining_time": "0:39:01"} | |
| {"current_steps": 5420, "total_steps": 5907, "loss": 0.1804, "lr": 1.0324459552249505e-07, "epoch": 2.7526663280853225, "percentage": 91.76, "elapsed_time": "7:05:33", "remaining_time": "0:38:14"} | |
| {"current_steps": 5430, "total_steps": 5907, "loss": 0.1837, "lr": 9.908450428290806e-08, "epoch": 2.7577450482478416, "percentage": 91.92, "elapsed_time": "7:06:16", "remaining_time": "0:37:26"} | |
| {"current_steps": 5440, "total_steps": 5907, "loss": 0.1779, "lr": 9.500826349598729e-08, "epoch": 2.7628237684103607, "percentage": 92.09, "elapsed_time": "7:06:56", "remaining_time": "0:36:39"} | |
| {"current_steps": 5450, "total_steps": 5907, "loss": 0.1853, "lr": 9.101601552185951e-08, "epoch": 2.76790248857288, "percentage": 92.26, "elapsed_time": "7:07:44", "remaining_time": "0:35:52"} | |
| {"current_steps": 5460, "total_steps": 5907, "loss": 0.1817, "lr": 8.710789978725653e-08, "epoch": 2.7729812087353984, "percentage": 92.43, "elapsed_time": "7:08:19", "remaining_time": "0:35:03"} | |
| {"current_steps": 5470, "total_steps": 5907, "loss": 0.1843, "lr": 8.328405278064417e-08, "epoch": 2.7780599288979175, "percentage": 92.6, "elapsed_time": "7:08:49", "remaining_time": "0:34:15"} | |
| {"current_steps": 5480, "total_steps": 5907, "loss": 0.1811, "lr": 7.954460804745712e-08, "epoch": 2.7831386490604366, "percentage": 92.77, "elapsed_time": "7:09:36", "remaining_time": "0:33:28"} | |
| {"current_steps": 5490, "total_steps": 5907, "loss": 0.1873, "lr": 7.588969618543357e-08, "epoch": 2.7882173692229557, "percentage": 92.94, "elapsed_time": "7:10:05", "remaining_time": "0:32:40"} | |
| {"current_steps": 5500, "total_steps": 5907, "loss": 0.1787, "lr": 7.231944484005437e-08, "epoch": 2.793296089385475, "percentage": 93.11, "elapsed_time": "7:10:49", "remaining_time": "0:31:52"} | |
| {"current_steps": 5500, "total_steps": 5907, "eval_loss": 0.3309679627418518, "epoch": 2.793296089385475, "percentage": 93.11, "elapsed_time": "7:12:51", "remaining_time": "0:32:01"} | |
| {"current_steps": 5510, "total_steps": 5907, "loss": 0.1831, "lr": 6.883397870008662e-08, "epoch": 2.798374809547994, "percentage": 93.28, "elapsed_time": "7:16:43", "remaining_time": "0:31:27"} | |
| {"current_steps": 5520, "total_steps": 5907, "loss": 0.1851, "lr": 6.543341949322657e-08, "epoch": 2.803453529710513, "percentage": 93.45, "elapsed_time": "7:17:19", "remaining_time": "0:30:39"} | |
| {"current_steps": 5530, "total_steps": 5907, "loss": 0.182, "lr": 6.211788598185081e-08, "epoch": 2.808532249873032, "percentage": 93.62, "elapsed_time": "7:17:50", "remaining_time": "0:29:50"} | |
| {"current_steps": 5540, "total_steps": 5907, "loss": 0.1809, "lr": 5.8887493958866004e-08, "epoch": 2.813610970035551, "percentage": 93.79, "elapsed_time": "7:18:41", "remaining_time": "0:29:03"} | |
| {"current_steps": 5550, "total_steps": 5907, "loss": 0.1875, "lr": 5.574235624366764e-08, "epoch": 2.8186896901980703, "percentage": 93.96, "elapsed_time": "7:19:27", "remaining_time": "0:28:16"} | |
| {"current_steps": 5560, "total_steps": 5907, "loss": 0.1777, "lr": 5.2682582678197644e-08, "epoch": 2.8237684103605893, "percentage": 94.13, "elapsed_time": "7:19:59", "remaining_time": "0:27:27"} | |
| {"current_steps": 5570, "total_steps": 5907, "loss": 0.1856, "lr": 4.970828012310969e-08, "epoch": 2.8288471305231084, "percentage": 94.29, "elapsed_time": "7:20:46", "remaining_time": "0:26:40"} | |
| {"current_steps": 5580, "total_steps": 5907, "loss": 0.1872, "lr": 4.681955245403602e-08, "epoch": 2.833925850685627, "percentage": 94.46, "elapsed_time": "7:21:30", "remaining_time": "0:25:52"} | |
| {"current_steps": 5590, "total_steps": 5907, "loss": 0.1811, "lr": 4.401650055796042e-08, "epoch": 2.839004570848146, "percentage": 94.63, "elapsed_time": "7:22:18", "remaining_time": "0:25:04"} | |
| {"current_steps": 5600, "total_steps": 5907, "loss": 0.177, "lr": 4.1299222329694574e-08, "epoch": 2.8440832910106653, "percentage": 94.8, "elapsed_time": "7:23:30", "remaining_time": "0:24:18"} | |
| {"current_steps": 5610, "total_steps": 5907, "loss": 0.1863, "lr": 3.8667812668459204e-08, "epoch": 2.8491620111731844, "percentage": 94.97, "elapsed_time": "7:24:05", "remaining_time": "0:23:30"} | |
| {"current_steps": 5620, "total_steps": 5907, "loss": 0.1799, "lr": 3.612236347456943e-08, "epoch": 2.8542407313357034, "percentage": 95.14, "elapsed_time": "7:24:38", "remaining_time": "0:22:42"} | |
| {"current_steps": 5630, "total_steps": 5907, "loss": 0.1836, "lr": 3.366296364622629e-08, "epoch": 2.8593194514982225, "percentage": 95.31, "elapsed_time": "7:25:48", "remaining_time": "0:21:56"} | |
| {"current_steps": 5640, "total_steps": 5907, "loss": 0.1813, "lr": 3.128969907641027e-08, "epoch": 2.8643981716607416, "percentage": 95.48, "elapsed_time": "7:26:34", "remaining_time": "0:21:08"} | |
| {"current_steps": 5650, "total_steps": 5907, "loss": 0.1851, "lr": 2.9002652649882945e-08, "epoch": 2.8694768918232603, "percentage": 95.65, "elapsed_time": "7:27:13", "remaining_time": "0:20:20"} | |
| {"current_steps": 5660, "total_steps": 5907, "loss": 0.1795, "lr": 2.6801904240292275e-08, "epoch": 2.8745556119857794, "percentage": 95.82, "elapsed_time": "7:27:45", "remaining_time": "0:19:32"} | |
| {"current_steps": 5670, "total_steps": 5907, "loss": 0.1883, "lr": 2.4687530707381836e-08, "epoch": 2.8796343321482984, "percentage": 95.99, "elapsed_time": "7:28:19", "remaining_time": "0:18:44"} | |
| {"current_steps": 5680, "total_steps": 5907, "loss": 0.1755, "lr": 2.265960589430821e-08, "epoch": 2.8847130523108175, "percentage": 96.16, "elapsed_time": "7:28:50", "remaining_time": "0:17:56"} | |
| {"current_steps": 5690, "total_steps": 5907, "loss": 0.1806, "lr": 2.0718200625060302e-08, "epoch": 2.8897917724733366, "percentage": 96.33, "elapsed_time": "7:29:34", "remaining_time": "0:17:08"} | |
| {"current_steps": 5700, "total_steps": 5907, "loss": 0.1838, "lr": 1.8863382701987675e-08, "epoch": 2.8948704926358557, "percentage": 96.5, "elapsed_time": "7:30:07", "remaining_time": "0:16:20"} | |
| {"current_steps": 5710, "total_steps": 5907, "loss": 0.1822, "lr": 1.70952169034308e-08, "epoch": 2.899949212798375, "percentage": 96.66, "elapsed_time": "7:30:52", "remaining_time": "0:15:33"} | |
| {"current_steps": 5720, "total_steps": 5907, "loss": 0.189, "lr": 1.5413764981460354e-08, "epoch": 2.905027932960894, "percentage": 96.83, "elapsed_time": "7:31:33", "remaining_time": "0:14:45"} | |
| {"current_steps": 5730, "total_steps": 5907, "loss": 0.1802, "lr": 1.3819085659719233e-08, "epoch": 2.910106653123413, "percentage": 97.0, "elapsed_time": "7:32:16", "remaining_time": "0:13:58"} | |
| {"current_steps": 5740, "total_steps": 5907, "loss": 0.185, "lr": 1.2311234631372514e-08, "epoch": 2.915185373285932, "percentage": 97.17, "elapsed_time": "7:33:00", "remaining_time": "0:13:10"} | |
| {"current_steps": 5750, "total_steps": 5907, "loss": 0.1859, "lr": 1.0890264557162356e-08, "epoch": 2.920264093448451, "percentage": 97.34, "elapsed_time": "7:33:53", "remaining_time": "0:12:23"} | |
| {"current_steps": 5760, "total_steps": 5907, "loss": 0.1822, "lr": 9.556225063568347e-09, "epoch": 2.9253428136109703, "percentage": 97.51, "elapsed_time": "7:34:26", "remaining_time": "0:11:35"} | |
| {"current_steps": 5770, "total_steps": 5907, "loss": 0.1837, "lr": 8.309162741074461e-09, "epoch": 2.9304215337734894, "percentage": 97.68, "elapsed_time": "7:35:24", "remaining_time": "0:10:48"} | |
| {"current_steps": 5780, "total_steps": 5907, "loss": 0.1833, "lr": 7.149121142542292e-09, "epoch": 2.935500253936008, "percentage": 97.85, "elapsed_time": "7:35:56", "remaining_time": "0:10:01"} | |
| {"current_steps": 5790, "total_steps": 5907, "loss": 0.1856, "lr": 6.076140781690054e-09, "epoch": 2.940578974098527, "percentage": 98.02, "elapsed_time": "7:36:26", "remaining_time": "0:09:13"} | |
| {"current_steps": 5800, "total_steps": 5907, "loss": 0.1853, "lr": 5.090259131676767e-09, "epoch": 2.945657694261046, "percentage": 98.19, "elapsed_time": "7:37:14", "remaining_time": "0:08:26"} | |
| {"current_steps": 5810, "total_steps": 5907, "loss": 0.1823, "lr": 4.191510623794414e-09, "epoch": 2.9507364144235653, "percentage": 98.36, "elapsed_time": "7:37:49", "remaining_time": "0:07:38"} | |
| {"current_steps": 5820, "total_steps": 5907, "loss": 0.1791, "lr": 3.379926646265852e-09, "epoch": 2.9558151345860844, "percentage": 98.53, "elapsed_time": "7:38:22", "remaining_time": "0:06:51"} | |
| {"current_steps": 5830, "total_steps": 5907, "loss": 0.1831, "lr": 2.6555355431465145e-09, "epoch": 2.9608938547486034, "percentage": 98.7, "elapsed_time": "7:38:55", "remaining_time": "0:06:03"} | |
| {"current_steps": 5840, "total_steps": 5907, "loss": 0.1836, "lr": 2.0183626133374325e-09, "epoch": 2.9659725749111225, "percentage": 98.87, "elapsed_time": "7:39:40", "remaining_time": "0:05:16"} | |
| {"current_steps": 5850, "total_steps": 5907, "loss": 0.1823, "lr": 1.4684301096992704e-09, "epoch": 2.971051295073641, "percentage": 99.04, "elapsed_time": "7:40:14", "remaining_time": "0:04:29"} | |
| {"current_steps": 5860, "total_steps": 5907, "loss": 0.1829, "lr": 1.0057572382765613e-09, "epoch": 2.9761300152361603, "percentage": 99.2, "elapsed_time": "7:41:23", "remaining_time": "0:03:42"} | |
| {"current_steps": 5870, "total_steps": 5907, "loss": 0.1855, "lr": 6.303601576257423e-10, "epoch": 2.9812087353986794, "percentage": 99.37, "elapsed_time": "7:42:12", "remaining_time": "0:02:54"} | |
| {"current_steps": 5880, "total_steps": 5907, "loss": 0.1824, "lr": 3.4225197825227264e-10, "epoch": 2.9862874555611985, "percentage": 99.54, "elapsed_time": "7:42:52", "remaining_time": "0:02:07"} | |
| {"current_steps": 5890, "total_steps": 5907, "loss": 0.187, "lr": 1.4144276215211085e-10, "epoch": 2.9913661757237175, "percentage": 99.71, "elapsed_time": "7:43:39", "remaining_time": "0:01:20"} | |
| {"current_steps": 5900, "total_steps": 5907, "loss": 0.1806, "lr": 2.793952245921938e-11, "epoch": 2.9964448958862366, "percentage": 99.88, "elapsed_time": "7:44:14", "remaining_time": "0:00:33"} | |
| {"current_steps": 5907, "total_steps": 5907, "epoch": 3.0, "percentage": 100.0, "elapsed_time": "7:47:00", "remaining_time": "0:00:00"} | |
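
The rows above mix two kinds of records: training rows carry `loss` and `lr`, while the rows logged at steps 4500, 5000, and 5500 carry `eval_loss` instead. A minimal sketch of how such rows could be parsed and the lowest-`eval_loss` checkpoint located is shown here. It assumes each row is a single JSON object wrapped in table pipes, as in this log; the helper names `parse_log_rows` and `best_eval` are hypothetical and not part of any library.

```python
import json


def parse_log_rows(rows):
    """Split table rows of the form '| {json} | |' into train and eval records."""
    train, evals = [], []
    for row in rows:
        start, end = row.find("{"), row.rfind("}")
        if start == -1 or end == -1:
            continue  # no JSON object on this row
        record = json.loads(row[start:end + 1])
        if "eval_loss" in record:
            evals.append(record)
        elif "loss" in record:
            train.append(record)
        # rows with neither key (e.g. the final 100% row) are ignored
    return train, evals


def best_eval(evals):
    """Return the eval record with the smallest eval_loss."""
    return min(evals, key=lambda r: r["eval_loss"])


# Sample rows copied from the log above (abbreviated to a few records).
sample_rows = [
    '| {"current_steps": 4500, "total_steps": 5907, "loss": 0.1863, "lr": 8.166602435273832e-07, "epoch": 2.28542407313357, "percentage": 76.18, "elapsed_time": "5:50:15", "remaining_time": "1:49:30"} | |',
    '| {"current_steps": 4500, "total_steps": 5907, "eval_loss": 0.3316808044910431, "epoch": 2.28542407313357, "percentage": 76.18, "elapsed_time": "5:52:17", "remaining_time": "1:50:08"} | |',
    '| {"current_steps": 5000, "total_steps": 5907, "eval_loss": 0.3309638500213623, "epoch": 2.5393600812595225, "percentage": 84.65, "elapsed_time": "6:32:27", "remaining_time": "1:11:11"} | |',
    '| {"current_steps": 5500, "total_steps": 5907, "eval_loss": 0.3309679627418518, "epoch": 2.793296089385475, "percentage": 93.11, "elapsed_time": "7:12:51", "remaining_time": "0:32:01"} | |',
    '| {"current_steps": 5907, "total_steps": 5907, "epoch": 3.0, "percentage": 100.0, "elapsed_time": "7:47:00", "remaining_time": "0:00:00"} | |',
]

train, evals = parse_log_rows(sample_rows)
best = best_eval(evals)
print(best["current_steps"], best["eval_loss"])
```

Among the three evaluations in this part of the log, the one at step 5000 has the lowest `eval_loss`, and the value at step 5500 is nearly identical. Over the same range the training loss stays around 0.18 while `eval_loss` stays around 0.33.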