Text Generation
Transformers
Safetensors
qwen2
llama-factory
full
Generated from Trainer
conversational
text-generation-inference
Instructions to use ini/AKILM-reason with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use ini/AKILM-reason with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="ini/AKILM-reason") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForMultimodalLM tokenizer = AutoTokenizer.from_pretrained("ini/AKILM-reason") model = AutoModelForMultimodalLM.from_pretrained("ini/AKILM-reason") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use ini/AKILM-reason with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "ini/AKILM-reason" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ini/AKILM-reason", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/ini/AKILM-reason
- SGLang
How to use ini/AKILM-reason with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "ini/AKILM-reason" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ini/AKILM-reason", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "ini/AKILM-reason" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ini/AKILM-reason", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use ini/AKILM-reason with Docker Model Runner:
docker model run hf.co/ini/AKILM-reason
| {"current_steps": 10, "total_steps": 1736, "loss": 0.9813, "lr": 2.8735632183908047e-07, "epoch": 0.04597701149425287, "percentage": 0.58, "elapsed_time": "0:00:56", "remaining_time": "2:41:22"} | |
| {"current_steps": 20, "total_steps": 1736, "loss": 0.9645, "lr": 5.747126436781609e-07, "epoch": 0.09195402298850575, "percentage": 1.15, "elapsed_time": "0:01:56", "remaining_time": "2:45:53"} | |
| {"current_steps": 30, "total_steps": 1736, "loss": 0.8355, "lr": 8.620689655172415e-07, "epoch": 0.13793103448275862, "percentage": 1.73, "elapsed_time": "0:02:51", "remaining_time": "2:42:20"} | |
| {"current_steps": 40, "total_steps": 1736, "loss": 0.7287, "lr": 1.1494252873563219e-06, "epoch": 0.1839080459770115, "percentage": 2.3, "elapsed_time": "0:03:48", "remaining_time": "2:41:26"} | |
| {"current_steps": 50, "total_steps": 1736, "loss": 0.6077, "lr": 1.4367816091954023e-06, "epoch": 0.22988505747126436, "percentage": 2.88, "elapsed_time": "0:04:42", "remaining_time": "2:38:34"} | |
| {"current_steps": 60, "total_steps": 1736, "loss": 0.5534, "lr": 1.724137931034483e-06, "epoch": 0.27586206896551724, "percentage": 3.46, "elapsed_time": "0:05:40", "remaining_time": "2:38:22"} | |
| {"current_steps": 70, "total_steps": 1736, "loss": 0.5322, "lr": 2.0114942528735633e-06, "epoch": 0.3218390804597701, "percentage": 4.03, "elapsed_time": "0:06:33", "remaining_time": "2:36:08"} | |
| {"current_steps": 80, "total_steps": 1736, "loss": 0.5003, "lr": 2.2988505747126437e-06, "epoch": 0.367816091954023, "percentage": 4.61, "elapsed_time": "0:07:35", "remaining_time": "2:37:14"} | |
| {"current_steps": 90, "total_steps": 1736, "loss": 0.4883, "lr": 2.5862068965517246e-06, "epoch": 0.41379310344827586, "percentage": 5.18, "elapsed_time": "0:08:31", "remaining_time": "2:35:55"} | |
| {"current_steps": 100, "total_steps": 1736, "loss": 0.4584, "lr": 2.8735632183908046e-06, "epoch": 0.45977011494252873, "percentage": 5.76, "elapsed_time": "0:09:23", "remaining_time": "2:33:44"} | |
| {"current_steps": 110, "total_steps": 1736, "loss": 0.447, "lr": 3.1609195402298854e-06, "epoch": 0.5057471264367817, "percentage": 6.34, "elapsed_time": "0:10:17", "remaining_time": "2:32:11"} | |
| {"current_steps": 120, "total_steps": 1736, "loss": 0.4556, "lr": 3.448275862068966e-06, "epoch": 0.5517241379310345, "percentage": 6.91, "elapsed_time": "0:11:10", "remaining_time": "2:30:29"} | |
| {"current_steps": 130, "total_steps": 1736, "loss": 0.4354, "lr": 3.7356321839080462e-06, "epoch": 0.5977011494252874, "percentage": 7.49, "elapsed_time": "0:12:09", "remaining_time": "2:30:16"} | |
| {"current_steps": 140, "total_steps": 1736, "loss": 0.4228, "lr": 4.022988505747127e-06, "epoch": 0.6436781609195402, "percentage": 8.06, "elapsed_time": "0:13:03", "remaining_time": "2:28:55"} | |
| {"current_steps": 150, "total_steps": 1736, "loss": 0.4299, "lr": 4.310344827586207e-06, "epoch": 0.6896551724137931, "percentage": 8.64, "elapsed_time": "0:14:00", "remaining_time": "2:28:02"} | |
| {"current_steps": 160, "total_steps": 1736, "loss": 0.4193, "lr": 4.5977011494252875e-06, "epoch": 0.735632183908046, "percentage": 9.22, "elapsed_time": "0:14:57", "remaining_time": "2:27:24"} | |
| {"current_steps": 170, "total_steps": 1736, "loss": 0.4155, "lr": 4.885057471264369e-06, "epoch": 0.7816091954022989, "percentage": 9.79, "elapsed_time": "0:15:55", "remaining_time": "2:26:42"} | |
| {"current_steps": 180, "total_steps": 1736, "loss": 0.4083, "lr": 4.999817969178238e-06, "epoch": 0.8275862068965517, "percentage": 10.37, "elapsed_time": "0:16:43", "remaining_time": "2:24:32"} | |
| {"current_steps": 190, "total_steps": 1736, "loss": 0.4225, "lr": 4.998705654596035e-06, "epoch": 0.8735632183908046, "percentage": 10.94, "elapsed_time": "0:17:37", "remaining_time": "2:23:21"} | |
| {"current_steps": 200, "total_steps": 1736, "loss": 0.4019, "lr": 4.996582603056429e-06, "epoch": 0.9195402298850575, "percentage": 11.52, "elapsed_time": "0:18:34", "remaining_time": "2:22:40"} | |
| {"current_steps": 210, "total_steps": 1736, "loss": 0.3965, "lr": 4.9934496733427066e-06, "epoch": 0.9655172413793104, "percentage": 12.1, "elapsed_time": "0:19:26", "remaining_time": "2:21:13"} | |
| {"current_steps": 220, "total_steps": 1736, "loss": 0.3966, "lr": 4.989308132738127e-06, "epoch": 1.0114942528735633, "percentage": 12.67, "elapsed_time": "0:20:22", "remaining_time": "2:20:25"} | |
| {"current_steps": 230, "total_steps": 1736, "loss": 0.3407, "lr": 4.9841596565133e-06, "epoch": 1.0574712643678161, "percentage": 13.25, "elapsed_time": "0:21:20", "remaining_time": "2:19:42"} | |
| {"current_steps": 240, "total_steps": 1736, "loss": 0.3519, "lr": 4.978006327248537e-06, "epoch": 1.103448275862069, "percentage": 13.82, "elapsed_time": "0:22:19", "remaining_time": "2:19:08"} | |
| {"current_steps": 250, "total_steps": 1736, "loss": 0.3596, "lr": 4.970850633991432e-06, "epoch": 1.1494252873563218, "percentage": 14.4, "elapsed_time": "0:23:14", "remaining_time": "2:18:06"} | |
| {"current_steps": 260, "total_steps": 1736, "loss": 0.343, "lr": 4.962695471250033e-06, "epoch": 1.1954022988505748, "percentage": 14.98, "elapsed_time": "0:24:08", "remaining_time": "2:17:03"} | |
| {"current_steps": 270, "total_steps": 1736, "loss": 0.3369, "lr": 4.953544137822006e-06, "epoch": 1.2413793103448276, "percentage": 15.55, "elapsed_time": "0:25:04", "remaining_time": "2:16:08"} | |
| {"current_steps": 280, "total_steps": 1736, "loss": 0.3627, "lr": 4.9434003354602515e-06, "epoch": 1.2873563218390804, "percentage": 16.13, "elapsed_time": "0:25:57", "remaining_time": "2:15:00"} | |
| {"current_steps": 290, "total_steps": 1736, "loss": 0.3528, "lr": 4.932268167375532e-06, "epoch": 1.3333333333333333, "percentage": 16.71, "elapsed_time": "0:26:52", "remaining_time": "2:13:59"} | |
| {"current_steps": 300, "total_steps": 1736, "loss": 0.3406, "lr": 4.920152136576706e-06, "epoch": 1.3793103448275863, "percentage": 17.28, "elapsed_time": "0:27:44", "remaining_time": "2:12:45"} | |
| {"current_steps": 310, "total_steps": 1736, "loss": 0.3643, "lr": 4.9070571440492435e-06, "epoch": 1.4252873563218391, "percentage": 17.86, "elapsed_time": "0:28:38", "remaining_time": "2:11:45"} | |
| {"current_steps": 320, "total_steps": 1736, "loss": 0.3434, "lr": 4.892988486772756e-06, "epoch": 1.471264367816092, "percentage": 18.43, "elapsed_time": "0:29:33", "remaining_time": "2:10:45"} | |
| {"current_steps": 330, "total_steps": 1736, "loss": 0.3322, "lr": 4.877951855578342e-06, "epoch": 1.5172413793103448, "percentage": 19.01, "elapsed_time": "0:30:30", "remaining_time": "2:09:59"} | |
| {"current_steps": 340, "total_steps": 1736, "loss": 0.3529, "lr": 4.86195333284663e-06, "epoch": 1.5632183908045976, "percentage": 19.59, "elapsed_time": "0:31:28", "remaining_time": "2:09:13"} | |
| {"current_steps": 350, "total_steps": 1736, "loss": 0.3547, "lr": 4.844999390047419e-06, "epoch": 1.6091954022988506, "percentage": 20.16, "elapsed_time": "0:32:25", "remaining_time": "2:08:23"} | |
| {"current_steps": 360, "total_steps": 1736, "loss": 0.3408, "lr": 4.827096885121954e-06, "epoch": 1.6551724137931034, "percentage": 20.74, "elapsed_time": "0:33:19", "remaining_time": "2:07:23"} | |
| {"current_steps": 370, "total_steps": 1736, "loss": 0.3506, "lr": 4.808253059708849e-06, "epoch": 1.7011494252873565, "percentage": 21.31, "elapsed_time": "0:34:11", "remaining_time": "2:06:13"} | |
| {"current_steps": 380, "total_steps": 1736, "loss": 0.3368, "lr": 4.788475536214822e-06, "epoch": 1.7471264367816093, "percentage": 21.89, "elapsed_time": "0:35:08", "remaining_time": "2:05:25"} | |
| {"current_steps": 390, "total_steps": 1736, "loss": 0.3424, "lr": 4.767772314731394e-06, "epoch": 1.793103448275862, "percentage": 22.47, "elapsed_time": "0:36:04", "remaining_time": "2:04:31"} | |
| {"current_steps": 400, "total_steps": 1736, "loss": 0.3549, "lr": 4.746151769798818e-06, "epoch": 1.839080459770115, "percentage": 23.04, "elapsed_time": "0:36:57", "remaining_time": "2:03:27"} | |
| {"current_steps": 410, "total_steps": 1736, "loss": 0.3247, "lr": 4.7236226470185505e-06, "epoch": 1.8850574712643677, "percentage": 23.62, "elapsed_time": "0:37:51", "remaining_time": "2:02:27"} | |
| {"current_steps": 420, "total_steps": 1736, "loss": 0.3373, "lr": 4.700194059515606e-06, "epoch": 1.9310344827586206, "percentage": 24.19, "elapsed_time": "0:38:49", "remaining_time": "2:01:39"} | |
| {"current_steps": 430, "total_steps": 1736, "loss": 0.3393, "lr": 4.67587548425227e-06, "epoch": 1.9770114942528736, "percentage": 24.77, "elapsed_time": "0:39:44", "remaining_time": "2:00:42"} | |
| {"current_steps": 440, "total_steps": 1736, "loss": 0.3095, "lr": 4.650676758194624e-06, "epoch": 2.0229885057471266, "percentage": 25.35, "elapsed_time": "0:40:40", "remaining_time": "1:59:49"} | |
| {"current_steps": 450, "total_steps": 1736, "loss": 0.256, "lr": 4.624608074333448e-06, "epoch": 2.0689655172413794, "percentage": 25.92, "elapsed_time": "0:41:35", "remaining_time": "1:58:52"} | |
| {"current_steps": 460, "total_steps": 1736, "loss": 0.2471, "lr": 4.597679977561122e-06, "epoch": 2.1149425287356323, "percentage": 26.5, "elapsed_time": "0:42:30", "remaining_time": "1:57:55"} | |
| {"current_steps": 470, "total_steps": 1736, "loss": 0.2554, "lr": 4.569903360406163e-06, "epoch": 2.160919540229885, "percentage": 27.07, "elapsed_time": "0:43:28", "remaining_time": "1:57:06"} | |
| {"current_steps": 480, "total_steps": 1736, "loss": 0.2527, "lr": 4.541289458627155e-06, "epoch": 2.206896551724138, "percentage": 27.65, "elapsed_time": "0:44:25", "remaining_time": "1:56:14"} | |
| {"current_steps": 490, "total_steps": 1736, "loss": 0.2504, "lr": 4.511849846667839e-06, "epoch": 2.2528735632183907, "percentage": 28.23, "elapsed_time": "0:45:23", "remaining_time": "1:55:26"} | |
| {"current_steps": 500, "total_steps": 1736, "loss": 0.2608, "lr": 4.481596432975202e-06, "epoch": 2.2988505747126435, "percentage": 28.8, "elapsed_time": "0:46:21", "remaining_time": "1:54:35"} | |
| {"current_steps": 510, "total_steps": 1736, "loss": 0.2606, "lr": 4.4505414551824536e-06, "epoch": 2.344827586206897, "percentage": 29.38, "elapsed_time": "0:47:16", "remaining_time": "1:53:39"} | |
| {"current_steps": 520, "total_steps": 1736, "loss": 0.2677, "lr": 4.418697475158861e-06, "epoch": 2.3908045977011496, "percentage": 29.95, "elapsed_time": "0:48:06", "remaining_time": "1:52:28"} | |
| {"current_steps": 530, "total_steps": 1736, "loss": 0.2628, "lr": 4.386077373928413e-06, "epoch": 2.4367816091954024, "percentage": 30.53, "elapsed_time": "0:49:02", "remaining_time": "1:51:35"} | |
| {"current_steps": 540, "total_steps": 1736, "loss": 0.2739, "lr": 4.352694346459397e-06, "epoch": 2.4827586206896552, "percentage": 31.11, "elapsed_time": "0:50:00", "remaining_time": "1:50:46"} | |
| {"current_steps": 550, "total_steps": 1736, "loss": 0.2587, "lr": 4.318561896326973e-06, "epoch": 2.528735632183908, "percentage": 31.68, "elapsed_time": "0:50:54", "remaining_time": "1:49:46"} | |
| {"current_steps": 560, "total_steps": 1736, "loss": 0.271, "lr": 4.283693830250926e-06, "epoch": 2.574712643678161, "percentage": 32.26, "elapsed_time": "0:51:52", "remaining_time": "1:48:55"} | |
| {"current_steps": 570, "total_steps": 1736, "loss": 0.2596, "lr": 4.248104252510786e-06, "epoch": 2.6206896551724137, "percentage": 32.83, "elapsed_time": "0:52:47", "remaining_time": "1:47:58"} | |
| {"current_steps": 580, "total_steps": 1736, "loss": 0.2607, "lr": 4.211807559240588e-06, "epoch": 2.6666666666666665, "percentage": 33.41, "elapsed_time": "0:53:40", "remaining_time": "1:46:58"} | |
| {"current_steps": 590, "total_steps": 1736, "loss": 0.2714, "lr": 4.174818432605579e-06, "epoch": 2.7126436781609193, "percentage": 33.99, "elapsed_time": "0:54:34", "remaining_time": "1:46:00"} | |
| {"current_steps": 600, "total_steps": 1736, "loss": 0.267, "lr": 4.137151834863213e-06, "epoch": 2.7586206896551726, "percentage": 34.56, "elapsed_time": "0:55:29", "remaining_time": "1:45:04"} | |
| {"current_steps": 610, "total_steps": 1736, "loss": 0.2637, "lr": 4.098823002310864e-06, "epoch": 2.8045977011494254, "percentage": 35.14, "elapsed_time": "0:56:26", "remaining_time": "1:44:11"} | |
| {"current_steps": 620, "total_steps": 1736, "loss": 0.2591, "lr": 4.059847439122672e-06, "epoch": 2.8505747126436782, "percentage": 35.71, "elapsed_time": "0:57:20", "remaining_time": "1:43:12"} | |
| {"current_steps": 630, "total_steps": 1736, "loss": 0.2597, "lr": 4.020240911078041e-06, "epoch": 2.896551724137931, "percentage": 36.29, "elapsed_time": "0:58:14", "remaining_time": "1:42:14"} | |
| {"current_steps": 640, "total_steps": 1736, "loss": 0.2694, "lr": 3.98001943918432e-06, "epoch": 2.942528735632184, "percentage": 36.87, "elapsed_time": "0:59:07", "remaining_time": "1:41:15"} | |
| {"current_steps": 650, "total_steps": 1736, "loss": 0.2704, "lr": 3.939199293196231e-06, "epoch": 2.9885057471264367, "percentage": 37.44, "elapsed_time": "1:00:04", "remaining_time": "1:40:21"} | |
| {"current_steps": 660, "total_steps": 1736, "loss": 0.1997, "lr": 3.897796985034687e-06, "epoch": 3.0344827586206895, "percentage": 38.02, "elapsed_time": "1:00:58", "remaining_time": "1:39:24"} | |
| {"current_steps": 670, "total_steps": 1736, "loss": 0.1716, "lr": 3.855829262107653e-06, "epoch": 3.0804597701149423, "percentage": 38.59, "elapsed_time": "1:01:53", "remaining_time": "1:38:28"} | |
| {"current_steps": 680, "total_steps": 1736, "loss": 0.1803, "lr": 3.813313100535747e-06, "epoch": 3.1264367816091956, "percentage": 39.17, "elapsed_time": "1:02:53", "remaining_time": "1:37:40"} | |
| {"current_steps": 690, "total_steps": 1736, "loss": 0.1754, "lr": 3.770265698285328e-06, "epoch": 3.1724137931034484, "percentage": 39.75, "elapsed_time": "1:03:51", "remaining_time": "1:36:48"} | |
| {"current_steps": 700, "total_steps": 1736, "loss": 0.1835, "lr": 3.726704468211844e-06, "epoch": 3.218390804597701, "percentage": 40.32, "elapsed_time": "1:04:46", "remaining_time": "1:35:52"} | |
| {"current_steps": 710, "total_steps": 1736, "loss": 0.1792, "lr": 3.6826470310162645e-06, "epoch": 3.264367816091954, "percentage": 40.9, "elapsed_time": "1:05:41", "remaining_time": "1:34:55"} | |
| {"current_steps": 720, "total_steps": 1736, "loss": 0.1765, "lr": 3.6381112081174254e-06, "epoch": 3.310344827586207, "percentage": 41.47, "elapsed_time": "1:06:33", "remaining_time": "1:33:55"} | |
| {"current_steps": 730, "total_steps": 1736, "loss": 0.1817, "lr": 3.593115014443195e-06, "epoch": 3.3563218390804597, "percentage": 42.05, "elapsed_time": "1:07:29", "remaining_time": "1:33:00"} | |
| {"current_steps": 740, "total_steps": 1736, "loss": 0.1849, "lr": 3.547676651143361e-06, "epoch": 3.4022988505747125, "percentage": 42.63, "elapsed_time": "1:08:25", "remaining_time": "1:32:05"} | |
| {"current_steps": 750, "total_steps": 1736, "loss": 0.1769, "lr": 3.5018144982271814e-06, "epoch": 3.4482758620689653, "percentage": 43.2, "elapsed_time": "1:09:17", "remaining_time": "1:31:06"} | |
| {"current_steps": 760, "total_steps": 1736, "loss": 0.1848, "lr": 3.455547107128602e-06, "epoch": 3.4942528735632186, "percentage": 43.78, "elapsed_time": "1:10:16", "remaining_time": "1:30:14"} | |
| {"current_steps": 770, "total_steps": 1736, "loss": 0.1892, "lr": 3.4088931932021193e-06, "epoch": 3.5402298850574714, "percentage": 44.35, "elapsed_time": "1:11:09", "remaining_time": "1:29:16"} | |
| {"current_steps": 780, "total_steps": 1736, "loss": 0.1807, "lr": 3.3618716281523384e-06, "epoch": 3.586206896551724, "percentage": 44.93, "elapsed_time": "1:12:04", "remaining_time": "1:28:19"} | |
| {"current_steps": 790, "total_steps": 1736, "loss": 0.1852, "lr": 3.3145014324002945e-06, "epoch": 3.632183908045977, "percentage": 45.51, "elapsed_time": "1:12:57", "remaining_time": "1:27:22"} | |
| {"current_steps": 800, "total_steps": 1736, "loss": 0.1885, "lr": 3.266801767389608e-06, "epoch": 3.67816091954023, "percentage": 46.08, "elapsed_time": "1:13:53", "remaining_time": "1:26:27"} | |
| {"current_steps": 810, "total_steps": 1736, "loss": 0.1835, "lr": 3.2187919278356027e-06, "epoch": 3.7241379310344827, "percentage": 46.66, "elapsed_time": "1:14:50", "remaining_time": "1:25:33"} | |
| {"current_steps": 820, "total_steps": 1736, "loss": 0.1863, "lr": 3.1704913339205107e-06, "epoch": 3.7701149425287355, "percentage": 47.24, "elapsed_time": "1:15:47", "remaining_time": "1:24:39"} | |
| {"current_steps": 830, "total_steps": 1736, "loss": 0.1921, "lr": 3.121919523437927e-06, "epoch": 3.8160919540229887, "percentage": 47.81, "elapsed_time": "1:16:47", "remaining_time": "1:23:49"} | |
| {"current_steps": 840, "total_steps": 1736, "loss": 0.1868, "lr": 3.073096143889689e-06, "epoch": 3.862068965517241, "percentage": 48.39, "elapsed_time": "1:17:42", "remaining_time": "1:22:52"} | |
| {"current_steps": 850, "total_steps": 1736, "loss": 0.1855, "lr": 3.0240409445383835e-06, "epoch": 3.9080459770114944, "percentage": 48.96, "elapsed_time": "1:18:35", "remaining_time": "1:21:55"} | |
| {"current_steps": 860, "total_steps": 1736, "loss": 0.1791, "lr": 2.97477376841868e-06, "epoch": 3.954022988505747, "percentage": 49.54, "elapsed_time": "1:19:30", "remaining_time": "1:20:59"} | |
| {"current_steps": 870, "total_steps": 1736, "loss": 0.1756, "lr": 2.9253145443107455e-06, "epoch": 4.0, "percentage": 50.12, "elapsed_time": "1:20:27", "remaining_time": "1:20:05"} | |
| {"current_steps": 880, "total_steps": 1736, "loss": 0.1081, "lr": 2.8756832786789667e-06, "epoch": 4.045977011494253, "percentage": 50.69, "elapsed_time": "1:21:24", "remaining_time": "1:19:11"} | |
| {"current_steps": 890, "total_steps": 1736, "loss": 0.1052, "lr": 2.825900047579251e-06, "epoch": 4.091954022988506, "percentage": 51.27, "elapsed_time": "1:22:21", "remaining_time": "1:18:17"} | |
| {"current_steps": 900, "total_steps": 1736, "loss": 0.1032, "lr": 2.775984988538175e-06, "epoch": 4.137931034482759, "percentage": 51.84, "elapsed_time": "1:23:19", "remaining_time": "1:17:23"} | |
| {"current_steps": 910, "total_steps": 1736, "loss": 0.1049, "lr": 2.725958292407276e-06, "epoch": 4.183908045977011, "percentage": 52.42, "elapsed_time": "1:24:15", "remaining_time": "1:16:28"} | |
| {"current_steps": 920, "total_steps": 1736, "loss": 0.1051, "lr": 2.6758401951957625e-06, "epoch": 4.2298850574712645, "percentage": 53.0, "elapsed_time": "1:25:09", "remaining_time": "1:15:31"} | |
| {"current_steps": 930, "total_steps": 1736, "loss": 0.1071, "lr": 2.6256509698849652e-06, "epoch": 4.275862068965517, "percentage": 53.57, "elapsed_time": "1:26:06", "remaining_time": "1:14:37"} | |
| {"current_steps": 940, "total_steps": 1736, "loss": 0.1077, "lr": 2.5754109182278298e-06, "epoch": 4.32183908045977, "percentage": 54.15, "elapsed_time": "1:27:02", "remaining_time": "1:13:42"} | |
| {"current_steps": 950, "total_steps": 1736, "loss": 0.1085, "lr": 2.525140362536775e-06, "epoch": 4.3678160919540225, "percentage": 54.72, "elapsed_time": "1:27:56", "remaining_time": "1:12:45"} | |
| {"current_steps": 960, "total_steps": 1736, "loss": 0.1082, "lr": 2.474859637463226e-06, "epoch": 4.413793103448276, "percentage": 55.3, "elapsed_time": "1:28:48", "remaining_time": "1:11:47"} | |
| {"current_steps": 970, "total_steps": 1736, "loss": 0.1102, "lr": 2.42458908177217e-06, "epoch": 4.459770114942529, "percentage": 55.88, "elapsed_time": "1:29:39", "remaining_time": "1:10:48"} | |
| {"current_steps": 980, "total_steps": 1736, "loss": 0.1105, "lr": 2.374349030115036e-06, "epoch": 4.505747126436781, "percentage": 56.45, "elapsed_time": "1:30:37", "remaining_time": "1:09:54"} | |
| {"current_steps": 990, "total_steps": 1736, "loss": 0.1082, "lr": 2.3241598048042383e-06, "epoch": 4.551724137931035, "percentage": 57.03, "elapsed_time": "1:31:35", "remaining_time": "1:09:01"} | |
| {"current_steps": 1000, "total_steps": 1736, "loss": 0.109, "lr": 2.2740417075927244e-06, "epoch": 4.597701149425287, "percentage": 57.6, "elapsed_time": "1:32:26", "remaining_time": "1:08:02"} | |
| {"current_steps": 1010, "total_steps": 1736, "loss": 0.1068, "lr": 2.2240150114618262e-06, "epoch": 4.64367816091954, "percentage": 58.18, "elapsed_time": "1:33:26", "remaining_time": "1:07:09"} | |
| {"current_steps": 1020, "total_steps": 1736, "loss": 0.1058, "lr": 2.17409995242075e-06, "epoch": 4.689655172413794, "percentage": 58.76, "elapsed_time": "1:34:23", "remaining_time": "1:06:15"} | |
| {"current_steps": 1030, "total_steps": 1736, "loss": 0.1015, "lr": 2.1243167213210337e-06, "epoch": 4.735632183908046, "percentage": 59.33, "elapsed_time": "1:35:14", "remaining_time": "1:05:16"} | |
| {"current_steps": 1040, "total_steps": 1736, "loss": 0.1032, "lr": 2.0746854556892545e-06, "epoch": 4.781609195402299, "percentage": 59.91, "elapsed_time": "1:36:09", "remaining_time": "1:04:20"} | |
| {"current_steps": 1050, "total_steps": 1736, "loss": 0.1033, "lr": 2.0252262315813213e-06, "epoch": 4.827586206896552, "percentage": 60.48, "elapsed_time": "1:37:08", "remaining_time": "1:03:27"} | |
| {"current_steps": 1060, "total_steps": 1736, "loss": 0.1075, "lr": 1.9759590554616177e-06, "epoch": 4.873563218390805, "percentage": 61.06, "elapsed_time": "1:38:05", "remaining_time": "1:02:33"} | |
| {"current_steps": 1070, "total_steps": 1736, "loss": 0.1075, "lr": 1.9269038561103114e-06, "epoch": 4.919540229885057, "percentage": 61.64, "elapsed_time": "1:39:01", "remaining_time": "1:01:38"} | |
| {"current_steps": 1080, "total_steps": 1736, "loss": 0.1033, "lr": 1.8780804765620747e-06, "epoch": 4.9655172413793105, "percentage": 62.21, "elapsed_time": "1:39:59", "remaining_time": "1:00:44"} | |
| {"current_steps": 1090, "total_steps": 1736, "loss": 0.0939, "lr": 1.8295086660794903e-06, "epoch": 5.011494252873563, "percentage": 62.79, "elapsed_time": "1:40:48", "remaining_time": "0:59:44"} | |
| {"current_steps": 1100, "total_steps": 1736, "loss": 0.0524, "lr": 1.7812080721643977e-06, "epoch": 5.057471264367816, "percentage": 63.36, "elapsed_time": "1:41:44", "remaining_time": "0:58:49"} | |
| {"current_steps": 1110, "total_steps": 1736, "loss": 0.0531, "lr": 1.7331982326103922e-06, "epoch": 5.103448275862069, "percentage": 63.94, "elapsed_time": "1:42:39", "remaining_time": "0:57:53"} | |
| {"current_steps": 1120, "total_steps": 1736, "loss": 0.0518, "lr": 1.6854985675997065e-06, "epoch": 5.149425287356322, "percentage": 64.52, "elapsed_time": "1:43:35", "remaining_time": "0:56:58"} | |
| {"current_steps": 1130, "total_steps": 1736, "loss": 0.0521, "lr": 1.6381283718476622e-06, "epoch": 5.195402298850575, "percentage": 65.09, "elapsed_time": "1:44:26", "remaining_time": "0:56:00"} | |
| {"current_steps": 1140, "total_steps": 1736, "loss": 0.0534, "lr": 1.591106806797882e-06, "epoch": 5.241379310344827, "percentage": 65.67, "elapsed_time": "1:45:20", "remaining_time": "0:55:04"} | |
| {"current_steps": 1150, "total_steps": 1736, "loss": 0.0529, "lr": 1.5444528928713987e-06, "epoch": 5.287356321839081, "percentage": 66.24, "elapsed_time": "1:46:16", "remaining_time": "0:54:09"} | |
| {"current_steps": 1160, "total_steps": 1736, "loss": 0.054, "lr": 1.4981855017728197e-06, "epoch": 5.333333333333333, "percentage": 66.82, "elapsed_time": "1:47:13", "remaining_time": "0:53:14"} | |
| {"current_steps": 1170, "total_steps": 1736, "loss": 0.0583, "lr": 1.4523233488566394e-06, "epoch": 5.379310344827586, "percentage": 67.4, "elapsed_time": "1:48:10", "remaining_time": "0:52:19"} | |
| {"current_steps": 1180, "total_steps": 1736, "loss": 0.0513, "lr": 1.4068849855568042e-06, "epoch": 5.425287356321839, "percentage": 67.97, "elapsed_time": "1:49:04", "remaining_time": "0:51:23"} | |
| {"current_steps": 1190, "total_steps": 1736, "loss": 0.0547, "lr": 1.3618887918825752e-06, "epoch": 5.471264367816092, "percentage": 68.55, "elapsed_time": "1:49:57", "remaining_time": "0:50:26"} | |
| {"current_steps": 1200, "total_steps": 1736, "loss": 0.0543, "lr": 1.3173529689837355e-06, "epoch": 5.517241379310345, "percentage": 69.12, "elapsed_time": "1:50:47", "remaining_time": "0:49:29"} | |
| {"current_steps": 1210, "total_steps": 1736, "loss": 0.0544, "lr": 1.2732955317881563e-06, "epoch": 5.563218390804598, "percentage": 69.7, "elapsed_time": "1:51:45", "remaining_time": "0:48:34"} | |
| {"current_steps": 1220, "total_steps": 1736, "loss": 0.0529, "lr": 1.2297343017146727e-06, "epoch": 5.609195402298851, "percentage": 70.28, "elapsed_time": "1:52:42", "remaining_time": "0:47:40"} | |
| {"current_steps": 1230, "total_steps": 1736, "loss": 0.0514, "lr": 1.1866868994642535e-06, "epoch": 5.655172413793103, "percentage": 70.85, "elapsed_time": "1:53:39", "remaining_time": "0:46:45"} | |
| {"current_steps": 1240, "total_steps": 1736, "loss": 0.0529, "lr": 1.1441707378923475e-06, "epoch": 5.7011494252873565, "percentage": 71.43, "elapsed_time": "1:54:35", "remaining_time": "0:45:50"} | |
| {"current_steps": 1250, "total_steps": 1736, "loss": 0.0524, "lr": 1.1022030149653134e-06, "epoch": 5.747126436781609, "percentage": 72.0, "elapsed_time": "1:55:31", "remaining_time": "0:44:55"} | |
| {"current_steps": 1260, "total_steps": 1736, "loss": 0.0528, "lr": 1.0608007068037702e-06, "epoch": 5.793103448275862, "percentage": 72.58, "elapsed_time": "1:56:28", "remaining_time": "0:44:00"} | |
| {"current_steps": 1270, "total_steps": 1736, "loss": 0.0504, "lr": 1.0199805608156802e-06, "epoch": 5.8390804597701145, "percentage": 73.16, "elapsed_time": "1:57:19", "remaining_time": "0:43:03"} | |
| {"current_steps": 1280, "total_steps": 1736, "loss": 0.0524, "lr": 9.79759088921959e-07, "epoch": 5.885057471264368, "percentage": 73.73, "elapsed_time": "1:58:15", "remaining_time": "0:42:07"} | |
| {"current_steps": 1290, "total_steps": 1736, "loss": 0.0453, "lr": 9.401525608773293e-07, "epoch": 5.931034482758621, "percentage": 74.31, "elapsed_time": "1:59:08", "remaining_time": "0:41:11"} | |
| {"current_steps": 1300, "total_steps": 1736, "loss": 0.0483, "lr": 9.011769976891368e-07, "epoch": 5.977011494252873, "percentage": 74.88, "elapsed_time": "2:00:06", "remaining_time": "0:40:16"} | |
| {"current_steps": 1310, "total_steps": 1736, "loss": 0.0373, "lr": 8.628481651367876e-07, "epoch": 6.022988505747127, "percentage": 75.46, "elapsed_time": "2:01:07", "remaining_time": "0:39:23"} | |
| {"current_steps": 1320, "total_steps": 1736, "loss": 0.0257, "lr": 8.25181567394422e-07, "epoch": 6.068965517241379, "percentage": 76.04, "elapsed_time": "2:02:06", "remaining_time": "0:38:28"} | |
| {"current_steps": 1330, "total_steps": 1736, "loss": 0.0243, "lr": 7.88192440759413e-07, "epoch": 6.114942528735632, "percentage": 76.61, "elapsed_time": "2:03:01", "remaining_time": "0:37:33"} | |
| {"current_steps": 1340, "total_steps": 1736, "loss": 0.0244, "lr": 7.51895747489215e-07, "epoch": 6.160919540229885, "percentage": 77.19, "elapsed_time": "2:03:57", "remaining_time": "0:36:37"} | |
| {"current_steps": 1350, "total_steps": 1736, "loss": 0.0253, "lr": 7.163061697490742e-07, "epoch": 6.206896551724138, "percentage": 77.76, "elapsed_time": "2:04:57", "remaining_time": "0:35:43"} | |
| {"current_steps": 1360, "total_steps": 1736, "loss": 0.0237, "lr": 6.814381036730275e-07, "epoch": 6.252873563218391, "percentage": 78.34, "elapsed_time": "2:05:54", "remaining_time": "0:34:48"} | |
| {"current_steps": 1370, "total_steps": 1736, "loss": 0.0247, "lr": 6.473056535406036e-07, "epoch": 6.2988505747126435, "percentage": 78.92, "elapsed_time": "2:06:48", "remaining_time": "0:33:52"} | |
| {"current_steps": 1380, "total_steps": 1736, "loss": 0.0232, "lr": 6.139226260715872e-07, "epoch": 6.344827586206897, "percentage": 79.49, "elapsed_time": "2:07:42", "remaining_time": "0:32:56"} | |
| {"current_steps": 1390, "total_steps": 1736, "loss": 0.0209, "lr": 5.813025248411397e-07, "epoch": 6.390804597701149, "percentage": 80.07, "elapsed_time": "2:08:31", "remaining_time": "0:31:59"} | |
| {"current_steps": 1400, "total_steps": 1736, "loss": 0.0248, "lr": 5.494585448175474e-07, "epoch": 6.436781609195402, "percentage": 80.65, "elapsed_time": "2:09:28", "remaining_time": "0:31:04"} | |
| {"current_steps": 1410, "total_steps": 1736, "loss": 0.0251, "lr": 5.184035670247989e-07, "epoch": 6.482758620689655, "percentage": 81.22, "elapsed_time": "2:10:25", "remaining_time": "0:30:09"} | |
| {"current_steps": 1420, "total_steps": 1736, "loss": 0.0238, "lr": 4.881501533321605e-07, "epoch": 6.528735632183908, "percentage": 81.8, "elapsed_time": "2:11:18", "remaining_time": "0:29:13"} | |
| {"current_steps": 1430, "total_steps": 1736, "loss": 0.0227, "lr": 4.587105413728457e-07, "epoch": 6.574712643678161, "percentage": 82.37, "elapsed_time": "2:12:16", "remaining_time": "0:28:18"} | |
| {"current_steps": 1440, "total_steps": 1736, "loss": 0.0229, "lr": 4.3009663959383776e-07, "epoch": 6.620689655172414, "percentage": 82.95, "elapsed_time": "2:13:14", "remaining_time": "0:27:23"} | |
| {"current_steps": 1450, "total_steps": 1736, "loss": 0.0244, "lr": 4.0232002243887873e-07, "epoch": 6.666666666666667, "percentage": 83.53, "elapsed_time": "2:14:09", "remaining_time": "0:26:27"} | |
| {"current_steps": 1460, "total_steps": 1736, "loss": 0.0234, "lr": 3.7539192566655254e-07, "epoch": 6.712643678160919, "percentage": 84.1, "elapsed_time": "2:15:03", "remaining_time": "0:25:31"} | |
| {"current_steps": 1470, "total_steps": 1736, "loss": 0.0223, "lr": 3.493232418053774e-07, "epoch": 6.758620689655173, "percentage": 84.68, "elapsed_time": "2:15:58", "remaining_time": "0:24:36"} | |
| {"current_steps": 1480, "total_steps": 1736, "loss": 0.0218, "lr": 3.24124515747731e-07, "epoch": 6.804597701149425, "percentage": 85.25, "elapsed_time": "2:16:56", "remaining_time": "0:23:41"} | |
| {"current_steps": 1490, "total_steps": 1736, "loss": 0.0226, "lr": 2.9980594048439477e-07, "epoch": 6.850574712643678, "percentage": 85.83, "elapsed_time": "2:17:46", "remaining_time": "0:22:44"} | |
| {"current_steps": 1500, "total_steps": 1736, "loss": 0.0212, "lr": 2.7637735298145064e-07, "epoch": 6.896551724137931, "percentage": 86.41, "elapsed_time": "2:18:39", "remaining_time": "0:21:48"} | |
| {"current_steps": 1510, "total_steps": 1736, "loss": 0.0225, "lr": 2.538482302011822e-07, "epoch": 6.942528735632184, "percentage": 86.98, "elapsed_time": "2:19:32", "remaining_time": "0:20:53"} | |
| {"current_steps": 1520, "total_steps": 1736, "loss": 0.0227, "lr": 2.3222768526860701e-07, "epoch": 6.988505747126437, "percentage": 87.56, "elapsed_time": "2:20:28", "remaining_time": "0:19:57"} | |
| {"current_steps": 1530, "total_steps": 1736, "loss": 0.0151, "lr": 2.115244637851782e-07, "epoch": 7.0344827586206895, "percentage": 88.13, "elapsed_time": "2:21:27", "remaining_time": "0:19:02"} | |
| {"current_steps": 1540, "total_steps": 1736, "loss": 0.0123, "lr": 1.9174694029115148e-07, "epoch": 7.080459770114943, "percentage": 88.71, "elapsed_time": "2:22:23", "remaining_time": "0:18:07"} | |
| {"current_steps": 1550, "total_steps": 1736, "loss": 0.0126, "lr": 1.7290311487804689e-07, "epoch": 7.126436781609195, "percentage": 89.29, "elapsed_time": "2:23:17", "remaining_time": "0:17:11"} | |
| {"current_steps": 1560, "total_steps": 1736, "loss": 0.0128, "lr": 1.5500060995258136e-07, "epoch": 7.172413793103448, "percentage": 89.86, "elapsed_time": "2:24:17", "remaining_time": "0:16:16"} | |
| {"current_steps": 1570, "total_steps": 1736, "loss": 0.0122, "lr": 1.3804666715337117e-07, "epoch": 7.218390804597701, "percentage": 90.44, "elapsed_time": "2:25:13", "remaining_time": "0:15:21"} | |
| {"current_steps": 1580, "total_steps": 1736, "loss": 0.0124, "lr": 1.2204814442165814e-07, "epoch": 7.264367816091954, "percentage": 91.01, "elapsed_time": "2:26:07", "remaining_time": "0:14:25"} | |
| {"current_steps": 1590, "total_steps": 1736, "loss": 0.0117, "lr": 1.0701151322724451e-07, "epoch": 7.310344827586207, "percentage": 91.59, "elapsed_time": "2:27:01", "remaining_time": "0:13:30"} | |
| {"current_steps": 1600, "total_steps": 1736, "loss": 0.0122, "lr": 9.294285595075669e-08, "epoch": 7.35632183908046, "percentage": 92.17, "elapsed_time": "2:27:53", "remaining_time": "0:12:34"} | |
| {"current_steps": 1610, "total_steps": 1736, "loss": 0.0133, "lr": 7.984786342329493e-08, "epoch": 7.402298850574713, "percentage": 92.74, "elapsed_time": "2:28:50", "remaining_time": "0:11:38"} | |
| {"current_steps": 1620, "total_steps": 1736, "loss": 0.0112, "lr": 6.773183262446914e-08, "epoch": 7.448275862068965, "percentage": 93.32, "elapsed_time": "2:29:42", "remaining_time": "0:10:43"} | |
| {"current_steps": 1630, "total_steps": 1736, "loss": 0.0139, "lr": 5.65996645397493e-08, "epoch": 7.494252873563219, "percentage": 93.89, "elapsed_time": "2:30:42", "remaining_time": "0:09:48"} | |
| {"current_steps": 1640, "total_steps": 1736, "loss": 0.0131, "lr": 4.645586217799453e-08, "epoch": 7.540229885057471, "percentage": 94.47, "elapsed_time": "2:31:45", "remaining_time": "0:08:52"} | |
| {"current_steps": 1650, "total_steps": 1736, "loss": 0.0124, "lr": 3.730452874996737e-08, "epoch": 7.586206896551724, "percentage": 95.05, "elapsed_time": "2:32:40", "remaining_time": "0:07:57"} | |
| {"current_steps": 1660, "total_steps": 1736, "loss": 0.0112, "lr": 2.914936600856899e-08, "epoch": 7.6321839080459775, "percentage": 95.62, "elapsed_time": "2:33:30", "remaining_time": "0:07:01"} | |
| {"current_steps": 1670, "total_steps": 1736, "loss": 0.0112, "lr": 2.199367275146358e-08, "epoch": 7.67816091954023, "percentage": 96.2, "elapsed_time": "2:34:28", "remaining_time": "0:06:06"} | |
| {"current_steps": 1680, "total_steps": 1736, "loss": 0.0133, "lr": 1.5840343486700216e-08, "epoch": 7.724137931034483, "percentage": 96.77, "elapsed_time": "2:35:22", "remaining_time": "0:05:10"} | |
| {"current_steps": 1690, "total_steps": 1736, "loss": 0.0116, "lr": 1.0691867261874155e-08, "epoch": 7.7701149425287355, "percentage": 97.35, "elapsed_time": "2:36:17", "remaining_time": "0:04:15"} | |
| {"current_steps": 1700, "total_steps": 1736, "loss": 0.0111, "lr": 6.550326657293882e-09, "epoch": 7.816091954022989, "percentage": 97.93, "elapsed_time": "2:37:11", "remaining_time": "0:03:19"} | |
| {"current_steps": 1710, "total_steps": 1736, "loss": 0.0121, "lr": 3.4173969435710717e-09, "epoch": 7.862068965517241, "percentage": 98.5, "elapsed_time": "2:38:03", "remaining_time": "0:02:24"} | |
| {"current_steps": 1720, "total_steps": 1736, "loss": 0.0122, "lr": 1.2943454039654467e-09, "epoch": 7.908045977011494, "percentage": 99.08, "elapsed_time": "2:39:01", "remaining_time": "0:01:28"} | |
| {"current_steps": 1730, "total_steps": 1736, "loss": 0.0125, "lr": 1.8203082176287967e-10, "epoch": 7.954022988505747, "percentage": 99.65, "elapsed_time": "2:40:00", "remaining_time": "0:00:33"} | |
| {"current_steps": 1736, "total_steps": 1736, "epoch": 7.9816091954022985, "percentage": 100.0, "elapsed_time": "2:41:28", "remaining_time": "0:00:00"} | |