Text Generation
Transformers
Safetensors
murzik
feature-extraction
nullxes
causal-lm
custom_code
multilingual
conversational
Instructions to use MagistrTheOne/murzik-15b-init with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use MagistrTheOne/murzik-15b-init with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="MagistrTheOne/murzik-15b-init", trust_remote_code=True) messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("MagistrTheOne/murzik-15b-init", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use MagistrTheOne/murzik-15b-init with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "MagistrTheOne/murzik-15b-init" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "MagistrTheOne/murzik-15b-init", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/MagistrTheOne/murzik-15b-init
- SGLang
How to use MagistrTheOne/murzik-15b-init with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "MagistrTheOne/murzik-15b-init" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "MagistrTheOne/murzik-15b-init", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "MagistrTheOne/murzik-15b-init" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "MagistrTheOne/murzik-15b-init", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use MagistrTheOne/murzik-15b-init with Docker Model Runner:
docker model run hf.co/MagistrTheOne/murzik-15b-init
| {"current_steps": 10, "total_steps": 1500, "loss": 101.5380615234375, "lr": 1.8e-05, "epoch": 0.0024922118380062306, "percentage": 0.67, "elapsed_time": "0:00:27", "remaining_time": "1:07:44"} | |
| {"current_steps": 20, "total_steps": 1500, "loss": 94.726171875, "lr": 3.8e-05, "epoch": 0.004984423676012461, "percentage": 1.33, "elapsed_time": "0:00:53", "remaining_time": "1:06:28"} | |
| {"current_steps": 30, "total_steps": 1500, "loss": 89.19619140625, "lr": 5.8e-05, "epoch": 0.007476635514018692, "percentage": 2.0, "elapsed_time": "0:01:16", "remaining_time": "1:02:40"} | |
| {"current_steps": 40, "total_steps": 1500, "loss": 92.52296142578125, "lr": 7.800000000000001e-05, "epoch": 0.009968847352024923, "percentage": 2.67, "elapsed_time": "0:01:42", "remaining_time": "1:02:26"} | |
| {"current_steps": 50, "total_steps": 1500, "loss": 86.34298095703124, "lr": 9.8e-05, "epoch": 0.012461059190031152, "percentage": 3.33, "elapsed_time": "0:02:05", "remaining_time": "1:00:32"} | |
| {"current_steps": 60, "total_steps": 1500, "loss": 75.817529296875, "lr": 9.999049449909854e-05, "epoch": 0.014953271028037384, "percentage": 4.0, "elapsed_time": "0:02:32", "remaining_time": "1:00:57"} | |
| {"current_steps": 70, "total_steps": 1500, "loss": 70.09442138671875, "lr": 9.995764061750087e-05, "epoch": 0.017445482866043614, "percentage": 4.67, "elapsed_time": "0:02:58", "remaining_time": "1:00:56"} | |
| {"current_steps": 80, "total_steps": 1500, "loss": 70.83968505859374, "lr": 9.990133642141359e-05, "epoch": 0.019937694704049845, "percentage": 5.33, "elapsed_time": "0:03:26", "remaining_time": "1:01:00"} | |
| {"current_steps": 90, "total_steps": 1500, "loss": 68.97255859375, "lr": 9.982160834024952e-05, "epoch": 0.022429906542056073, "percentage": 6.0, "elapsed_time": "0:03:53", "remaining_time": "1:00:56"} | |
| {"current_steps": 100, "total_steps": 1500, "loss": 67.46973266601563, "lr": 9.971849379868592e-05, "epoch": 0.024922118380062305, "percentage": 6.67, "elapsed_time": "0:04:19", "remaining_time": "1:00:26"} | |
| {"current_steps": 110, "total_steps": 1500, "loss": 66.220703125, "lr": 9.959204119909727e-05, "epoch": 0.027414330218068536, "percentage": 7.33, "elapsed_time": "0:04:47", "remaining_time": "1:00:28"} | |
| {"current_steps": 120, "total_steps": 1500, "loss": 65.1081787109375, "lr": 9.944230989883492e-05, "epoch": 0.029906542056074768, "percentage": 8.0, "elapsed_time": "0:05:12", "remaining_time": "0:59:57"} | |
| {"current_steps": 130, "total_steps": 1500, "loss": 66.480126953125, "lr": 9.926937018236461e-05, "epoch": 0.032398753894081, "percentage": 8.67, "elapsed_time": "0:05:36", "remaining_time": "0:59:08"} | |
| {"current_steps": 140, "total_steps": 1500, "loss": 66.46883544921874, "lr": 9.907330322827462e-05, "epoch": 0.03489096573208723, "percentage": 9.33, "elapsed_time": "0:06:04", "remaining_time": "0:59:05"} | |
| {"current_steps": 150, "total_steps": 1500, "loss": 63.629949951171874, "lr": 9.885420107117021e-05, "epoch": 0.037383177570093455, "percentage": 10.0, "elapsed_time": "0:06:31", "remaining_time": "0:58:42"} | |
| {"current_steps": 160, "total_steps": 1500, "loss": 62.81998291015625, "lr": 9.861216655847225e-05, "epoch": 0.03987538940809969, "percentage": 10.67, "elapsed_time": "0:06:57", "remaining_time": "0:58:16"} | |
| {"current_steps": 170, "total_steps": 1500, "loss": 62.8767333984375, "lr": 9.834731330214017e-05, "epoch": 0.04236760124610592, "percentage": 11.33, "elapsed_time": "0:07:23", "remaining_time": "0:57:52"} | |
| {"current_steps": 180, "total_steps": 1500, "loss": 62.5250244140625, "lr": 9.805976562534215e-05, "epoch": 0.044859813084112146, "percentage": 12.0, "elapsed_time": "0:07:51", "remaining_time": "0:57:40"} | |
| {"current_steps": 190, "total_steps": 1500, "loss": 64.16326904296875, "lr": 9.774965850409721e-05, "epoch": 0.04735202492211838, "percentage": 12.67, "elapsed_time": "0:08:17", "remaining_time": "0:57:12"} | |
| {"current_steps": 200, "total_steps": 1500, "loss": 62.6332275390625, "lr": 9.741713750391703e-05, "epoch": 0.04984423676012461, "percentage": 13.33, "elapsed_time": "0:08:46", "remaining_time": "0:57:04"} | |
| {"current_steps": 210, "total_steps": 1500, "loss": 63.14339599609375, "lr": 9.706235871147689e-05, "epoch": 0.052336448598130844, "percentage": 14.0, "elapsed_time": "0:09:12", "remaining_time": "0:56:34"} | |
| {"current_steps": 220, "total_steps": 1500, "loss": 62.38193359375, "lr": 9.668548866134796e-05, "epoch": 0.05482866043613707, "percentage": 14.67, "elapsed_time": "0:09:37", "remaining_time": "0:56:00"} | |
| {"current_steps": 230, "total_steps": 1500, "loss": 63.40284423828125, "lr": 9.628670425782531e-05, "epoch": 0.0573208722741433, "percentage": 15.33, "elapsed_time": "0:10:04", "remaining_time": "0:55:37"} | |
| {"current_steps": 240, "total_steps": 1500, "loss": 59.16956787109375, "lr": 9.586619269188837e-05, "epoch": 0.059813084112149535, "percentage": 16.0, "elapsed_time": "0:10:28", "remaining_time": "0:55:01"} | |
| {"current_steps": 250, "total_steps": 1500, "loss": 62.044830322265625, "lr": 9.542415135333269e-05, "epoch": 0.06230529595015576, "percentage": 16.67, "elapsed_time": "0:10:54", "remaining_time": "0:54:33"} | |
| {"current_steps": 260, "total_steps": 1500, "loss": 63.55614013671875, "lr": 9.496078773811437e-05, "epoch": 0.064797507788162, "percentage": 17.33, "elapsed_time": "0:11:23", "remaining_time": "0:54:18"} | |
| {"current_steps": 270, "total_steps": 1500, "loss": 64.28612670898437, "lr": 9.447631935095078e-05, "epoch": 0.06728971962616823, "percentage": 18.0, "elapsed_time": "0:11:46", "remaining_time": "0:53:40"} | |
| {"current_steps": 280, "total_steps": 1500, "loss": 63.5065673828125, "lr": 9.397097360322276e-05, "epoch": 0.06978193146417445, "percentage": 18.67, "elapsed_time": "0:12:12", "remaining_time": "0:53:12"} | |
| {"current_steps": 290, "total_steps": 1500, "loss": 61.68922119140625, "lr": 9.344498770622705e-05, "epoch": 0.07227414330218068, "percentage": 19.33, "elapsed_time": "0:12:36", "remaining_time": "0:52:36"} | |
| {"current_steps": 300, "total_steps": 1500, "loss": 62.48082275390625, "lr": 9.289860855982814e-05, "epoch": 0.07476635514018691, "percentage": 20.0, "elapsed_time": "0:13:03", "remaining_time": "0:52:14"} | |
| {"current_steps": 310, "total_steps": 1500, "loss": 63.46171875, "lr": 9.233209263656272e-05, "epoch": 0.07725856697819315, "percentage": 20.67, "elapsed_time": "0:13:27", "remaining_time": "0:51:40"} | |
| {"current_steps": 320, "total_steps": 1500, "loss": 60.84918212890625, "lr": 9.174570586125026e-05, "epoch": 0.07975077881619938, "percentage": 21.33, "elapsed_time": "0:13:52", "remaining_time": "0:51:10"} | |
| {"current_steps": 330, "total_steps": 1500, "loss": 61.82047119140625, "lr": 9.113972348616698e-05, "epoch": 0.08224299065420561, "percentage": 22.0, "elapsed_time": "0:14:13", "remaining_time": "0:50:27"} | |
| {"current_steps": 340, "total_steps": 1500, "loss": 63.42564086914062, "lr": 9.051442996184127e-05, "epoch": 0.08473520249221184, "percentage": 22.67, "elapsed_time": "0:14:38", "remaining_time": "0:49:58"} | |
| {"current_steps": 350, "total_steps": 1500, "loss": 60.22391357421875, "lr": 8.987011880353148e-05, "epoch": 0.08722741433021806, "percentage": 23.33, "elapsed_time": "0:15:05", "remaining_time": "0:49:36"} | |
| {"current_steps": 360, "total_steps": 1500, "loss": 61.663671875, "lr": 8.920709245344879e-05, "epoch": 0.08971962616822429, "percentage": 24.0, "elapsed_time": "0:15:31", "remaining_time": "0:49:09"} | |
| {"current_steps": 370, "total_steps": 1500, "loss": 61.935418701171876, "lr": 8.852566213878947e-05, "epoch": 0.09221183800623053, "percentage": 24.67, "elapsed_time": "0:15:56", "remaining_time": "0:48:42"} | |
| {"current_steps": 380, "total_steps": 1500, "loss": 61.58736572265625, "lr": 8.782614772564379e-05, "epoch": 0.09470404984423676, "percentage": 25.33, "elapsed_time": "0:16:22", "remaining_time": "0:48:14"} | |
| {"current_steps": 390, "total_steps": 1500, "loss": 61.641015625, "lr": 8.710887756884946e-05, "epoch": 0.09719626168224299, "percentage": 26.0, "elapsed_time": "0:16:48", "remaining_time": "0:47:50"} | |
| {"current_steps": 400, "total_steps": 1500, "loss": 61.800238037109374, "lr": 8.637418835786066e-05, "epoch": 0.09968847352024922, "percentage": 26.67, "elapsed_time": "0:17:15", "remaining_time": "0:47:26"} | |
| {"current_steps": 410, "total_steps": 1500, "loss": 61.11771240234375, "lr": 8.562242495870463e-05, "epoch": 0.10218068535825545, "percentage": 27.33, "elapsed_time": "0:17:41", "remaining_time": "0:47:01"} | |
| {"current_steps": 420, "total_steps": 1500, "loss": 61.6244140625, "lr": 8.485394025210016e-05, "epoch": 0.10467289719626169, "percentage": 28.0, "elapsed_time": "0:18:06", "remaining_time": "0:46:32"} | |
| {"current_steps": 430, "total_steps": 1500, "loss": 60.49154052734375, "lr": 8.40690949678141e-05, "epoch": 0.10716510903426792, "percentage": 28.67, "elapsed_time": "0:18:30", "remaining_time": "0:46:03"} | |
| {"current_steps": 440, "total_steps": 1500, "loss": 59.9539306640625, "lr": 8.326825751533322e-05, "epoch": 0.10965732087227414, "percentage": 29.33, "elapsed_time": "0:18:55", "remaining_time": "0:45:35"} | |
| {"current_steps": 450, "total_steps": 1500, "loss": 62.11453857421875, "lr": 8.245180381093151e-05, "epoch": 0.11214953271028037, "percentage": 30.0, "elapsed_time": "0:19:21", "remaining_time": "0:45:11"} | |
| {"current_steps": 460, "total_steps": 1500, "loss": 62.84632568359375, "lr": 8.16201171012134e-05, "epoch": 0.1146417445482866, "percentage": 30.67, "elapsed_time": "0:19:50", "remaining_time": "0:44:50"} | |
| {"current_steps": 470, "total_steps": 1500, "loss": 61.0372802734375, "lr": 8.077358778321646e-05, "epoch": 0.11713395638629283, "percentage": 31.33, "elapsed_time": "0:20:14", "remaining_time": "0:44:21"} | |
| {"current_steps": 480, "total_steps": 1500, "loss": 61.43934936523438, "lr": 7.991261322115737e-05, "epoch": 0.11962616822429907, "percentage": 32.0, "elapsed_time": "0:20:39", "remaining_time": "0:43:54"} | |
| {"current_steps": 490, "total_steps": 1500, "loss": 62.139642333984376, "lr": 7.903759755990763e-05, "epoch": 0.1221183800623053, "percentage": 32.67, "elapsed_time": "0:21:07", "remaining_time": "0:43:32"} | |
| {"current_steps": 500, "total_steps": 1500, "loss": 61.7046875, "lr": 7.814895153528635e-05, "epoch": 0.12461059190031153, "percentage": 33.33, "elapsed_time": "0:21:32", "remaining_time": "0:43:04"} | |
| {"current_steps": 510, "total_steps": 1500, "loss": 61.59208984375, "lr": 7.724709228125923e-05, "epoch": 0.12710280373831775, "percentage": 34.0, "elapsed_time": "0:21:55", "remaining_time": "0:42:33"} | |
| {"current_steps": 520, "total_steps": 1500, "loss": 61.0688720703125, "lr": 7.633244313413416e-05, "epoch": 0.129595015576324, "percentage": 34.67, "elapsed_time": "0:22:23", "remaining_time": "0:42:12"} | |
| {"current_steps": 530, "total_steps": 1500, "loss": 59.56242065429687, "lr": 7.540543343384565e-05, "epoch": 0.1320872274143302, "percentage": 35.33, "elapsed_time": "0:22:49", "remaining_time": "0:41:46"} | |
| {"current_steps": 540, "total_steps": 1500, "loss": 60.75743408203125, "lr": 7.446649832242075e-05, "epoch": 0.13457943925233645, "percentage": 36.0, "elapsed_time": "0:23:11", "remaining_time": "0:41:14"} | |
| {"current_steps": 550, "total_steps": 1500, "loss": 61.74861450195313, "lr": 7.35160785397218e-05, "epoch": 0.13707165109034267, "percentage": 36.67, "elapsed_time": "0:23:36", "remaining_time": "0:40:46"} | |
| {"current_steps": 560, "total_steps": 1500, "loss": 60.4139892578125, "lr": 7.255462021656132e-05, "epoch": 0.1395638629283489, "percentage": 37.33, "elapsed_time": "0:24:04", "remaining_time": "0:40:24"} | |
| {"current_steps": 570, "total_steps": 1500, "loss": 61.02410888671875, "lr": 7.158257466528651e-05, "epoch": 0.14205607476635515, "percentage": 38.0, "elapsed_time": "0:24:27", "remaining_time": "0:39:54"} | |
| {"current_steps": 580, "total_steps": 1500, "loss": 61.34220581054687, "lr": 7.060039816793141e-05, "epoch": 0.14454828660436136, "percentage": 38.67, "elapsed_time": "0:24:51", "remaining_time": "0:39:25"} | |
| {"current_steps": 590, "total_steps": 1500, "loss": 61.14195556640625, "lr": 6.960855176203624e-05, "epoch": 0.1470404984423676, "percentage": 39.33, "elapsed_time": "0:25:15", "remaining_time": "0:38:57"} | |
| {"current_steps": 600, "total_steps": 1500, "loss": 59.297998046875, "lr": 6.860750102423465e-05, "epoch": 0.14953271028037382, "percentage": 40.0, "elapsed_time": "0:25:42", "remaining_time": "0:38:33"} | |
| {"current_steps": 610, "total_steps": 1500, "loss": 59.147332763671876, "lr": 6.759771585171017e-05, "epoch": 0.15202492211838006, "percentage": 40.67, "elapsed_time": "0:26:06", "remaining_time": "0:38:06"} | |
| {"current_steps": 620, "total_steps": 1500, "loss": 60.24951171875, "lr": 6.65796702416246e-05, "epoch": 0.1545171339563863, "percentage": 41.33, "elapsed_time": "0:26:30", "remaining_time": "0:37:36"} | |
| {"current_steps": 630, "total_steps": 1500, "loss": 60.24437255859375, "lr": 6.555384206862182e-05, "epoch": 0.15700934579439252, "percentage": 42.0, "elapsed_time": "0:26:56", "remaining_time": "0:37:12"} | |
| {"current_steps": 640, "total_steps": 1500, "loss": 59.27978515625, "lr": 6.45207128605117e-05, "epoch": 0.15950155763239876, "percentage": 42.67, "elapsed_time": "0:27:23", "remaining_time": "0:36:48"} | |
| {"current_steps": 650, "total_steps": 1500, "loss": 61.29134521484375, "lr": 6.348076757223877e-05, "epoch": 0.16199376947040497, "percentage": 43.33, "elapsed_time": "0:27:49", "remaining_time": "0:36:23"} | |
| {"current_steps": 660, "total_steps": 1500, "loss": 58.97567138671875, "lr": 6.243449435824276e-05, "epoch": 0.16448598130841122, "percentage": 44.0, "elapsed_time": "0:28:14", "remaining_time": "0:35:56"} | |
| {"current_steps": 670, "total_steps": 1500, "loss": 61.413525390625, "lr": 6.138238434331667e-05, "epoch": 0.16697819314641746, "percentage": 44.67, "elapsed_time": "0:28:37", "remaining_time": "0:35:27"} | |
| {"current_steps": 680, "total_steps": 1500, "loss": 60.899566650390625, "lr": 6.0324931392071074e-05, "epoch": 0.16947040498442367, "percentage": 45.33, "elapsed_time": "0:29:03", "remaining_time": "0:35:02"} | |
| {"current_steps": 690, "total_steps": 1500, "loss": 57.98511962890625, "lr": 5.926263187711202e-05, "epoch": 0.17196261682242991, "percentage": 46.0, "elapsed_time": "0:29:29", "remaining_time": "0:34:36"} | |
| {"current_steps": 700, "total_steps": 1500, "loss": 60.6756103515625, "lr": 5.819598444604174e-05, "epoch": 0.17445482866043613, "percentage": 46.67, "elapsed_time": "0:29:55", "remaining_time": "0:34:11"} | |
| {"current_steps": 710, "total_steps": 1500, "loss": 59.08873291015625, "lr": 5.712548978739154e-05, "epoch": 0.17694704049844237, "percentage": 47.33, "elapsed_time": "0:30:21", "remaining_time": "0:33:46"} | |
| {"current_steps": 720, "total_steps": 1500, "loss": 58.09189453125, "lr": 5.60516503955966e-05, "epoch": 0.17943925233644858, "percentage": 48.0, "elapsed_time": "0:30:45", "remaining_time": "0:33:19"} | |
| {"current_steps": 730, "total_steps": 1500, "loss": 59.247650146484375, "lr": 5.497497033512309e-05, "epoch": 0.18193146417445483, "percentage": 48.67, "elapsed_time": "0:31:07", "remaining_time": "0:32:50"} | |
| {"current_steps": 740, "total_steps": 1500, "loss": 60.30670166015625, "lr": 5.38959550038583e-05, "epoch": 0.18442367601246107, "percentage": 49.33, "elapsed_time": "0:31:34", "remaining_time": "0:32:26"} | |
| {"current_steps": 750, "total_steps": 1500, "loss": 58.3121826171875, "lr": 5.281511089587491e-05, "epoch": 0.18691588785046728, "percentage": 50.0, "elapsed_time": "0:32:00", "remaining_time": "0:32:00"} | |
| {"current_steps": 760, "total_steps": 1500, "loss": 57.36131591796875, "lr": 5.173294536368062e-05, "epoch": 0.18940809968847352, "percentage": 50.67, "elapsed_time": "0:32:22", "remaining_time": "0:31:31"} | |
| {"current_steps": 770, "total_steps": 1500, "loss": 57.4423828125, "lr": 5.0649966380064895e-05, "epoch": 0.19190031152647974, "percentage": 51.33, "elapsed_time": "0:32:48", "remaining_time": "0:31:06"} | |
| {"current_steps": 780, "total_steps": 1500, "loss": 58.411962890625, "lr": 4.9566682299654546e-05, "epoch": 0.19439252336448598, "percentage": 52.0, "elapsed_time": "0:33:13", "remaining_time": "0:30:39"} | |
| {"current_steps": 790, "total_steps": 1500, "loss": 60.6809326171875, "lr": 4.848360162028997e-05, "epoch": 0.19688473520249222, "percentage": 52.67, "elapsed_time": "0:33:39", "remaining_time": "0:30:14"} | |
| {"current_steps": 800, "total_steps": 1500, "loss": 60.1671875, "lr": 4.740123274433438e-05, "epoch": 0.19937694704049844, "percentage": 53.33, "elapsed_time": "0:34:07", "remaining_time": "0:29:51"} | |
| {"current_steps": 810, "total_steps": 1500, "loss": 59.940185546875, "lr": 4.6320083740027584e-05, "epoch": 0.20186915887850468, "percentage": 54.0, "elapsed_time": "0:34:31", "remaining_time": "0:29:24"} | |
| {"current_steps": 820, "total_steps": 1500, "loss": 62.11185302734375, "lr": 4.524066210299685e-05, "epoch": 0.2043613707165109, "percentage": 54.67, "elapsed_time": "0:34:57", "remaining_time": "0:28:59"} | |
| {"current_steps": 830, "total_steps": 1500, "loss": 59.7978515625, "lr": 4.416347451803637e-05, "epoch": 0.20685358255451713, "percentage": 55.33, "elapsed_time": "0:35:22", "remaining_time": "0:28:33"} | |
| {"current_steps": 840, "total_steps": 1500, "loss": 59.99947509765625, "lr": 4.308902662126748e-05, "epoch": 0.20934579439252338, "percentage": 56.0, "elapsed_time": "0:35:50", "remaining_time": "0:28:09"} | |
| {"current_steps": 850, "total_steps": 1500, "loss": 57.72410888671875, "lr": 4.2017822762790956e-05, "epoch": 0.2118380062305296, "percentage": 56.67, "elapsed_time": "0:36:17", "remaining_time": "0:27:44"} | |
| {"current_steps": 860, "total_steps": 1500, "loss": 57.016107177734376, "lr": 4.095036576994321e-05, "epoch": 0.21433021806853583, "percentage": 57.33, "elapsed_time": "0:36:43", "remaining_time": "0:27:20"} | |
| {"current_steps": 870, "total_steps": 1500, "loss": 58.38250732421875, "lr": 3.988715671126704e-05, "epoch": 0.21682242990654205, "percentage": 58.0, "elapsed_time": "0:37:11", "remaining_time": "0:26:55"} | |
| {"current_steps": 880, "total_steps": 1500, "loss": 59.503271484375, "lr": 3.882869466130812e-05, "epoch": 0.2193146417445483, "percentage": 58.67, "elapsed_time": "0:37:38", "remaining_time": "0:26:30"} | |
| {"current_steps": 890, "total_steps": 1500, "loss": 56.64525146484375, "lr": 3.777547646634741e-05, "epoch": 0.22180685358255453, "percentage": 59.33, "elapsed_time": "0:38:05", "remaining_time": "0:26:06"} | |
| {"current_steps": 900, "total_steps": 1500, "loss": 57.2556640625, "lr": 3.672799651117958e-05, "epoch": 0.22429906542056074, "percentage": 60.0, "elapsed_time": "0:38:33", "remaining_time": "0:25:42"} | |
| {"current_steps": 910, "total_steps": 1500, "loss": 58.7387939453125, "lr": 3.568674648704677e-05, "epoch": 0.226791277258567, "percentage": 60.67, "elapsed_time": "0:38:57", "remaining_time": "0:25:15"} | |
| {"current_steps": 920, "total_steps": 1500, "loss": 59.7054931640625, "lr": 3.4652215160836826e-05, "epoch": 0.2292834890965732, "percentage": 61.33, "elapsed_time": "0:39:21", "remaining_time": "0:24:49"} | |
| {"current_steps": 930, "total_steps": 1500, "loss": 57.66356201171875, "lr": 3.362488814565414e-05, "epoch": 0.23177570093457944, "percentage": 62.0, "elapsed_time": "0:39:47", "remaining_time": "0:24:23"} | |
| {"current_steps": 940, "total_steps": 1500, "loss": 57.801812744140626, "lr": 3.2605247672870965e-05, "epoch": 0.23426791277258566, "percentage": 62.67, "elapsed_time": "0:40:12", "remaining_time": "0:23:57"} | |
| {"current_steps": 950, "total_steps": 1500, "loss": 58.65765380859375, "lr": 3.1593772365766105e-05, "epoch": 0.2367601246105919, "percentage": 63.33, "elapsed_time": "0:40:37", "remaining_time": "0:23:30"} | |
| {"current_steps": 960, "total_steps": 1500, "loss": 57.47552490234375, "lr": 3.059093701485722e-05, "epoch": 0.23925233644859814, "percentage": 64.0, "elapsed_time": "0:41:00", "remaining_time": "0:23:04"} | |
| {"current_steps": 970, "total_steps": 1500, "loss": 59.5080810546875, "lr": 2.95972123550323e-05, "epoch": 0.24174454828660435, "percentage": 64.67, "elapsed_time": "0:41:28", "remaining_time": "0:22:39"} | |
| {"current_steps": 980, "total_steps": 1500, "loss": 58.1850830078125, "lr": 2.8613064844584812e-05, "epoch": 0.2442367601246106, "percentage": 65.33, "elapsed_time": "0:41:52", "remaining_time": "0:22:12"} | |
| {"current_steps": 990, "total_steps": 1500, "loss": 58.619384765625, "lr": 2.763895644625637e-05, "epoch": 0.2467289719626168, "percentage": 66.0, "elapsed_time": "0:42:16", "remaining_time": "0:21:46"} | |
| {"current_steps": 1000, "total_steps": 1500, "loss": 57.890045166015625, "lr": 2.6675344410389623e-05, "epoch": 0.24922118380062305, "percentage": 66.67, "elapsed_time": "0:42:43", "remaining_time": "0:21:21"} | |
| {"current_steps": 1010, "total_steps": 1500, "loss": 58.45093994140625, "lr": 2.5722681060292952e-05, "epoch": 0.25171339563862927, "percentage": 67.33, "elapsed_time": "0:43:07", "remaining_time": "0:20:55"} | |
| {"current_steps": 1020, "total_steps": 1500, "loss": 58.074462890625, "lr": 2.4781413579918382e-05, "epoch": 0.2542056074766355, "percentage": 68.0, "elapsed_time": "0:43:30", "remaining_time": "0:20:28"} | |
| {"current_steps": 1030, "total_steps": 1500, "loss": 58.45963134765625, "lr": 2.3851983803951444e-05, "epoch": 0.25669781931464175, "percentage": 68.67, "elapsed_time": "0:43:56", "remaining_time": "0:20:03"} | |
| {"current_steps": 1040, "total_steps": 1500, "loss": 58.1219970703125, "lr": 2.2934828010412362e-05, "epoch": 0.259190031152648, "percentage": 69.33, "elapsed_time": "0:44:22", "remaining_time": "0:19:37"} | |
| {"current_steps": 1050, "total_steps": 1500, "loss": 58.8197509765625, "lr": 2.2030376715865314e-05, "epoch": 0.2616822429906542, "percentage": 70.0, "elapsed_time": "0:44:46", "remaining_time": "0:19:11"} | |
| {"current_steps": 1060, "total_steps": 1500, "loss": 57.1118896484375, "lr": 2.1139054473332358e-05, "epoch": 0.2641744548286604, "percentage": 70.67, "elapsed_time": "0:45:12", "remaining_time": "0:18:46"} | |
| {"current_steps": 1070, "total_steps": 1500, "loss": 59.5951171875, "lr": 2.026127967300645e-05, "epoch": 0.26666666666666666, "percentage": 71.33, "elapsed_time": "0:45:40", "remaining_time": "0:18:21"} | |
| {"current_steps": 1080, "total_steps": 1500, "loss": 59.3608642578125, "lr": 1.9397464345857562e-05, "epoch": 0.2691588785046729, "percentage": 72.0, "elapsed_time": "0:46:05", "remaining_time": "0:17:55"} | |
| {"current_steps": 1090, "total_steps": 1500, "loss": 58.517822265625, "lr": 1.854801397022351e-05, "epoch": 0.27165109034267915, "percentage": 72.67, "elapsed_time": "0:46:31", "remaining_time": "0:17:30"} | |
| {"current_steps": 1100, "total_steps": 1500, "loss": 58.18668212890625, "lr": 1.7713327281477077e-05, "epoch": 0.27414330218068533, "percentage": 73.33, "elapsed_time": "0:46:58", "remaining_time": "0:17:04"} | |
| {"current_steps": 1110, "total_steps": 1500, "loss": 58.797900390625, "lr": 1.6893796084857804e-05, "epoch": 0.2766355140186916, "percentage": 74.0, "elapsed_time": "0:47:25", "remaining_time": "0:16:39"} | |
| {"current_steps": 1120, "total_steps": 1500, "loss": 58.73525390625, "lr": 1.6089805071557255e-05, "epoch": 0.2791277258566978, "percentage": 74.67, "elapsed_time": "0:47:52", "remaining_time": "0:16:14"} | |
| {"current_steps": 1130, "total_steps": 1500, "loss": 56.05345458984375, "lr": 1.5301731638143287e-05, "epoch": 0.28161993769470406, "percentage": 75.33, "elapsed_time": "0:48:18", "remaining_time": "0:15:49"} | |
| {"current_steps": 1140, "total_steps": 1500, "loss": 59.84700927734375, "lr": 1.4529945709408727e-05, "epoch": 0.2841121495327103, "percentage": 76.0, "elapsed_time": "0:48:42", "remaining_time": "0:15:22"} | |
| {"current_steps": 1150, "total_steps": 1500, "loss": 58.613311767578125, "lr": 1.3774809564727103e-05, "epoch": 0.2866043613707165, "percentage": 76.67, "elapsed_time": "0:49:07", "remaining_time": "0:14:57"} | |
| {"current_steps": 1160, "total_steps": 1500, "loss": 57.91171875, "lr": 1.303667766799741e-05, "epoch": 0.28909657320872273, "percentage": 77.33, "elapsed_time": "0:49:34", "remaining_time": "0:14:31"} | |
| {"current_steps": 1170, "total_steps": 1500, "loss": 58.024560546875, "lr": 1.2315896501257147e-05, "epoch": 0.29158878504672897, "percentage": 78.0, "elapsed_time": "0:49:58", "remaining_time": "0:14:05"} | |
| {"current_steps": 1180, "total_steps": 1500, "loss": 58.019476318359374, "lr": 1.161280440204251e-05, "epoch": 0.2940809968847352, "percentage": 78.67, "elapsed_time": "0:50:24", "remaining_time": "0:13:40"} | |
| {"current_steps": 1190, "total_steps": 1500, "loss": 56.94827880859375, "lr": 1.0927731404571211e-05, "epoch": 0.29657320872274145, "percentage": 79.33, "elapsed_time": "0:50:50", "remaining_time": "0:13:14"} | |
| {"current_steps": 1200, "total_steps": 1500, "loss": 57.61213989257813, "lr": 1.0260999084823265e-05, "epoch": 0.29906542056074764, "percentage": 80.0, "elapsed_time": "0:51:17", "remaining_time": "0:12:49"} | |
| {"current_steps": 1210, "total_steps": 1500, "loss": 55.23438720703125, "lr": 9.612920409591813e-06, "epoch": 0.3015576323987539, "percentage": 80.67, "elapsed_time": "0:51:44", "remaining_time": "0:12:24"} | |
| {"current_steps": 1220, "total_steps": 1500, "loss": 57.05225830078125, "lr": 8.983799589575392e-06, "epoch": 0.3040498442367601, "percentage": 81.33, "elapsed_time": "0:52:11", "remaining_time": "0:11:58"} | |
| {"current_steps": 1230, "total_steps": 1500, "loss": 59.320458984375, "lr": 8.373931936580114e-06, "epoch": 0.30654205607476637, "percentage": 82.0, "elapsed_time": "0:52:36", "remaining_time": "0:11:32"} | |
| {"current_steps": 1240, "total_steps": 1500, "loss": 59.88106689453125, "lr": 7.783603724899257e-06, "epoch": 0.3090342679127726, "percentage": 82.67, "elapsed_time": "0:53:01", "remaining_time": "0:11:07"} | |
| {"current_steps": 1250, "total_steps": 1500, "loss": 57.19481201171875, "lr": 7.213092056934833e-06, "epoch": 0.3115264797507788, "percentage": 83.33, "elapsed_time": "0:53:27", "remaining_time": "0:10:41"} | |
| {"current_steps": 1260, "total_steps": 1500, "loss": 54.98917236328125, "lr": 6.662664733124768e-06, "epoch": 0.31401869158878504, "percentage": 84.0, "elapsed_time": "0:53:55", "remaining_time": "0:10:16"} | |
| {"current_steps": 1270, "total_steps": 1500, "loss": 57.67198486328125, "lr": 6.132580126236198e-06, "epoch": 0.3165109034267913, "percentage": 84.67, "elapsed_time": "0:54:19", "remaining_time": "0:09:50"} | |
| {"current_steps": 1280, "total_steps": 1500, "loss": 57.73681640625, "lr": 5.623087060084364e-06, "epoch": 0.3190031152647975, "percentage": 85.33, "elapsed_time": "0:54:42", "remaining_time": "0:09:24"} | |
| {"current_steps": 1290, "total_steps": 1500, "loss": 57.99171142578125, "lr": 5.13442469273363e-06, "epoch": 0.32149532710280376, "percentage": 86.0, "elapsed_time": "0:55:07", "remaining_time": "0:08:58"} | |
| {"current_steps": 1300, "total_steps": 1500, "loss": 58.67447509765625, "lr": 4.666822404235838e-06, "epoch": 0.32398753894080995, "percentage": 86.67, "elapsed_time": "0:55:34", "remaining_time": "0:08:32"} | |
| {"current_steps": 1310, "total_steps": 1500, "loss": 58.18992919921875, "lr": 4.220499688958307e-06, "epoch": 0.3264797507788162, "percentage": 87.33, "elapsed_time": "0:55:56", "remaining_time": "0:08:06"} | |
| {"current_steps": 1320, "total_steps": 1500, "loss": 58.37633056640625, "lr": 3.795666052552416e-06, "epoch": 0.32897196261682243, "percentage": 88.0, "elapsed_time": "0:56:21", "remaining_time": "0:07:41"} | |
| {"current_steps": 1330, "total_steps": 1500, "loss": 57.58590087890625, "lr": 3.3925209136106808e-06, "epoch": 0.3314641744548287, "percentage": 88.67, "elapsed_time": "0:56:46", "remaining_time": "0:07:15"} | |
| {"current_steps": 1340, "total_steps": 1500, "loss": 57.52745361328125, "lr": 3.01125351005902e-06, "epoch": 0.3339563862928349, "percentage": 89.33, "elapsed_time": "0:57:12", "remaining_time": "0:06:49"} | |
| {"current_steps": 1350, "total_steps": 1500, "loss": 54.82862548828125, "lr": 2.6520428103276318e-06, "epoch": 0.3364485981308411, "percentage": 90.0, "elapsed_time": "0:57:38", "remaining_time": "0:06:24"} | |
| {"current_steps": 1360, "total_steps": 1500, "loss": 56.81730346679687, "lr": 2.3150574293425377e-06, "epoch": 0.33894080996884735, "percentage": 90.67, "elapsed_time": "0:58:05", "remaining_time": "0:05:58"} | |
| {"current_steps": 1370, "total_steps": 1500, "loss": 57.27638549804688, "lr": 2.000455549377045e-06, "epoch": 0.3414330218068536, "percentage": 91.33, "elapsed_time": "0:58:32", "remaining_time": "0:05:33"} | |
| {"current_steps": 1380, "total_steps": 1500, "loss": 59.63363037109375, "lr": 1.7083848458004038e-06, "epoch": 0.34392523364485983, "percentage": 92.0, "elapsed_time": "0:59:00", "remaining_time": "0:05:07"} | |
| {"current_steps": 1390, "total_steps": 1500, "loss": 59.1685302734375, "lr": 1.4389824177583388e-06, "epoch": 0.34641744548286607, "percentage": 92.67, "elapsed_time": "0:59:26", "remaining_time": "0:04:42"} | |
| {"current_steps": 1400, "total_steps": 1500, "loss": 57.37066650390625, "lr": 1.1923747238182403e-06, "epoch": 0.34890965732087226, "percentage": 93.33, "elapsed_time": "0:59:51", "remaining_time": "0:04:16"} | |
| {"current_steps": 1410, "total_steps": 1500, "loss": 56.397607421875, "lr": 9.68677522608946e-07, "epoch": 0.3514018691588785, "percentage": 94.0, "elapsed_time": "1:00:14", "remaining_time": "0:03:50"} | |
| {"current_steps": 1420, "total_steps": 1500, "loss": 59.682470703125, "lr": 7.679958184832304e-07, "epoch": 0.35389408099688474, "percentage": 94.67, "elapsed_time": "1:00:38", "remaining_time": "0:03:25"} | |
| {"current_steps": 1430, "total_steps": 1500, "loss": 59.41876220703125, "lr": 5.904238122283135e-07, "epoch": 0.356386292834891, "percentage": 95.33, "elapsed_time": "1:01:06", "remaining_time": "0:02:59"} | |
| {"current_steps": 1440, "total_steps": 1500, "loss": 59.55084228515625, "lr": 4.3604485684765606e-07, "epoch": 0.35887850467289717, "percentage": 96.0, "elapsed_time": "1:01:30", "remaining_time": "0:02:33"} | |
| {"current_steps": 1450, "total_steps": 1500, "loss": 58.51279296875, "lr": 3.0493141843472296e-07, "epoch": 0.3613707165109034, "percentage": 96.67, "elapsed_time": "1:01:52", "remaining_time": "0:02:08"} | |
| {"current_steps": 1460, "total_steps": 1500, "loss": 57.33876953125, "lr": 1.9714504215711527e-07, "epoch": 0.36386292834890965, "percentage": 97.33, "elapsed_time": "1:02:18", "remaining_time": "0:01:42"} | |
| {"current_steps": 1470, "total_steps": 1500, "loss": 57.85191650390625, "lr": 1.1273632336700756e-07, "epoch": 0.3663551401869159, "percentage": 98.0, "elapsed_time": "1:02:43", "remaining_time": "0:01:16"} | |
| {"current_steps": 1480, "total_steps": 1500, "loss": 58.805126953125, "lr": 5.174488385152887e-08, "epoch": 0.36884735202492214, "percentage": 98.67, "elapsed_time": "1:03:06", "remaining_time": "0:00:51"} | |
| {"current_steps": 1490, "total_steps": 1500, "loss": 58.005548095703126, "lr": 1.419935323409005e-08, "epoch": 0.3713395638629283, "percentage": 99.33, "elapsed_time": "1:03:31", "remaining_time": "0:00:25"} | |
| {"current_steps": 1500, "total_steps": 1500, "loss": 57.0460693359375, "lr": 1.1735553555602963e-10, "epoch": 0.37383177570093457, "percentage": 100.0, "elapsed_time": "1:03:58", "remaining_time": "0:00:00"} | |
| {"current_steps": 1500, "total_steps": 1500, "epoch": 0.37383177570093457, "percentage": 100.0, "elapsed_time": "1:05:40", "remaining_time": "0:00:00"} | |