spicyneuron commited on
Commit
462b773
·
verified ·
1 Parent(s): aa3a1c4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -0
README.md CHANGED
@@ -26,6 +26,14 @@ This one is comparable to
26
  [Unsloth's MOE-MXFP4](https://huggingface.co/unsloth/Qwen3-Coder-Next-GGUF/blob/main/Qwen3-Coder-Next-MXFP4_MOE.gguf)
27
  in size, but loads and runs noticeably faster thanks to MLX.
28
 
 
 
 
 
 
 
 
 
29
  # Benchmarks
30
 
31
  - unsloth/Qwen3-Coder-Next-GGUF:UD-Q4_K_XL
 
26
  [Unsloth's MOE-MXFP4](https://huggingface.co/unsloth/Qwen3-Coder-Next-GGUF/blob/main/Qwen3-Coder-Next-MXFP4_MOE.gguf)
27
  in size, but loads and runs noticeably faster thanks to MLX.
28
 
29
+ # Usage
30
+
31
+ ```sh
32
+ # Start server at http://localhost:8080/v1/chat/completions
33
+ uvx --from mlx-lm mlx_lm.server --host 127.0.0.1 --port 8080 \
34
+ --model spicyneuron/Qwen3-Next-Coder-MLX-mixed-4.5-bit
35
+ ```
36
+
37
  # Benchmarks
38
 
39
  - unsloth/Qwen3-Coder-Next-GGUF:UD-Q4_K_XL