spicyneuron committed on
Commit e752b57 · verified · 1 Parent(s): 816d837

Update README.md

  1. README.md +2 -2
README.md CHANGED
@@ -10,9 +10,9 @@ tags:
 
 [Qwen3-Coder-Next](https://huggingface.co/moonshotai/Qwen/Qwen3-Coder-Next) optimized for MLX. Note: Uses MXFP4 for some module paths.
 
-**EDIT:** v2 fixes some misassigned shared expert gates. Slower, but with 4x better perplexity.
+**EDIT:** [v2](https://huggingface.co/spicyneuron/Qwen3-Next-Coder-MLX-mixed-4.5-bit/tree/v2) fixes some misassigned shared expert gates. Slower, but with 4x better perplexity.
 
-**EDIT:** v3 bumps edge experts to Q8 for further perplexity improvement and minimal effect on speed.
+**EDIT:** [v3](https://huggingface.co/spicyneuron/Qwen3-Next-Coder-MLX-mixed-4.5-bit/tree/v3) bumps edge experts to Q8 for further perplexity improvement and minimal effect on speed.
 
 # Methodology
 