rockypod commited on
Commit
93caec9
·
verified ·
1 Parent(s): 1ce3ad8

README: note MLX v3.2 is now available at mlx-v3.2/

Browse files
Files changed (1) hide show
  1. README.md +3 -2
README.md CHANGED
@@ -44,8 +44,8 @@ with Tailwind v4 styling and WCAG 2.2 AAA accessibility.
44
  > v3.2 8B and 4B are trained and staged on hardware, pending exam review;
45
  > their HF repos will update shortly after.
46
 
47
- > **MLX format for v3.2 is coming later** MLX conversion is a follow-up
48
- > step. v3.1 MLX safetensors remain at `mlx-v3.1/` in this repo.
49
 
50
  ## Install via Ollama
51
 
@@ -165,6 +165,7 @@ any single real miss a floor failure).
165
  | File | Format | Size | Use case |
166
  |---|---|---|---|
167
  | `neotoi-coder-v3.2-q4_k_m_patched.gguf` | GGUF Q4_K_M | 8.4 GB | **Current 15B v3.2** — LM Studio, llama.cpp, Ollama |
 
168
  | `neotoi-coder-v3.1-q4_k_m.gguf` | GGUF Q4_K_M | 8.4 GB | v3.1 archive |
169
  | `neotoi-coder-v3-q4_k_m_patched.gguf` | GGUF Q4_K_M | 9 GB | v3.0 archive |
170
  | `neotoi-coder-v2.0-q4_k_m.gguf` | GGUF Q4_K_M | 9 GB | v2.0 archive |
 
44
  > v3.2 8B and 4B are trained and staged on hardware, pending exam review;
45
  > their HF repos will update shortly after.
46
 
47
+ > **MLX format for v3.2 is available now** at `mlx-v3.2/` in this repo
48
+ > (7.7 GB, 4-bit quantized, 2 shards). v3.1 MLX remains at `mlx-v3.1/`.
49
 
50
  ## Install via Ollama
51
 
 
165
  | File | Format | Size | Use case |
166
  |---|---|---|---|
167
  | `neotoi-coder-v3.2-q4_k_m_patched.gguf` | GGUF Q4_K_M | 8.4 GB | **Current 15B v3.2** — LM Studio, llama.cpp, Ollama |
168
+ | `mlx-v3.2/` | MLX 4-bit safetensors | 7.7 GB | **Current 15B v3.2 MLX** — Apple Silicon (mlx-lm) |
169
  | `neotoi-coder-v3.1-q4_k_m.gguf` | GGUF Q4_K_M | 8.4 GB | v3.1 archive |
170
  | `neotoi-coder-v3-q4_k_m_patched.gguf` | GGUF Q4_K_M | 9 GB | v3.0 archive |
171
  | `neotoi-coder-v2.0-q4_k_m.gguf` | GGUF Q4_K_M | 9 GB | v2.0 archive |