Text Generation
Safetensors
GGUF
Rust
English
Vietnamese
dioxus
accessibility
wcag
fine-tuned
raft
code
server-functions
qwen3
family-hub
scoped-css
syncstore
conversational
Instructions to use rockypod/neotoi-coder with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use rockypod/neotoi-coder with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="rockypod/neotoi-coder", filename="neotoi-coder-v1-q4_k_m_final.gguf", )
llm.create_chat_completion( messages = [ { "role": "user", "content": "What is the capital of France?" } ] ) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use rockypod/neotoi-coder with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf rockypod/neotoi-coder:Q4_K_M # Run inference directly in the terminal: llama-cli -hf rockypod/neotoi-coder:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf rockypod/neotoi-coder:Q4_K_M # Run inference directly in the terminal: llama-cli -hf rockypod/neotoi-coder:Q4_K_M
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf rockypod/neotoi-coder:Q4_K_M # Run inference directly in the terminal: ./llama-cli -hf rockypod/neotoi-coder:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf rockypod/neotoi-coder:Q4_K_M # Run inference directly in the terminal: ./build/bin/llama-cli -hf rockypod/neotoi-coder:Q4_K_M
Use Docker
docker model run hf.co/rockypod/neotoi-coder:Q4_K_M
- LM Studio
- Jan
- vLLM
How to use rockypod/neotoi-coder with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "rockypod/neotoi-coder" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "rockypod/neotoi-coder", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/rockypod/neotoi-coder:Q4_K_M
- Ollama
How to use rockypod/neotoi-coder with Ollama:
ollama run hf.co/rockypod/neotoi-coder:Q4_K_M
- Unsloth Studio new
How to use rockypod/neotoi-coder with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for rockypod/neotoi-coder to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for rockypod/neotoi-coder to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for rockypod/neotoi-coder to start chatting
- Pi new
How to use rockypod/neotoi-coder with Pi:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama-server -hf rockypod/neotoi-coder:Q4_K_M
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "llama-cpp": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "rockypod/neotoi-coder:Q4_K_M" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use rockypod/neotoi-coder with Hermes Agent:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama-server -hf rockypod/neotoi-coder:Q4_K_M
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default rockypod/neotoi-coder:Q4_K_M
Run Hermes
hermes
- Docker Model Runner
How to use rockypod/neotoi-coder with Docker Model Runner:
docker model run hf.co/rockypod/neotoi-coder:Q4_K_M
- Lemonade
How to use rockypod/neotoi-coder with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull rockypod/neotoi-coder:Q4_K_M
Run and chat with the model
lemonade run user.neotoi-coder-Q4_K_M
List all available models
lemonade list
docs: add 8B + 4B v3.2 scorecards, mark all three variants published
Browse files
README.md
CHANGED
|
@@ -25,90 +25,64 @@ pipeline_tag: text-generation
|
|
| 25 |
|
| 26 |
# Neotoi Coder
|
| 27 |
|
| 28 |
-
A Rust / Dioxus 0.7 specialist LLM
|
| 29 |
-
|
| 30 |
-
|
| 31 |
|
| 32 |
All variants are fine-tuned via RAFT (Retrieval-Augmented Fine-Tuning) on
|
| 33 |
-
Qwen3 base models
|
| 34 |
-
|
| 35 |
|
| 36 |
-
## Variants
|
| 37 |
|
| 38 |
| Variant | Repo | Base | Params | Q4_K_M | Spec exam |
|
| 39 |
|---|---|---|---|---|---|
|
| 40 |
-
| **15B v3.2** (this repo
|
| 41 |
-
| 8B v3.
|
| 42 |
-
| 4B v3.
|
| 43 |
|
| 44 |
-
|
| 45 |
-
> their HF repos will update shortly after.
|
| 46 |
|
| 47 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 48 |
> (7.7 GB, 4-bit quantized, 2 shards). v3.1 MLX remains at `mlx-v3.1/`.
|
| 49 |
|
| 50 |
## Install via Ollama
|
| 51 |
|
| 52 |
```bash
|
| 53 |
-
# 15B v3.2 β
|
| 54 |
ollama pull rockypod/neotoi-coder:latest
|
| 55 |
ollama pull rockypod/neotoi-coder:15b # explicit size tag
|
| 56 |
|
| 57 |
-
# 8B v3.
|
| 58 |
ollama pull rockypod/neotoi-coder:8b
|
| 59 |
|
| 60 |
-
# 4B v3.
|
| 61 |
ollama pull rockypod/neotoi-coder:4b
|
| 62 |
```
|
| 63 |
|
| 64 |
-
|
| 65 |
-
|
| 66 |
-
|
| 67 |
-
|
| 68 |
-
- **Overall:** 94.81% β **95.12%** on a harder, longer exam (114Q vs 103Q,
|
| 69 |
-
max 164 vs 144.5, two new tiers).
|
| 70 |
-
- **T4 WCAG / ARIA: 78.6% β 100.0%** β the biggest single jump. v3.1's
|
| 71 |
-
largest weakness (drops `rsx!` macro on ARIA-heavy components) is fixed
|
| 72 |
-
in v3.2 by the dedicated WCAG correction set in T55 training.
|
| 73 |
-
- **All 11 original v3.1 tiers stay at β₯87.5%**; nine of them at 100%.
|
| 74 |
-
- 113/114 questions parse cleanly through the patched grader; 1 question
|
| 75 |
-
hit a generation-side degeneration loop (Q77 in T7 Primitives+CSS).
|
| 76 |
-
|
| 77 |
-
### New Dioxus 0.7 surface
|
| 78 |
-
|
| 79 |
-
v3.2 expands coverage from Dioxus 0.7.0 through **Dioxus 0.7.9** (full 0.7
|
| 80 |
-
series, with 0.7.6 being the final 0.7 release before 0.8). New training
|
| 81 |
-
topics added:
|
| 82 |
-
|
| 83 |
-
- **T44 Scoped CSS and CSS modules** (Dioxus 0.7.3, PR #5087)
|
| 84 |
-
- **T45 SyncStore + `use_store_sync`** (Dioxus 0.7.2, cross-thread reactive state)
|
| 85 |
-
- **T46 New events:** `onauxclick`, `onscrollend` (Dioxus 0.7.3)
|
| 86 |
-
- **T47 Server-only extractors** + `serde_qs` query string support (0.7.1 + 0.7.3)
|
| 87 |
-
- **T48 0.7.2 bug-fix awareness** β optional callback props, child router layouts, drag/drop serialisation, `use_drop` in prelude
|
| 88 |
-
- **T49 0.7.4 APIs:** `WritableResultExt`, WebSocket `Stream + Sink`, FFI for Kotlin/Java/Swift, iOS widget bundling
|
| 89 |
-
- **T50 0.7.6 RSX additions:** `inert` global attribute, web panic resilience, blanket `IntoAttributeValue` for borrowed values, `Action::PartialEq`
|
| 90 |
-
- **T51 `use_context` vs `consume_context` semantics** β panic-on-missing-provider (returns `T`, not `Option<T>`)
|
| 91 |
|
| 92 |
-
##
|
| 93 |
|
| 94 |
-
|
| 95 |
-
failure axes identified in v3.1 evaluation:
|
| 96 |
|
| 97 |
-
|
| 98 |
-
|
| 99 |
-
|
| 100 |
-
|
| 101 |
-
|
| 102 |
-
`use_init_i18n`, DaisyUI v5 / Tailwind v4 CSS-first patterns.
|
| 103 |
-
- **T55 WCAG / ARIA corrections** (22) β drives the 78.6% β 100% jump.
|
| 104 |
-
- **T56 `dioxus-i18n` + Fluent** (22) β `LanguageIdentifier`, `t!()`,
|
| 105 |
-
append-not-replace catalog edits.
|
| 106 |
-
- **T57 Scope discipline** (12) β answer exactly what was asked.
|
| 107 |
-
|
| 108 |
-
## v3.2 spec-exam scorecard (15B)
|
| 109 |
|
| 110 |
-
|
| 111 |
-
90% (147.6 / 164.0), release bar 95% (155.8 / 164.0).
|
| 112 |
|
| 113 |
| Tier | Count | Max wt | Raw | Wtd | Rate | Floor | Status |
|
| 114 |
|---|---|---|---|---|---|---|---|
|
|
@@ -127,26 +101,88 @@ failure axes identified in v3.1 evaluation:
|
|
| 127 |
| **T13 SyncStore (NEW)** | 2 | 3.0 | 0 | 0.0 | 0.0% | 82% | β οΈ |
|
| 128 |
| **Total** | **114** | **164.0** | **109** | **156.0** | **95.12%** | β | β |
|
| 129 |
|
| 130 |
-
|
| 131 |
-
|
| 132 |
-
|
| 133 |
-
|
| 134 |
-
|
| 135 |
-
|
| 136 |
-
|
| 137 |
-
|
| 138 |
-
|
| 139 |
-
|
| 140 |
-
|
| 141 |
-
|
| 142 |
-
|
| 143 |
-
|
| 144 |
-
|
| 145 |
-
|
| 146 |
-
|
| 147 |
-
|
| 148 |
-
|
| 149 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 150 |
|
| 151 |
## Version History
|
| 152 |
|
|
@@ -158,7 +194,9 @@ any single real miss a floor failure).
|
|
| 158 |
| v3.1 15B | Qwen3-Coder-14B (14.8B) | 137.0/144.5 (94.81%) | 103Q weighted, 11 tiers | 4,880 |
|
| 159 |
| v3.1 8B | Qwen3-8B (8.2B) | 144.5/144.5 (100.00%) | 103Q weighted, 11 tiers | 4,880 |
|
| 160 |
| v3.1 4B | Qwen3-4B (4.0B, tied) | 143.5/144.5 (99.31%) | 103Q weighted, 11 tiers | 4,880 |
|
| 161 |
-
|
|
|
|
|
|
|
|
| 162 |
|
| 163 |
## Files in this repo (15B and historical)
|
| 164 |
|
|
@@ -170,11 +208,10 @@ any single real miss a floor failure).
|
|
| 170 |
| `neotoi-coder-v3-q4_k_m_patched.gguf` | GGUF Q4_K_M | 9 GB | v3.0 archive |
|
| 171 |
| `neotoi-coder-v2.0-q4_k_m.gguf` | GGUF Q4_K_M | 9 GB | v2.0 archive |
|
| 172 |
| `neotoi-coder-v1-q4_k_m_final.gguf` | GGUF Q4_K_M | 9 GB | v1.0 archive |
|
| 173 |
-
| `mlx-v3.1/` | MLX safetensors | β | v3.1 MLX
|
| 174 |
| `mlx-v3/` | MLX safetensors | β | v3.0 MLX archive |
|
| 175 |
|
| 176 |
-
For the **8B** and **4B** Q4_K_M GGUFs
|
| 177 |
-
dedicated repos:
|
| 178 |
|
| 179 |
- https://huggingface.co/rockypod/neotoi-coder-8b
|
| 180 |
- https://huggingface.co/rockypod/neotoi-coder-4b
|
|
@@ -184,3 +221,13 @@ dedicated repos:
|
|
| 184 |
This model emits Qwen3 native `<think>...</think>` blocks. Thinking is on
|
| 185 |
by default with the `_patched.gguf` quants on inference backends that
|
| 186 |
honor `qwen3.thinking`.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 25 |
|
| 26 |
# Neotoi Coder
|
| 27 |
|
| 28 |
+
A Rust / Dioxus 0.7 specialist LLM fine-tuned on 5,287 curated examples
|
| 29 |
+
covering the full Dioxus 0.7 series (0.7.0β0.7.9), Tailwind v4, and
|
| 30 |
+
WCAG 2.2 AAA accessibility. All three v3.2 variants are published.
|
| 31 |
|
| 32 |
All variants are fine-tuned via RAFT (Retrieval-Augmented Fine-Tuning) on
|
| 33 |
+
Qwen3 base models using LoRA adapters (Unsloth), optimized for
|
| 34 |
+
production-quality Dioxus 0.7 components.
|
| 35 |
|
| 36 |
+
## Variants
|
| 37 |
|
| 38 |
| Variant | Repo | Base | Params | Q4_K_M | Spec exam |
|
| 39 |
|---|---|---|---|---|---|
|
| 40 |
+
| **15B v3.2** (this repo) | `rockypod/neotoi-coder` | Qwen3-Coder-14B | 14.8B | 8.4 GB | **156.0 / 164.0 β 95.12%** (114Q, 13 tiers) |
|
| 41 |
+
| **8B v3.2** | [`rockypod/neotoi-coder-8b`](https://huggingface.co/rockypod/neotoi-coder-8b) | Qwen3-8B | 8.2B | 4.68 GB | **160.0 / 164.0 β 97.56%** (114Q, 13 tiers) |
|
| 42 |
+
| **4B v3.2** | [`rockypod/neotoi-coder-4b`](https://huggingface.co/rockypod/neotoi-coder-4b) | Qwen3-4B | 4.0B | 2.33 GB | **160.0 / 164.0 β 97.56%** (114Q, 13 tiers) |
|
| 43 |
|
| 44 |
+
All three clear the 90% publication bar and the 95% release bar.
|
|
|
|
| 45 |
|
| 46 |
+
The **8B and 4B tie at 97.56%** with complementary failure patterns:
|
| 47 |
+
- 4B scores **100% on T13 SyncStore** (8B scored 50%) and **100% on T8 GlobalSignal/i18n** (8B scored 87.5%)
|
| 48 |
+
- 8B scores **100% on T12 Format Compliance** (4B scored 66.7%)
|
| 49 |
+
|
| 50 |
+
Pick by hardware: 4B (2.3 GB) if disk/RAM is tight with perfect SyncStore;
|
| 51 |
+
8B (4.7 GB) for best format compliance at moderate size; 15B (8.4 GB) for
|
| 52 |
+
the broadest Dioxus 0.7.4β0.7.9 surface coverage.
|
| 53 |
+
|
| 54 |
+
> **MLX format for v3.2** is available at `mlx-v3.2/` in this repo
|
| 55 |
> (7.7 GB, 4-bit quantized, 2 shards). v3.1 MLX remains at `mlx-v3.1/`.
|
| 56 |
|
| 57 |
## Install via Ollama
|
| 58 |
|
| 59 |
```bash
|
| 60 |
+
# 15B v3.2 β broadest Dioxus 0.7.4β0.7.9 surface
|
| 61 |
ollama pull rockypod/neotoi-coder:latest
|
| 62 |
ollama pull rockypod/neotoi-coder:15b # explicit size tag
|
| 63 |
|
| 64 |
+
# 8B v3.2 β highest raw score, ~40% faster than 15B, perfect format compliance
|
| 65 |
ollama pull rockypod/neotoi-coder:8b
|
| 66 |
|
| 67 |
+
# 4B v3.2 β disk / RAM constrained, perfect SyncStore
|
| 68 |
ollama pull rockypod/neotoi-coder:4b
|
| 69 |
```
|
| 70 |
|
| 71 |
+
Tags: `:latest` / `:15b`, `:8b`, `:4b`, `:v3.1` (archive). Each Modelfile
|
| 72 |
+
sets `num_ctx 8192`, `temperature 0.2`, and prefills `<think>` on the
|
| 73 |
+
assistant turn so Qwen3 native chain-of-thought emits by default.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 74 |
|
| 75 |
+
## v3.2 Scorecards (114Q, max 164.0)
|
| 76 |
|
| 77 |
+
### All-variant summary
|
|
|
|
| 78 |
|
| 79 |
+
| Variant | Score | Weighted | Raw | T12 Format | T13 SyncStore |
|
| 80 |
+
|---|---|---|---|---|---|
|
| 81 |
+
| **8B** | **97.56%** | 160.0 / 164.0 | 111 / 114 | β
100.0% | β οΈ 50.0% |
|
| 82 |
+
| **4B** | **97.56%** | 160.0 / 164.0 | 112 / 114 | β οΈ 66.7% | β
100.0% |
|
| 83 |
+
| **15B** | **95.12%** | 156.0 / 164.0 | 109 / 114 | β οΈ 83.3% | β οΈ 0.0% |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 84 |
|
| 85 |
+
### 15B scorecard
|
|
|
|
| 86 |
|
| 87 |
| Tier | Count | Max wt | Raw | Wtd | Rate | Floor | Status |
|
| 88 |
|---|---|---|---|---|---|---|---|
|
|
|
|
| 101 |
| **T13 SyncStore (NEW)** | 2 | 3.0 | 0 | 0.0 | 0.0% | 82% | β οΈ |
|
| 102 |
| **Total** | **114** | **164.0** | **109** | **156.0** | **95.12%** | β | β |
|
| 103 |
|
| 104 |
+
### 8B scorecard
|
| 105 |
+
|
| 106 |
+
| Tier | Count | Max wt | Raw | Wtd | Rate | Floor | Status |
|
| 107 |
+
|---|---|---|---|---|---|---|---|
|
| 108 |
+
| T1 Fundamentals | 12 | 12.0 | 12 | 12.0 | 100.0% | 82% | β
|
|
| 109 |
+
| T2 RSX Syntax | 12 | 12.0 | 11 | 11.0 | 91.7% | 82% | β
|
|
| 110 |
+
| T3 Signal Hygiene | 12 | 12.0 | 12 | 12.0 | 100.0% | 82% | β
|
|
| 111 |
+
| T4 WCAG / ARIA | 15 | 22.5 | 15 | 22.5 | **100.0%** | 82% | β
|
|
| 112 |
+
| T5 use_resource | 8 | 12.0 | 8 | 12.0 | 100.0% | 82% | β
|
|
| 113 |
+
| T6 Hard Reasoning | 10 | 20.0 | 10 | 20.0 | 100.0% | 88% | β
|
|
| 114 |
+
| T7 Primitives + CSS | 13 | 19.5 | 13 | 19.5 | **100.0%** | 82% | β
|
|
| 115 |
+
| T8 GlobalSignal / i18n | 8 | 12.0 | 7 | 10.5 | 87.5% | 82% | β
|
|
| 116 |
+
| T9 Static Navigator | 6 | 9.0 | 6 | 9.0 | 100.0% | 82% | β
|
|
| 117 |
+
| T10 Dioxus 0.7.4 | 6 | 12.0 | 6 | 12.0 | 100.0% | 88% | β
|
|
| 118 |
+
| T11 Server Functions | 4 | 6.0 | 4 | 6.0 | 100.0% | 82% | β
|
|
| 119 |
+
| **T12 Format Compliance** | 6 | 12.0 | 6 | 12.0 | **100.0%** | 88% | β
|
|
| 120 |
+
| **T13 SyncStore** | 2 | 3.0 | 1 | 1.5 | 50.0% | 82% | β οΈ |
|
| 121 |
+
| **Total** | **114** | **164.0** | **111** | **160.0** | **97.56%** | β | β |
|
| 122 |
+
|
| 123 |
+
T13 floor failure is structural β only 2 questions means any single miss = 50%.
|
| 124 |
+
|
| 125 |
+
### 4B scorecard
|
| 126 |
+
|
| 127 |
+
| Tier | Count | Max wt | Raw | Wtd | Rate | Floor | Status |
|
| 128 |
+
|---|---|---|---|---|---|---|---|
|
| 129 |
+
| T1 Fundamentals | 12 | 12.0 | 12 | 12.0 | 100.0% | 82% | β
|
|
| 130 |
+
| T2 RSX Syntax | 12 | 12.0 | 12 | 12.0 | 100.0% | 82% | β
|
|
| 131 |
+
| T3 Signal Hygiene | 12 | 12.0 | 12 | 12.0 | 100.0% | 82% | β
|
|
| 132 |
+
| T4 WCAG / ARIA | 15 | 22.5 | 15 | 22.5 | **100.0%** | 82% | β
|
|
| 133 |
+
| T5 use_resource | 8 | 12.0 | 8 | 12.0 | 100.0% | 82% | β
|
|
| 134 |
+
| T6 Hard Reasoning | 10 | 20.0 | 10 | 20.0 | 100.0% | 88% | β
|
|
| 135 |
+
| T7 Primitives + CSS | 13 | 19.5 | 13 | 19.5 | **100.0%** | 82% | β
|
|
| 136 |
+
| T8 GlobalSignal / i18n | 8 | 12.0 | 8 | 12.0 | **100.0%** | 82% | β
|
|
| 137 |
+
| T9 Static Navigator | 6 | 9.0 | 6 | 9.0 | 100.0% | 82% | β
|
|
| 138 |
+
| T10 Dioxus 0.7.4 | 6 | 12.0 | 6 | 12.0 | 100.0% | 88% | β
|
|
| 139 |
+
| T11 Server Functions | 4 | 6.0 | 4 | 6.0 | 100.0% | 82% | β
|
|
| 140 |
+
| **T12 Format Compliance** | 6 | 12.0 | 4 | 8.0 | 66.7% | 88% | β οΈ |
|
| 141 |
+
| **T13 SyncStore** | 2 | 3.0 | 2 | 3.0 | **100.0%** | 82% | β
|
|
| 142 |
+
| **Total** | **114** | **164.0** | **112** | **160.0** | **97.56%** | β | β |
|
| 143 |
+
|
| 144 |
+
T12 misses: q111 (old `cx.render` idiom + orphan `</think>`), q112 (missing `rsx!`).
|
| 145 |
+
The 4B also scores **100% on T8 GlobalSignal/i18n** where the 8B scored 87.5%.
|
| 146 |
+
|
| 147 |
+
## What's new in v3.2
|
| 148 |
+
|
| 149 |
+
### Score deltas vs v3.1
|
| 150 |
+
|
| 151 |
+
- **15B:** 94.81% β **95.12%** on a harder, longer exam (114Q vs 103Q,
|
| 152 |
+
max 164 vs 144.5, two new tiers). T4 WCAG/ARIA: **78.6% β 100.0%**.
|
| 153 |
+
- **8B:** 100.00% β **97.56%** β exam is harder (two new tiers added; both
|
| 154 |
+
are fresh weaknesses). T7 Primitives+CSS and T12 Format Compliance both hit
|
| 155 |
+
100% where the 15B scored 92.3% and 83.3%.
|
| 156 |
+
- **4B:** 99.31% β **97.56%** β same exam difficulty note. T13 SyncStore
|
| 157 |
+
hits 100% (a new tier the 8B misses entirely).
|
| 158 |
+
|
| 159 |
+
### New Dioxus 0.7 surface
|
| 160 |
+
|
| 161 |
+
v3.2 expands coverage from Dioxus 0.7.0 through **Dioxus 0.7.9** (full 0.7
|
| 162 |
+
series). New training topics:
|
| 163 |
+
|
| 164 |
+
- **T44 Scoped CSS and CSS modules** (Dioxus 0.7.3)
|
| 165 |
+
- **T45 SyncStore + `use_store_sync`** (Dioxus 0.7.2, cross-thread reactive state)
|
| 166 |
+
- **T46 New events:** `onauxclick`, `onscrollend` (0.7.3)
|
| 167 |
+
- **T47 Server-only extractors** + `serde_qs` query string support
|
| 168 |
+
- **T48 0.7.2 bug-fix awareness** β optional callback props, child router layouts, `use_drop` in prelude
|
| 169 |
+
- **T49 0.7.4 APIs:** `WritableResultExt`, WebSocket `Stream + Sink`, FFI for Kotlin/Java/Swift, iOS widget bundling
|
| 170 |
+
- **T50 0.7.6 RSX additions:** `inert` attribute, web panic resilience, `IntoAttributeValue` for `&T`, `Action::PartialEq`
|
| 171 |
+
- **T51 `use_context` vs `consume_context`** β panic-on-missing-provider semantics
|
| 172 |
+
|
| 173 |
+
### Eval-driven corrections (T52βT57)
|
| 174 |
+
|
| 175 |
+
- **T52 Format Compliance** β fenced-code-only outputs, no prose preamble, no orphan `</think>`
|
| 176 |
+
- **T53 Preserve-and-Append** β `.ftl` catalogs, `Cargo.toml`, route enums: add without replacing
|
| 177 |
+
- **T54 Dioxus 0.7 idiom reinforcement** β `Outlet::<Route>`, `t!()`, DaisyUI v5 / Tailwind v4
|
| 178 |
+
- **T55 WCAG / ARIA corrections** β drives the 78.6% β 100% jump on the 15B
|
| 179 |
+
- **T56 `dioxus-i18n` + Fluent** β `LanguageIdentifier`, catalog append
|
| 180 |
+
- **T57 Scope discipline** β answer exactly what was asked
|
| 181 |
+
|
| 182 |
+
### Dataset
|
| 183 |
+
|
| 184 |
+
- **5,287 curated examples** across **57 topics** (up from 4,880 / 43 in v3.1)
|
| 185 |
+
- Cross-stack contamination scan removed 489 rows: `fn app(` β `fn App(`, `launch(app)` β `launch(App)`, three `useEffect(` β `use_effect(` React leaks
|
| 186 |
|
| 187 |
## Version History
|
| 188 |
|
|
|
|
| 194 |
| v3.1 15B | Qwen3-Coder-14B (14.8B) | 137.0/144.5 (94.81%) | 103Q weighted, 11 tiers | 4,880 |
|
| 195 |
| v3.1 8B | Qwen3-8B (8.2B) | 144.5/144.5 (100.00%) | 103Q weighted, 11 tiers | 4,880 |
|
| 196 |
| v3.1 4B | Qwen3-4B (4.0B, tied) | 143.5/144.5 (99.31%) | 103Q weighted, 11 tiers | 4,880 |
|
| 197 |
+
| v3.2 15B | Qwen3-Coder-14B (14.8B) | 156.0/164.0 (95.12%) | 114Q weighted, 13 tiers | 5,287 |
|
| 198 |
+
| v3.2 8B | Qwen3-8B (8.2B) | 160.0/164.0 (97.56%) | 114Q weighted, 13 tiers | 5,287 |
|
| 199 |
+
| **v3.2 4B** | **Qwen3-4B (4.0B, tied)** | **160.0/164.0 (97.56%)** | **114Q weighted, 13 tiers** | **5,287** |
|
| 200 |
|
| 201 |
## Files in this repo (15B and historical)
|
| 202 |
|
|
|
|
| 208 |
| `neotoi-coder-v3-q4_k_m_patched.gguf` | GGUF Q4_K_M | 9 GB | v3.0 archive |
|
| 209 |
| `neotoi-coder-v2.0-q4_k_m.gguf` | GGUF Q4_K_M | 9 GB | v2.0 archive |
|
| 210 |
| `neotoi-coder-v1-q4_k_m_final.gguf` | GGUF Q4_K_M | 9 GB | v1.0 archive |
|
| 211 |
+
| `mlx-v3.1/` | MLX safetensors | β | v3.1 MLX archive |
|
| 212 |
| `mlx-v3/` | MLX safetensors | β | v3.0 MLX archive |
|
| 213 |
|
| 214 |
+
For the **8B v3.2** and **4B v3.2** Q4_K_M GGUFs, see their dedicated repos:
|
|
|
|
| 215 |
|
| 216 |
- https://huggingface.co/rockypod/neotoi-coder-8b
|
| 217 |
- https://huggingface.co/rockypod/neotoi-coder-4b
|
|
|
|
| 221 |
This model emits Qwen3 native `<think>...</think>` blocks. Thinking is on
|
| 222 |
by default with the `_patched.gguf` quants on inference backends that
|
| 223 |
honor `qwen3.thinking`.
|
| 224 |
+
|
| 225 |
+
## License
|
| 226 |
+
|
| 227 |
+
**Fine-tuned weights:** Neotoi Coder Community License v1.0 β commercial use
|
| 228 |
+
of outputs permitted, weight redistribution prohibited, mental health deployment
|
| 229 |
+
requires written permission. See [LICENSE](LICENSE).
|
| 230 |
+
|
| 231 |
+
**Base model:** [Qwen3-Coder-14B](https://huggingface.co/Qwen/Qwen3-Coder-14B) β Apache 2.0 Β© Alibaba Cloud.
|
| 232 |
+
|
| 233 |
+
Built on a homelab RTX 3090 Ti in Washington State.
|