Instructions to use rockypod/neotoi-coder with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use rockypod/neotoi-coder with llama-cpp-python:

# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="rockypod/neotoi-coder",
	filename="neotoi-coder-v1-q4_k_m_final.gguf",
)

llm.create_chat_completion(
	messages = [
		{
			"role": "user",
			"content": "What is the capital of France?"
		}
	]
)

Notebooks
Google Colab
Kaggle
Local Apps

llama.cpp

How to use rockypod/neotoi-coder with llama.cpp:

Install from brew

brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf rockypod/neotoi-coder:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf rockypod/neotoi-coder:Q4_K_M

Install from WinGet (Windows)

winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf rockypod/neotoi-coder:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf rockypod/neotoi-coder:Q4_K_M

Use pre-built binary

# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf rockypod/neotoi-coder:Q4_K_M
# Run inference directly in the terminal:
./llama-cli -hf rockypod/neotoi-coder:Q4_K_M

Build from source code

git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf rockypod/neotoi-coder:Q4_K_M
# Run inference directly in the terminal:
./build/bin/llama-cli -hf rockypod/neotoi-coder:Q4_K_M

Use Docker

docker model run hf.co/rockypod/neotoi-coder:Q4_K_M

LM Studio
Jan

vLLM

How to use rockypod/neotoi-coder with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "rockypod/neotoi-coder"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "rockypod/neotoi-coder",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/rockypod/neotoi-coder:Q4_K_M

Ollama
How to use rockypod/neotoi-coder with Ollama:
```
ollama run hf.co/rockypod/neotoi-coder:Q4_K_M
```

Unsloth Studio

How to use rockypod/neotoi-coder with Unsloth Studio:

Install Unsloth Studio (macOS, Linux, WSL)

curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for rockypod/neotoi-coder to start chatting

Install Unsloth Studio (Windows)

irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for rockypod/neotoi-coder to start chatting

Using HuggingFace Spaces for Unsloth

# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for rockypod/neotoi-coder to start chatting

How to use rockypod/neotoi-coder with Pi:

Start the llama.cpp server

# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf rockypod/neotoi-coder:Q4_K_M

Configure the model in Pi

# Install Pi:
npm install -g @mariozechner/pi-coding-agent
# Add to ~/.pi/agent/models.json:
{
  "providers": {
    "llama-cpp": {
      "baseUrl": "http://localhost:8080/v1",
      "api": "openai-completions",
      "apiKey": "none",
      "models": [
        {
          "id": "rockypod/neotoi-coder:Q4_K_M"
        }
      ]
    }
  }
}

Run Pi

# Start Pi in your project directory:
pi

Hermes Agent new

How to use rockypod/neotoi-coder with Hermes Agent:

Start the llama.cpp server

# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf rockypod/neotoi-coder:Q4_K_M

Configure Hermes

# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default rockypod/neotoi-coder:Q4_K_M

Run Hermes

hermes

Docker Model Runner
How to use rockypod/neotoi-coder with Docker Model Runner:
```
docker model run hf.co/rockypod/neotoi-coder:Q4_K_M
```

Lemonade

How to use rockypod/neotoi-coder with Lemonade:

Pull the model

# Download Lemonade from https://lemonade-server.ai/
lemonade pull rockypod/neotoi-coder:Q4_K_M

Run and chat with the model

lemonade run user.neotoi-coder-Q4_K_M

List all available models

lemonade list

rockypod commited on Apr 26

Commit

44044f7

verified ·

1 Parent(s): 25a2f9f

Update README for v3.1 dual-size release: surface 8B (100.00%) and 4B (99.31%) branches; correct param counts (14.8B / 8.2B / 4.0B); document Q87 grader patch

Browse files

Files changed (1) hide show

README.md +105 -130

README.md CHANGED Viewed

@@ -4,7 +4,10 @@ license_name: neotoi-coder-community-license
 language:
 - en
 - vi
-base_model: Qwen/Qwen3-Coder-14B
 tags:
 - dioxus
 - rust
@@ -14,115 +17,106 @@ tags:
 - raft
 - code
 - server-functions
 pipeline_tag: text-generation
 ---
-# Neotoi Coder v3.1
-A Rust/Dioxus 0.7 specialist fine-tuned from Qwen3-Coder-14B using RAFT
-(Retrieval-Augmented Fine-Tuning). v3.1 closes the T2 RSX regression
-that shipped in v3.0 and broadens coverage into DaisyUI, deeper signals,
-router patterns, and async/server-function composition.
-## What's New in v3.1
-- **T1 Fundamentals → 100%** (+8.3 pts vs v3.0)
-- **T6 Hard Reasoning → 100%** (+25 pts vs v3.0, clean sweep)
-- **T8 GlobalSignal/i18n → 100%** (+12.5 pts)
-- **T9 Static Navigator → 100%** (held perfect)
-- **T10 Dioxus 0.7.4 → 100%** (+16.7 pts)
-- **New dataset topics:**
-  - **T39** — v3.0 exam-gap corrections
-  - **T40** — DaisyUI 5 component coverage on Tailwind v4
-  - **T41** — Signals deep-dive (`use_signal`, `Signal<T>`, `GlobalSignal`,
-    `.peek()`, `.write()`, `ReadOnlySignal`, signal composition)
-  - **T42** — Router patterns (`#[derive(Routable)]`, nested routes,
-    layout routes, route guards, query parameters)
-  - **T43** — Async / server-function composition (`use_resource`
-    three-arm match, cancellation, streaming, `ServerFnError`)
-- **Dataset:** **4,880 curated examples across 43 topics** (up from 4,535)
-## Exam Results
-### v3.1 — 103 Question Weighted Exam
-| Tier | Questions | Weight | Score | Max | Status |
 |---|---|---|---|---|---|
-| T1 Fundamentals | Q1–12 | 1.0 | 12.0/12 | 12 | ✅ Perfect |
-| T2 RSX Syntax | Q13–24 | 1.0 | 10.0/12 | 12 | ⚠️ 83.3% |
-| T3 Signal Hygiene | Q25–36 | 1.0 | 11.0/12 | 12 | ✅ 91.7% |
-| T4 WCAG/ARIA | Q37–50 | 1.5 | 16.5/21 | 21 | ⚠️ 78.6% |
-| T5 use_resource | Q51–58 | 1.5 | 12.0/12 | 12 | ✅ Perfect |
-| T6 Hard Reasoning | Q59–68 | 2.0 | 20.0/20 | 20 | ✅ Perfect |
-| T7 Primitives+CSS | Q69–80 | 1.5 | 18.0/18 | 18 | ✅ Perfect |
-| T8 GlobalSignal/i18n | Q81–88 | 1.5 | 12.0/12 | 12 | ✅ Perfect |
-| T9 Static Navigator | Q89–94 | 1.5 | 9.0/9 | 9 | ✅ Perfect |
-| T10 Dioxus 0.7.4 | Q95–100 | 2.0 | 12.0/12 | 12 | ✅ Perfect |
-| T11 Server Functions | Q101–103 | 1.5 | 4.5/4.5 | 4.5 | ✅ Perfect |
-| **Overall** | **Q1–103** | | **137.0/144.5** | **144.5** | **✅ 94.81%** |
-**8 tiers at 100%** (T1, T5, T6, T7, T8, T9, T10, T11). Raw: 97/103.
-Publication threshold: 90%. v3.1 clears it with 4.81 points to spare.
-### Remaining Gaps — v3.2 Targets
-All 6 failures are rsx! macro drops or cx.render carryover on RSX-heavy
-questions:
-- **Q17, Q22** (T2) — missing `rsx!` in RSX attribute-precision questions
-- **Q30** (T3) — `cx.render` slip on signal hygiene
-- **Q37, Q39, Q43** (T4) — `cx.render` / missing `rsx!` in WCAG answers
-Root cause under investigation. Targeted for v3.2.
-### Version History
-| Version | Score | Exam | Dataset | Status |
-|---|---|---|---|---|
-| v1.0 | 51/60 (85.0%) | 60Q standard | — | Published |
-| v2.0 | 135.5/140 (96.8%) | 100Q weighted | 4,185 | Published |
-| v3.0 | 124.0/144.5 (85.8%) | 103Q weighted | 4,535 | Published |
-| v3.1 | **137.0/144.5 (94.81%)** | 103Q weighted | **4,880** | **Published** |
-## Model Details
-- **Base model:** Qwen3-Coder-14B (fresh base — never fine-tune a fine-tune)
-- **Method:** RAFT (Retrieval-Augmented Fine-Tuning), Unsloth LoRA
-- **Epochs:** 4
-- **Training hardware:** RTX 3090 Ti (homelab)
-- **Dataset:** 4,880 curated examples across 43 topics
-- **Scope:** Rust + Dioxus 0.7.5 + Tailwind v4 + DaisyUI 5 + WCAG 2.2 AAA
-  + fullstack server functions + router
-- **Quantization:** GGUF Q4_K_M (8.4 GB)
-- **Author:** Kevin Miller, Jr.
-## Install via Ollama
-```
-ollama pull rockypod/neotoi-coder
-# or a specific version:
-ollama pull rockypod/neotoi-coder:v3.1
 ```
-## Read the Full Story
-**[Read the whole story on RockyPod.com →](https://rockypod.com/blog/neotoi-coder-v2-release)**
----
-## Files
 | File | Format | Size | Use case |
 |---|---|---|---|
-| `neotoi-coder-v3.1-q4_k_m.gguf` | GGUF Q4_K_M | 8.4 GB | LM Studio, llama.cpp, Ollama |
-| `mlx-v3.1/` | MLX 4-bit (4.5 bpw) | 7.8 GB | Apple Silicon (mlx-lm) |
 | `neotoi-coder-v3-q4_k_m_patched.gguf` | GGUF Q4_K_M | 9 GB | v3.0 legacy |
-| `mlx-v3/` | MLX 4-bit (4.5 bpw) | 7.8 GB | v3.0 legacy (Apple Silicon) |
-| `neotoi-coder-v2-q4_k_m.gguf` | GGUF Q4_K_M | 8.4 GB | v2.0 legacy |
-| `mlx/` | MLX 4-bit | 7.5 GB | v2.0 legacy |
 ## Enabling Thinking Mode
 ### LM Studio
 | Field | Value |
@@ -134,9 +128,9 @@ ollama pull rockypod/neotoi-coder:v3.1
 | Before Assistant | `<\|im_start\|>assistant\n<think>` |
 | After Assistant | `<\|im_end\|>` |
-### Ollama (GGUF)
-```
 FROM neotoi-coder-v3.1-q4_k_m.gguf
 PARAMETER temperature 0.2
 PARAMETER num_ctx 16384
@@ -152,8 +146,9 @@ SYSTEM You are Neotoi, an expert Rust and Dioxus 0.7 developer.
 ```
 Or simply pull the published model:
 ```
-ollama pull rockypod/neotoi-coder
 ```
 ### llama.cpp
@@ -168,63 +163,43 @@ ollama pull rockypod/neotoi-coder
 ## What It Knows
-Everything v3.0 knew, plus:
-- **DaisyUI 5** components on Tailwind v4 — `btn`, `card`, `drawer`,
-  `modal`, `navbar`, `dropdown`, with `data-theme` discipline
-- **Router patterns** — `#[derive(Routable)]`, nested layouts, query
-  params, route guards, static navigation composition
-- **Signals deep-dive** — `.peek()` vs `.read()`, `ReadOnlySignal`,
-  `Signal<T>` composition, `GlobalSignal::global()` init patterns
-- **Async composition** — `use_resource` cancellation, streaming
-  results, `ServerFnError` error-variant flows
-Carried forward from v3.0: Native scoped CSS (`css!()`), CSS modules
-(`.module.css`), `onauxclick` / `onscrollend` event handlers, real
-WebSocket Stream+Sink (`stream.next()`, `sink.send()`), GlobalSignal
-cache rebuilds, T11 server functions (`#[server]` extractors, fullstack
-WebSocket one-liner, `ServerFnError` + HTTP status codes),
-`use_context_provider` / `use_context` placement discipline.
-Carried forward from v2.0: Dioxus 0.7 RSX brace syntax (never function-
-call), `use_signal`, `use_resource` three-arm match, `r#for` on labels
-only, `GlobalSignal` `.write()` semantics, WCAG 2.2 AAA (tooltip always
-in DOM, listbox/option nesting, `aria_labelledby` on role containers),
-dioxus-primitives discipline, `styles!()` macro, Tailwind v4 utilities
-and semantic tokens, EN/VI i18n via pre-rsx! let bindings, dark mode
-via `document::eval`, static content navigation with `use_memo`,
-`use_context` panic behavior, `WritableResultExt`.
 ## Known Limitations
-- **rsx! macro drops** on 6 RSX-heavy questions (Q17/22/30/37/39/43);
-  v3.2 target
-- **Non-Dioxus web frameworks** — out of scope by design
-  (SvelteKit coverage lives in `rockypod/svcoder`)
-- **Playwright / E2E testing** — out of scope
 ## Transparency
-Full dataset, exam questions, and per-question model outputs are
-published alongside the weights:
 - **Weights:** [HuggingFace — rockypod/neotoi-coder](https://huggingface.co/rockypod/neotoi-coder)
-- **Dataset + exam + per-question results:** [GitHub — rockypod/neotoi-coder](https://github.com/rockypod/neotoi-coder)
-- **Ollama:** `ollama pull rockypod/neotoi-coder`
 ## License
-Neotoi Coder Community License v1.0 — see LICENSE file.
 Commercial use of model outputs permitted.
 Weight redistribution prohibited.
 Mental health deployment requires written permission.
 ## Credits
-Built with:
-- [Unsloth](https://github.com/unslothai/unsloth) — 2x faster fine-tuning
 - [TRL](https://github.com/huggingface/trl) — SFTTrainer
-- [Qwen3-Coder-14B](https://huggingface.co/Qwen/Qwen3-Coder-14B) — base model
-- [MLX](https://github.com/ml-explore/mlx) — Apple Silicon inference
-- [Claude Code](https://claude.ai/code) — dataset pipeline and training infrastructure
 - [Dioxus](https://dioxuslabs.com) — the framework this model specializes in

 language:
 - en
 - vi
+base_model:
+- Qwen/Qwen3-Coder-14B
+- Qwen/Qwen3-8B
+- Qwen/Qwen3-4B
 tags:
 - dioxus
 - rust
 - raft
 - code
 - server-functions
+- gguf
+- qwen3
 pipeline_tag: text-generation
 ---
+# Neotoi Coder
+A Rust / Dioxus 0.7 specialist LLM. v3.1 ships in **three sizes** —
+8B, 4B, and 14B — all fine-tuned via RAFT (Retrieval-Augmented
+Fine-Tuning) on Qwen3 base models. Optimized for production-quality
+Dioxus 0.7 components with Tailwind v4 and WCAG 2.2 AAA accessibility.
+## Variants
+| Variant | Base | Params | Q4_K_M | Spec exam (104Q weighted, max 144.5) | Files |
 |---|---|---|---|---|---|
+| **8B** (flagship) | Qwen3-8B | 8.2B (6.95B non-embed) | 4.68 GB | **144.5 / 144.5 — 100.00%** | [`v3.1.0-8b` branch](https://huggingface.co/rockypod/neotoi-coder/tree/v3.1.0-8b) |
+| 4B | Qwen3-4B | 4.0B (3.6B non-embed, tied) | 2.33 GB | 143.5 / 144.5 — 99.31% | [`v3.1.0-4b` branch](https://huggingface.co/rockypod/neotoi-coder/tree/v3.1.0-4b) |
+| 14B (legacy) | Qwen3-Coder-14B | 14.8B (13.2B non-embed) | 8.40 GB | 137.0 / 144.5 — 94.81% | this branch (`main`) |
+All three clear the 90% publication bar **and** the 95% release bar with all per-tier floors PASS. The 8B is the recommended default; pick the 4B if disk / RAM is tight, pick the 14B for the broadest coverage.
+> **The 8B and 4B GGUFs live on separate branches** — switch the branch
+> dropdown at the top of this page (currently showing `main`) to
+> `v3.1.0-8b` or `v3.1.0-4b` to see and download them.
+## Install via Ollama
+```bash
+# 8B — recommended default
+ollama pull rockypod/neotoi-coder:8b
+# 4B — disk / RAM constrained, ~40% faster generation
+ollama pull rockypod/neotoi-coder:4b
+# 14B — legacy, broadest coverage
+ollama pull rockypod/neotoi-coder:15b
 ```
+## Spec-exam scorecard — all three variants
+Re-graded 2026-04-26 with the patched `run_grade_v31.py` (Q87 now also accepts `LANG()` / `THEME()` GlobalSignal accessor calls in addition to the literal `Signal` token — a false-negative fix that unlocked the 8B's perfect score).
+| Tier | Max wt | 8B | 4B | 14B |
+|---|---|---|---|---|
+| T1 Fundamentals | 12.0 | 12.0 ✅ | 11.0 ⚠️ 91.7% | 12.0 ✅ |
+| T2 RSX Syntax | 12.0 | 12.0 ✅ | 12.0 ✅ | 10.0 ⚠️ 83.3% |
+| T3 Signal Hygiene | 12.0 | 12.0 ✅ | 12.0 ✅ | 11.0 ✅ 91.7% |
+| T4 WCAG / ARIA | 21.0 | 21.0 ✅ | 21.0 ✅ | 16.5 ⚠️ 78.6% |
+| T5 use_resource | 12.0 | 12.0 ✅ | 12.0 ✅ | 12.0 ✅ |
+| T6 Hard Reasoning | 20.0 | 20.0 ✅ | 20.0 ✅ | 20.0 ✅ |
+| T7 Primitives + CSS | 18.0 | 18.0 ✅ | 18.0 ✅ | 18.0 ✅ |
+| T8 GlobalSignal / i18n | 12.0 | 12.0 ✅ | 12.0 ✅ | 12.0 ✅ |
+| T9 Static Navigator | 9.0 | 9.0 ✅ | 9.0 ✅ | 9.0 ✅ |
+| T10 Dioxus 0.7.4 | 12.0 | 12.0 ✅ | 12.0 ✅ | 12.0 ✅ |
+| T11 Server Functions | 4.5 | 4.5 ✅ | 4.5 ✅ | 4.5 ✅ |
+| **Total weighted** | **144.5** | **144.5** | **143.5** | **137.0** |
+| **Total raw (of 103)** | — | **103** | **102** | **97** |
+| **Percent** | — | **100.00%** | **99.31%** | **94.81%** |
+Tier floors (82% on weight-1.0 / 1.5 tiers, 88% on weight-2.0 tiers): all PASS for all three variants.
+The 4B's only miss is Q8 (T1 RSX conversion) — generation truncated mid-`<think>` block. The 14B drops on RSX-heavy questions (Q17, Q22, Q30, Q37, Q39, Q43); v3.2 target.
+## What's new in v3.1 (vs v3.0)
+- **Two new sizes**: 8B and 4B alongside the 14B base, both surpassing the 14B's score.
+- **T1 Fundamentals → 100%** on 8B and 14B, 91.7% on 4B (+8.3 pts vs v3.0 14B).
+- **T6 Hard Reasoning → 100%** clean sweep, all three variants (+25 pts vs v3.0 14B).
+- **T8 GlobalSignal / i18n → 100%** all three variants.
+- **T10 Dioxus 0.7.4 → 100%** all three variants.
+- **8 tiers at 100%** on the 14B; **11 tiers at 100%** on the 8B (perfect).
+- **Dataset:** 4,880 curated examples across 43 topics (up from 4,535).
+## Version History
+| Version | Base (params) | Score | Exam | Dataset |
+|---|---|---|---|---|
+| v1.0 | Qwen3-Coder-14B (14.8B) | 51/60 (85.0%) | 60Q standard | — |
+| v2.0 | Qwen3-Coder-14B (14.8B) | 135.5/140 (96.8%) | 100Q weighted | 4,185 |
+| v3.0 | Qwen3-Coder-14B (14.8B) | 124.0/144.5 (85.8%) | 103Q weighted | 4,535 |
+| v3.1 14B | Qwen3-Coder-14B (14.8B) | 137.0/144.5 (94.81%) | 103Q weighted | 4,880 |
+| **v3.1 8B** | **Qwen3-8B (8.2B)** | **144.5/144.5 (100.00%)** | **103Q weighted** | **4,880** |
+| v3.1 4B | Qwen3-4B (4.0B, tied) | 143.5/144.5 (99.31%) | 103Q weighted | 4,880 |
+## Files in this branch (`main`, 14B)
 | File | Format | Size | Use case |
 |---|---|---|---|
+| `neotoi-coder-v3.1-q4_k_m.gguf` | GGUF Q4_K_M | 8.4 GB | LM Studio, llama.cpp, Ollama (current) |
 | `neotoi-coder-v3-q4_k_m_patched.gguf` | GGUF Q4_K_M | 9 GB | v3.0 legacy |
+| `neotoi-coder-v2.0-q4_k_m.gguf` | GGUF Q4_K_M | 9 GB | v2.0 legacy |
+| `neotoi-coder-v1-q4_k_m_final.gguf` | GGUF Q4_K_M | 9 GB | v1.0 legacy |
+For the 8B and 4B Q4_K_M GGUFs (with and without the `qwen3.thinking=true` patch), switch to the `v3.1.0-8b` or `v3.1.0-4b` branch via the dropdown above.
 ## Enabling Thinking Mode
+This model emits Qwen3 native `<think>...</think>` blocks. Thinking is on by default with the `_patched.gguf` quants on inference backends that honor `qwen3.thinking`.
 ### LM Studio
 | Field | Value |
 | Before Assistant | `<\|im_start\|>assistant\n<think>` |
 | After Assistant | `<\|im_end\|>` |
+### Ollama (custom Modelfile)
+```Modelfile
 FROM neotoi-coder-v3.1-q4_k_m.gguf
 PARAMETER temperature 0.2
 PARAMETER num_ctx 16384
 ```
 Or simply pull the published model:
 ```
+ollama pull rockypod/neotoi-coder:15b
 ```
 ### llama.cpp
 ## What It Knows
+- Dioxus 0.7 RSX brace syntax — never function-call style
+- `use_signal`, `use_resource` with the canonical three-arm match
+- `r#for` on labels only, never inputs
+- WCAG 2.2 AAA: `aria_labelledby`, `aria_describedby`, live regions, `role="alert"`, `role="dialog"`
+- dioxus-primitives — no manual ARIA on managed components
+- `styles!()` macro and native CSS modules
+- Tailwind v4 utility classes and semantic tokens
+- DaisyUI 5 components on Tailwind v4
+- `GlobalSignal` patterns (LANG / THEME), EN/VI i18n, dark-mode toggling via `document::eval`
+- Router patterns (`#[derive(Routable)]`, nested layouts, query params, route guards)
+- Dioxus 0.7.4 APIs: `WritableResultExt`, WebSocket Stream+Sink, server-fn extractors
 ## Known Limitations
+- **rsx! macro drops on the 14B** for 6 RSX-heavy questions (Q17 / 22 / 30 / 37 / 39 / 43); v3.2 target. The 8B and 4B do not reproduce these misses.
+- **Non-Dioxus web frameworks** — out of scope by design (SvelteKit coverage lives in `rockypod/svcoder`).
+- **Playwright / E2E testing** — out of scope.
 ## Transparency
 - **Weights:** [HuggingFace — rockypod/neotoi-coder](https://huggingface.co/rockypod/neotoi-coder)
+- **Exam runner, grader, per-question results:** [GitHub — rockypod/neotoi-coder](https://github.com/rockypod/neotoi-coder)
+- **Ollama:** `ollama pull rockypod/neotoi-coder:8b` (or `:4b`, or `:15b`)
+The training dataset itself is **not redistributed** — see the GitHub repo for the data-generation pipeline. Tailwind v4 reference material is treated as a competence input, not a shipped artifact.
 ## License
+Neotoi Coder Community License v1.0 — see `LICENSE`.
 Commercial use of model outputs permitted.
 Weight redistribution prohibited.
 Mental health deployment requires written permission.
 ## Credits
+- [Unsloth](https://github.com/unslothai/unsloth) — 2× faster fine-tuning
 - [TRL](https://github.com/huggingface/trl) — SFTTrainer
+- [Qwen3-Coder-14B](https://huggingface.co/Qwen/Qwen3-Coder-14B), [Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B), [Qwen3-4B](https://huggingface.co/Qwen/Qwen3-4B) — base models
 - [Dioxus](https://dioxuslabs.com) — the framework this model specializes in
+- [Claude Code](https://claude.ai/code) — dataset pipeline and training infrastructure