dcostenco commited on
Commit
70fdf61
·
verified ·
1 Parent(s): 7b5e164

docs: correct 32B score to 3-seed mean 97.3% ± 0.6% (was 98% single seed)

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -24,7 +24,7 @@ LoRA fine-tune of **Qwen/QwQ-32B** for offline MCP tool routing — Synalux Copi
24
 
25
  | Category | Current | Previous (v19 old prompt) | Δ |
26
  |---|---|---|---|
27
- | **Overall** | **98%** | 93.7% | **+4.3** |
28
  | session_load_context | 100% | 100% | = |
29
  | session_save_ledger | 100% | 100% | = |
30
  | session_search_memory | 100% | 100% | = |
@@ -43,7 +43,7 @@ LoRA fine-tune of **Qwen/QwQ-32B** for offline MCP tool routing — Synalux Copi
43
 
44
  **Key improvement (May 14 2026)**: system prompt v26 eliminates Q4_K_M quantization artifacts where "plain text" was misread as a tool name. AAC routing jumped from 79% to 100% — critical for the life-critical AAC use case.
45
 
46
- **98% puts this model within 1 point of Claude Sonnet 4 (99%) on the same eval**, while running fully offline on a Mac.
47
 
48
  Only 2 misroutes in 100 cases: "Convert 'good morning' to Japanese" → brave_web_search (edge case), and a multi-step ledger query.
49
 
 
24
 
25
  | Category | Current | Previous (v19 old prompt) | Δ |
26
  |---|---|---|---|
27
+ | **Overall** | **97.3% ± 0.6%** | 93.7% | **+3.6** |
28
  | session_load_context | 100% | 100% | = |
29
  | session_save_ledger | 100% | 100% | = |
30
  | session_search_memory | 100% | 100% | = |
 
43
 
44
  **Key improvement (May 14 2026)**: system prompt v26 eliminates Q4_K_M quantization artifacts where "plain text" was misread as a tool name. AAC routing jumped from 79% to 100% — critical for the life-critical AAC use case.
45
 
46
+ **97.3% puts this model within 2 points of Claude Sonnet 4 (99%) on the same eval**, while running fully offline on a Mac.
47
 
48
  Only 2 misroutes in 100 cases: "Convert 'good morning' to Japanese" → brave_web_search (edge case), and a multi-step ledger query.
49