dcostenco committed on
Commit 1dde642 · 1 Parent(s): 4ed0ef7

docs: reframe with category-leadership story + Prism v14.0.0 algorithm citation

Same honest framing as the 7B card: strong on AAC (95.8%), Irrelevance Detection
(91.86%), Non-Live AST (55.83%), HIPAA-safe on-device. NOT competing on BFCL V4
overall — frontier 70B+ models win that, and 14B class can't match. Pick by category.

Adds explicit link to Prism v14.0.0 algorithm-stability contract.

Files changed (1)
  1. README.md +16 -1
README.md CHANGED
@@ -28,10 +28,25 @@ tags:
  - conversational
  ---
 
- # Prism-Coder 14B — Function Calling + AAC Sibling (32K context)
+ # Prism-Coder 14B — On-Device AAC + Tool-Calling Sibling (32K context)
+
+ **Category leadership story, honestly:**
+
+ | You should pick this 14B if you need… | Score |
+ |---|---|
+ | **AAC realignment** for an Augmentative & Alternative Communication app | **46/48 (95.8%)** |
+ | **Non-Live AST tool-call** (parsed function signatures, multi-arg) | **55.83%** |
+ | **Live tool-call accuracy** in real-world agent prompts | **45.23%** |
+ | **Multi-tool-call** parsing in a single user turn | **44.35%** Live Multiple AST |
+ | **Irrelevance detection** — knowing when NOT to call a tool | **91.86%** ← strong |
+ | HIPAA-safe **on-device** inference at 14B-class capacity (Mac / RTX 30+) | runs on 16-24 GB RAM (Q4) |
+
+ **Not a leadership story on BFCL V4 overall** — frontier 70B+ models (GPT-4o, Claude Opus, Llama 3.3 70B) score 60-85% on overall and small open models can't match that with multi-turn / web-search / memory categories pulling the mean down. We score 19.29% overall, in line with the rest of the 14B class. **Pick by category fit.**
 
  A fine-tune of **Qwen2.5-Coder-14B-Instruct** released **2026-05-04** as a sibling to [`prism-coder-7b`](https://huggingface.co/dcostenco/prism-coder-7b). Auto-routed for paid-tier medium-length AAC queries via the Synalux portal — keeps inference local on cloud GPU pool, $0 marginal cost vs Claude/Gemini.
 
+ **Powered by [Prism v14.0.0 algorithms](https://github.com/dcostenco/prism-coder/blob/main/docs/WOW_FEATURES.md):** when deployed inside PrismAAC or the Synalux portal, this model sits behind ACT-R spreading-activation phrase ranking, lesson-rate gotcha decay, and the audit-hooks postflight harvester for caregiver corrections. Model + algorithm stack together is the product.
+
  ## Sibling positioning
 
  | Model | Use case | Context | RAM (Q4) |