crystalballAI committed · Commit 5e686cb · verified · 1 Parent(s): 01a97a4

Add README

  1. README.md +31 -0
README.md ADDED
@@ -0,0 +1,31 @@
+ ---
+ license: apache-2.0
+ tags:
+ - krystalmind
+ - qwen3
+ - routing
+ - pharma-crm
+ - json-planner
+ base_model: Qwen/Qwen3-0.6B-Instruct
+ ---
+
+ # KrystalMind Cortex (0.6B JSON Planner)
+
+ A structured JSON action planner for the L4 intelligence pipeline. It routes divergence signals to the appropriate data sources.
+
+ ## Architecture
+ - **Base**: Qwen3-0.6B-Instruct (28 layers, d_model=1024, 16 query / 8 KV heads)
+ - **LoRA SFT**: rank=16, scale=1.0, 1,000 iterations on routing_sft_v1 (14,836 examples)
+ - **Output**: Structured JSON plan (db_queries, ecs_metrics, ssm_context, simulation)
+ - **Decoding**: Greedy (T=0.0), stopping once the JSON brace depth returns to zero
+
+ ## Files
+ - `krystalball_cortex.kb` — Baked production model (571 MB, FP32)
+ - `fused/model.safetensors` — Full fused Qwen3-0.6B + LoRA (1.1 GB)
+ - `adapters/adapters.safetensors` — LoRA adapters only (10 MB)
+ - `tokenizer.json` — Qwen3 tokenizer
+
+ ## Performance
+ - **INT8**: ~86 tok/s, **BF16**: ~53 tok/s
+ - **Typical plan**: 150 tokens with greedy decoding in ~1.75 s (about 86 tok/s)
+ - **3-tier parse robustness**: direct JSON parse → repair → deterministic fallback
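
The README names a brace-depth stop and a three-tier parse but does not show either. The following is a minimal Python sketch of how the two ideas could fit together, not the project's actual implementation. The function names (`cut_at_balanced_json`, `repair_json`, `parse_plan`), the specific repair heuristics, and the shape of the fallback plan (empty lists under the four output keys) are all assumptions made for illustration.

```python
import json
import re
from typing import Any, Dict, Optional

# Output keys listed in the README; using empty lists as the fallback
# value for each key is an assumption of this sketch.
PLAN_KEYS = ("db_queries", "ecs_metrics", "ssm_context", "simulation")


def cut_at_balanced_json(text: str) -> Optional[str]:
    """Return the first complete top-level {...} object, or None if it never closes."""
    start = text.find("{")
    if start < 0:
        return None
    depth = 0
    in_string = False
    escaped = False
    for i in range(start, len(text)):
        ch = text[i]
        if in_string:
            # Braces inside string literals must not change the depth.
            if escaped:
                escaped = False
            elif ch == "\\":
                escaped = True
            elif ch == '"':
                in_string = False
        elif ch == '"':
            in_string = True
        elif ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth == 0:
                return text[start:i + 1]
    return None


def repair_json(text: str) -> str:
    """Hypothetical repair pass: close open strings/brackets, drop trailing commas."""
    start = text.find("{")
    body = text[start:] if start >= 0 else text
    closers = []
    in_string = False
    escaped = False
    for ch in body:
        if in_string:
            if escaped:
                escaped = False
            elif ch == "\\":
                escaped = True
            elif ch == '"':
                in_string = False
        elif ch == '"':
            in_string = True
        elif ch in "{[":
            closers.append("}" if ch == "{" else "]")
        elif ch in "}]" and closers:
            closers.pop()
    if in_string:
        body += '"'
    body += "".join(reversed(closers))
    # Naive: this would also rewrite a ", }" sequence inside a string value.
    return re.sub(r",\s*([}\]])", r"\1", body)


def parse_plan(raw: str) -> Dict[str, Any]:
    """Tier 1: direct parse. Tier 2: repair then parse. Tier 3: fixed fallback."""
    candidate = cut_at_balanced_json(raw)
    for attempt in (candidate, repair_json(raw)):
        if attempt is None:
            continue
        try:
            plan = json.loads(attempt)
        except json.JSONDecodeError:
            continue
        if isinstance(plan, dict):
            return plan
    return {key: [] for key in PLAN_KEYS}
```

In a streaming decoder the same depth counter could be updated once per generated token so that generation halts as soon as the depth returns to zero. The sketch applies it to a finished string only to stay self-contained.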