tjarvis91 commited on
Commit
22a8107
·
verified ·
1 Parent(s): ddd9cb1

Initial release: q-coder sovereign specialist

Browse files
Files changed (4) hide show
  1. README.md +146 -0
  2. pytorch_model.pt +3 -0
  3. release.json +21 -0
  4. tokenizer.json +0 -0
README.md ADDED
@@ -0,0 +1,146 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ base_model: tjarvis91/qovaryx-50m-scratch-base
6
+ base_model_relation: finetune
7
+ library_name: pytorch
8
+ pipeline_tag: text-generation
9
+ tags:
10
+ - text-generation
11
+ - qovaryx
12
+ - compact-cognition
13
+ - local-ai
14
+ - code
15
+ - python
16
+ - code-generation
17
+ - sovereign-base
18
+ ---
19
+
20
+ # Q-Coder-50M-Sovereign — Python code one-liners + small function skeletons
21
+
22
+ ## Proprietary Qovaryx technology — built on our own scratch base
23
+
24
+ This is a **53.5M-parameter sovereign specialist** in the Qovaryx Compact
25
+ Specialist Suite. It is full-fine-tuned from
26
+ [`tjarvis91/qovaryx-50m-scratch-base`](https://huggingface.co/tjarvis91/qovaryx-50m-scratch-base) —
27
+ **our own scratch-trained base, not a borrowed foundation model**.
28
+
29
+ - **Base:** Qovaryx 50M scratch base. Pretrained from random initialization on
30
+ 491.5M tokens of curated text. **Not SmolLM2. Not Qwen. Not Llama. Not Mistral. Not Phi.**
31
+ No HuggingFace base. No closed-source weights. Every parameter in this checkpoint
32
+ traces back to a Qovaryx training run on Qovaryx hardware.
33
+ - **Tokenizer:** Qovaryx english_v1 BPE (vocab 32000), built in-house against our
34
+ pretraining corpus. **Not the SmolLM2 tokenizer. Not the Llama tokenizer.**
35
+ - **Architecture:** Qovaryx FinanceDecoder — 12 decoder blocks, GQA, RoPE,
36
+ SwiGLU FFN, RMSNorm, MTP heads, decision head. Designed in the Bleeding Edge
37
+ research line for compact local-sovereign cognition.
38
+ - **Recipe:** Qovaryx crystallization corpus discipline — train the law before
39
+ replaying the noise. See the [public research devlog](https://github.com/thron-j/qovaryx-ai-research)
40
+ for the architectural framing.
41
+ - **Runs on CPU.** No GPU required at inference.
42
+
43
+ The entire stack — base, tokenizer, model class, training recipe, eval gate,
44
+ crystal corpus — is Qovaryx proprietary technology. The decision to publish
45
+ the **weights and the audit** under Apache 2.0 is deliberate; the build pipeline
46
+ and the corpus stay private.
47
+
48
+ ## What this is
49
+
50
+ Given a short natural-language Python task, returns the smallest correct Python expression or function that solves it. Trained on aggregate ops (sum/min/max/len/avg over named lists), string ops (reverse/upper/lower/title/palindrome), list comprehensions (even/odd/positive/squares/doubles), dict .get(default), small function definitions, try/except wrappers, class skeletons, and basic file I/O. Designed for fast structured code emission, not free-form programming.
51
+
52
+ ## What this is NOT
53
+
54
+ - **Not a general-purpose chatbot.** This head does one job. Free-text generation outside
55
+ the trained task surface is not supported and will degrade.
56
+ - **Not reproducible from scratch.** The crystal corpus, the eval gate
57
+ constants, and the training hyperparameters are intentionally not published.
58
+ - **Not a replacement for a verifier.** This is one component in the
59
+ Qovaryx [cluster-shell](https://github.com/thron-j/qovaryx-ai-research)
60
+ architecture. The decision-acceptance discipline lives in the wrapper, not
61
+ in the head.
62
+
63
+ ## Honest performance
64
+
65
+ - **Task:** compact Python code generation
66
+ - **Metric:** `exact_match` (string-equal after strip + lowercase)
67
+ - **Holdout:** n=53 (date-disjoint, never seen in training)
68
+ - **Score:** **100.0%** mean
69
+ - **Bootstrap CI 95% lower bound:** 1.000
70
+ - **Gate threshold:** 0.90
71
+ - **Verdict:** PASS at both point estimate and CI lower bound
72
+
73
+ ## Example
74
+
75
+ ```
76
+ USER: Define a function `square` that returns x squared.
77
+
78
+ ASSISTANT: def square(x):
79
+ return x * x
80
+ ```
81
+
82
+ ## Architecture (Qovaryx proprietary)
83
+
84
+ - 53.5M parameters
85
+ - 12 decoder blocks, d_model=512, n_head=8, GQA n_kv_head=2
86
+ - SwiGLU FFN, RoPE positional, RMSNorm
87
+ - Multi-token prediction (MTP) auxiliary heads
88
+ - Decision head for routed-decision tasks
89
+ - Tokenizer: Qovaryx `english_v1` BPE, vocab 32000 (in-house build)
90
+ - Pretrained from `qovaryx-50m-scratch-base` step 60000 — 491.5M tokens, our scratch
91
+ lineage from random initialization
92
+ - Full fine-tune (no LoRA, no QLoRA, no adapter): every parameter was updated
93
+ on the Qovaryx crystal corpus for this specialist
94
+
95
+ ## How to use
96
+
97
+ ```python
98
+ import torch
99
+ from tokenizers import Tokenizer
100
+ from bleeding_edge.model.decoder import FinanceDecoder, DecoderConfig
101
+
102
+ tok = Tokenizer.from_file("tokenizer.json")
103
+ ckpt = torch.load("pytorch_model.pt", map_location="cpu", weights_only=False)
104
+ cfg = DecoderConfig(**{k: v for k, v in ckpt["model_cfg"].items()
105
+ if k in DecoderConfig.__dataclass_fields__})
106
+ cfg.vocab_size = tok.get_vocab_size()
107
+ model = FinanceDecoder(cfg).eval()
108
+ state = {k.removeprefix("_orig_mod."): v for k, v in ckpt["model_state"].items()}
109
+ model.load_state_dict(state, strict=False)
110
+
111
+ prompt = "Define a function `square` that returns x squared."
112
+ ids = tok.encode(prompt).ids
113
+ cur = torch.tensor([ids], dtype=torch.long)
114
+ with torch.no_grad():
115
+ for _ in range(80):
116
+ nxt = int(torch.argmax(model(cur, return_decision=False).logits[:, -1, :], dim=-1))
117
+ if nxt == 0: break
118
+ cur = torch.cat([cur, torch.tensor([[nxt]])], dim=1)
119
+ print(tok.decode(cur[0].tolist()[len(ids):]))
120
+ ```
121
+
122
+ The `bleeding_edge` package is open-source at
123
+ [github.com/thron-j/qovaryx-ai-research](https://github.com/thron-j/qovaryx-ai-research)
124
+ (architecture notes only; full source ships with the Qovaryx runtime).
125
+
126
+ ## License & posture
127
+
128
+ Apache 2.0 for the published weights, model card, and example code.
129
+
130
+ The Qovaryx scratch base, the crystallization corpus, the eval gate constants,
131
+ the cluster routing policy, and the training pipeline are **Qovaryx proprietary
132
+ technology** and are not included in this release. This is the same posture as
133
+ the rest of the Qovaryx public catalog: ship the weights and the audit, not
134
+ the recipe.
135
+
136
+ ## Sibling specialists
137
+
138
+ The other heads in the Qovaryx Compact Specialist Suite share the same base
139
+ and audit discipline. See the
140
+ [Qovaryx research devlog](https://github.com/thron-j/qovaryx-ai-research)
141
+ for the full cluster framing.
142
+
143
+ ## Watermark
144
+
145
+ This release carries a SHA256 issue fingerprint inside `model_cfg._qovaryx_watermark`
146
+ for tamper-detection and attribution. See `release.json` for the canonical record.
pytorch_model.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:68cf9c9a56a2a7cfe850016b7f6cbbcedbfa28404a376b00fc6d27196fd5a975
3
+ size 214021799
release.json ADDED
@@ -0,0 +1,21 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "specialist": "q-coder",
3
+ "hf_repo": "tjarvis91/Q-Coder-50M-Sovereign",
4
+ "release_id": "qovaryx-sovereign-2026-06-02",
5
+ "base_model": "tjarvis91/qovaryx-50m-scratch-base",
6
+ "metric": {
7
+ "name": "exact_match",
8
+ "mean": 1.0,
9
+ "ci_lower": 1.0,
10
+ "n_holdout": 53
11
+ },
12
+ "watermark": {
13
+ "issuer": "Qovaryx AI / Thomas Jarvis",
14
+ "specialist": "q-coder",
15
+ "release_id": "qovaryx-sovereign-2026-06-02",
16
+ "released_at": "2026-06-02T08:35:45Z",
17
+ "fingerprint": "4c167a5bdf82bb30a54056021790f74a852db0fff777f0b95daf19230166967a",
18
+ "base_model": "tjarvis91/qovaryx-50m-scratch-base",
19
+ "policy": "This checkpoint is a sovereign Qovaryx specialist. It is full-fine-tuned from qovaryx-50m-scratch-base. Redistribution allowed under Apache 2.0. Fingerprint is for downstream attribution and tamper-detection."
20
+ }
21
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff