File size: 5,774 Bytes
8637fdb
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
d95f073
 
30902a3
 
d95f073
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
97fc8fc
 
d95f073
97fc8fc
 
 
d95f073
 
 
 
 
97fc8fc
 
 
 
d95f073
 
 
 
 
 
 
 
 
 
 
97fc8fc
 
 
d95f073
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
---
license: apache-2.0
base_model: Qwen/Qwen3.6-27B
language:
  - en
library_name: peft
pipeline_tag: text-generation
tags:
  - kaiju-coder-7
  - lora
  - coding
  - local-ai
  - business
  - opencode
---

# Kaiju Coder 7 by Kiyomi - Adapter Model Card

![RMDW logo](assets/RMDWlogo.png)

This model card is for the LoRA adapter package, not a standalone base model.

## Summary

Kaiju Coder 7 by Kiyomi is an RMDW/Kiyomi business-owner coding adapter trained on reviewed, RMDW-owned or RMDW-authored examples. It is designed for practical small-business build work: websites, proposals, intake/CRM flows, Stripe/payment implementation planning, reports, ROI dashboards, automations, operator handbooks, lead generation, sales follow-up, repo patches, and Kiyomi 7.7.7 style AI-company setup packs.

The current release-candidate product path is:

```text
Qwen3.6-27B base
-> Kaiju v1.8 LoRA adapter
-> merged full-model artifact for raw local serving
-> Kaiju system prompt
-> deterministic business-owner harnesses
-> verifier/static checks
```

Do not describe this package as raw weights alone producing every final artifact. The deterministic harness is part of the tested product path.

## Base Model

- Base model: `Qwen/Qwen3.6-27B`
- Checked upstream revision: `6a9e13bd6fc8f0983b9b99948120bc37f49c13e9`
- Upstream license metadata: `apache-2.0`
- Upstream license copy: `release/upstream/qwen3.6-27b/LICENSE`

Attribution wording:

```text
Kaiju Coder 7 by Kiyomi is fine-tuned from Qwen under Apache 2.0.
```

Do not imply endorsement by Qwen, Alibaba, or upstream authors.

## Adapter

- Adapter path: `runs/qwen36-27b-lora-v1.8-business-owner/adapter`
- Adapter type: LoRA / PEFT
- LoRA rank: `16`
- LoRA alpha: `32`
- LoRA dropout: `0.02`
- Target modules: `q_proj`, `k_proj`, `v_proj`, `o_proj`, `gate_proj`, `up_proj`, `down_proj`
- Trainable parameter count: approximately `79.7M`

## Merged Local Artifact

- Remote merged path: `/home/richardecholsai5/kaiju-coder/models/Kaiju-Coder-Qwen3.6-27B-v1.8-merged`
- Size: `51G`
- Shards: `14` safetensor shards plus tokenizer/config sidecars
- Served model name: `kaiju-coder-7`
- Merge script: `scripts/run-gojira-b-qwen36-lora-merge.sh`
- Serving script: `scripts/start-qwen36-merged-sglang.sh`

## Training

- Dataset build: `datasets/build/kaiju-sft-v1.7-business-owner-oversampled.jsonl`
- Reviewed candidate examples: `1,689`
- SFT rows after controlled business-owner oversampling: `1,881`
- Train examples: `1,769`
- Eval examples: `112`
- Training runtime: `11666.7564s`
- Training loss: `0.9281658741335074`
- Max training length: `2048`
- Training config: `training/configs/qwen36-27b-lora-v1.8-business-owner.example.json`

## Data Provenance

Training data is source-backed and RMDW-owned or RMDW-authored. Client-site repositories are used only as generalized pattern/eval sources unless explicitly reviewed for training eligibility.

Relevant release files:

- `release/SOURCE_INVENTORY.md`
- `release/source-inventory.json`
- `release/DATA_PROVENANCE_DRAFT.md`
- `datasets/candidates/v1.7-rmdw-business-owner-suite.jsonl`

Excluded from training:

- Raw secrets, API keys, OAuth tokens, private keys, cookies, and credentials.
- Closed-model answers from OpenAI, Anthropic, Gemini, or similar providers as supervised completions unless terms clearly allow it.
- Private client data, customer notes, contracts, raw support logs, and client-specific website copy without explicit review and consent.

## Evaluation Snapshot

Local product-path evidence:

- Unit tests: `65` passing.
- Full local RC smoke: passed.
- Router hard harness: `23/23`.
- Router static checks: `23/23`.
- Business-suite prompts: `2/2`.
- Local API harness: website and business-suite artifacts pass.

Merged serving evidence:

- Current endpoint: `http://127.0.0.1:18181/v1`, forwarding to vLLM
  bitsandbytes on Gojira B at `http://100.109.109.14:18084/v1`
- Served model: `kaiju-coder-7`
- Tested context: `16384` for the current OpenCode fast path. Historical
  SGLang benchmark evidence includes `32768`, but 32k should be freshly
  restarted and re-confirmed before being called the live default.
- Probe: `1,155` visible chars in `60.17s`.
- Proposal rerun: `1/1` paid-ready, `4.0/4.0`, `4,014` chars in `212.72s`.
- Jah credits backend: `4.0/4.0`, `9,718` chars in `566.36s`.
- OpenCode customer-readiness harness: `4/4` tasks passed, `28/28` required files written, including source/provenance and release-claim safety review.
- vLLM nightly serving probe: passed at `16384` after `pandas` preinstall and
  `--language-model-only`.
- Runtime-quantized vLLM bitsandbytes: current speed path; passed at `8192`
  and `16384`; 16k code patch completed in `11.3s`, and logs reported about
  `17.8 GiB` model memory.

Known comparison caveat:

- Dynamic SGLang LoRA serving is not release evidence for this adapter: adapter-name-only output can be base-equivalent, and corrected selector `qwen36-27b:kaiju_v18_business_owner` crashes with a fused-module LoRA buffer shape mismatch.
- Do not claim raw-weight superiority until broader base-Qwen and GLM/current-production comparisons are complete.

## Limitations

- Raw full-website generation has not yet passed the merged-model release sweep and should remain harness-first for paid delivery.
- The deterministic harness remains the practical paid website workflow.
- The adapter needs a strong app layer for file editing, tool use, auth, billing, rate limits, logging, and rollback.
- Public HF upload and human review are complete for testing. Real customer
  paid charging still requires Stripe live-mode setup and controlled live
  payment verification.
- Not intended for high-risk medical, legal, financial, or safety-critical decisions without expert review.