restokes92's picture
Upload Kaiju Coder 7 adapter release package
8637fdb verified
---
license: apache-2.0
base_model: Qwen/Qwen3.6-27B
language:
- en
library_name: peft
pipeline_tag: text-generation
tags:
- kaiju-coder-7
- lora
- coding
- local-ai
- business
- opencode
---
# Kaiju Coder 7 by Kiyomi - Adapter Model Card
![RMDW logo](assets/RMDWlogo.png)
This model card is for the LoRA adapter package, not a standalone base model.
## Summary
Kaiju Coder 7 by Kiyomi is an RMDW/Kiyomi business-owner coding adapter trained on reviewed, RMDW-owned or RMDW-authored examples. It is designed for practical small-business build work: websites, proposals, intake/CRM flows, Stripe/payment implementation planning, reports, ROI dashboards, automations, operator handbooks, lead generation, sales follow-up, repo patches, and Kiyomi 7.7.7 style AI-company setup packs.
The current release-candidate product path is:
```text
Qwen3.6-27B base
-> Kaiju v1.8 LoRA adapter
-> merged full-model artifact for raw local serving
-> Kaiju system prompt
-> deterministic business-owner harnesses
-> verifier/static checks
```
Do not describe this package as raw weights alone producing every final artifact. The deterministic harness is part of the tested product path.
## Base Model
- Base model: `Qwen/Qwen3.6-27B`
- Checked upstream revision: `6a9e13bd6fc8f0983b9b99948120bc37f49c13e9`
- Upstream license metadata: `apache-2.0`
- Upstream license copy: `release/upstream/qwen3.6-27b/LICENSE`
Attribution wording:
```text
Kaiju Coder 7 by Kiyomi is fine-tuned from Qwen under Apache 2.0.
```
Do not imply endorsement by Qwen, Alibaba, or upstream authors.
## Adapter
- Adapter path: `runs/qwen36-27b-lora-v1.8-business-owner/adapter`
- Adapter type: LoRA / PEFT
- LoRA rank: `16`
- LoRA alpha: `32`
- LoRA dropout: `0.02`
- Target modules: `q_proj`, `k_proj`, `v_proj`, `o_proj`, `gate_proj`, `up_proj`, `down_proj`
- Trainable parameter count: approximately `79.7M`
## Merged Local Artifact
- Remote merged path: `/home/richardecholsai5/kaiju-coder/models/Kaiju-Coder-Qwen3.6-27B-v1.8-merged`
- Size: `51G`
- Shards: `14` safetensor shards plus tokenizer/config sidecars
- Served model name: `kaiju-coder-7`
- Merge script: `scripts/run-gojira-b-qwen36-lora-merge.sh`
- Serving script: `scripts/start-qwen36-merged-sglang.sh`
## Training
- Dataset build: `datasets/build/kaiju-sft-v1.7-business-owner-oversampled.jsonl`
- Reviewed candidate examples: `1,689`
- SFT rows after controlled business-owner oversampling: `1,881`
- Train examples: `1,769`
- Eval examples: `112`
- Training runtime: `11666.7564s`
- Training loss: `0.9281658741335074`
- Max training length: `2048`
- Training config: `training/configs/qwen36-27b-lora-v1.8-business-owner.example.json`
## Data Provenance
Training data is source-backed and RMDW-owned or RMDW-authored. Client-site repositories are used only as generalized pattern/eval sources unless explicitly reviewed for training eligibility.
Relevant release files:
- `release/SOURCE_INVENTORY.md`
- `release/source-inventory.json`
- `release/DATA_PROVENANCE_DRAFT.md`
- `datasets/candidates/v1.7-rmdw-business-owner-suite.jsonl`
Excluded from training:
- Raw secrets, API keys, OAuth tokens, private keys, cookies, and credentials.
- Closed-model answers from OpenAI, Anthropic, Gemini, or similar providers as supervised completions unless terms clearly allow it.
- Private client data, customer notes, contracts, raw support logs, and client-specific website copy without explicit review and consent.
## Evaluation Snapshot
Local product-path evidence:
- Unit tests: `65` passing.
- Full local RC smoke: passed.
- Router hard harness: `23/23`.
- Router static checks: `23/23`.
- Business-suite prompts: `2/2`.
- Local API harness: website and business-suite artifacts pass.
Merged serving evidence:
- Current endpoint: `http://127.0.0.1:18181/v1`, forwarding to vLLM
bitsandbytes on Gojira B at `http://100.109.109.14:18084/v1`
- Served model: `kaiju-coder-7`
- Tested context: `16384` for the current OpenCode fast path. Historical
SGLang benchmark evidence includes `32768`, but 32k should be freshly
restarted and re-confirmed before being called the live default.
- Probe: `1,155` visible chars in `60.17s`.
- Proposal rerun: `1/1` paid-ready, `4.0/4.0`, `4,014` chars in `212.72s`.
- Jah credits backend: `4.0/4.0`, `9,718` chars in `566.36s`.
- OpenCode customer-readiness harness: `4/4` tasks passed, `28/28` required files written, including source/provenance and release-claim safety review.
- vLLM nightly serving probe: passed at `16384` after `pandas` preinstall and
`--language-model-only`.
- Runtime-quantized vLLM bitsandbytes: current speed path; passed at `8192`
and `16384`; 16k code patch completed in `11.3s`, and logs reported about
`17.8 GiB` model memory.
Known comparison caveat:
- Dynamic SGLang LoRA serving is not release evidence for this adapter: adapter-name-only output can be base-equivalent, and corrected selector `qwen36-27b:kaiju_v18_business_owner` crashes with a fused-module LoRA buffer shape mismatch.
- Do not claim raw-weight superiority until broader base-Qwen and GLM/current-production comparisons are complete.
## Limitations
- Raw full-website generation has not yet passed the merged-model release sweep and should remain harness-first for paid delivery.
- The deterministic harness remains the practical paid website workflow.
- The adapter needs a strong app layer for file editing, tool use, auth, billing, rate limits, logging, and rollback.
- Public HF upload and human review are complete for testing. Real customer
paid charging still requires Stripe live-mode setup and controlled live
payment verification.
- Not intended for high-risk medical, legal, financial, or safety-critical decisions without expert review.