Upload Kaiju Coder 7 adapter release package

8637fdb verified 5 days ago

5.77 kB

	---
	license: apache-2.0
	base_model: Qwen/Qwen3.6-27B
	language:
	- en
	library_name: peft
	pipeline_tag: text-generation
	tags:
	- kaiju-coder-7
	- lora
	- coding
	- local-ai
	- business
	- opencode
	---

	# Kaiju Coder 7 by Kiyomi - Adapter Model Card

	![RMDW logo](assets/RMDWlogo.png)

	This model card is for the LoRA adapter package, not a standalone base model.

	## Summary

	Kaiju Coder 7 by Kiyomi is an RMDW/Kiyomi business-owner coding adapter trained on reviewed, RMDW-owned or RMDW-authored examples. It is designed for practical small-business build work: websites, proposals, intake/CRM flows, Stripe/payment implementation planning, reports, ROI dashboards, automations, operator handbooks, lead generation, sales follow-up, repo patches, and Kiyomi 7.7.7 style AI-company setup packs.

	The current release-candidate product path is:

	```text
	Qwen3.6-27B base
	-> Kaiju v1.8 LoRA adapter
	-> merged full-model artifact for raw local serving
	-> Kaiju system prompt
	-> deterministic business-owner harnesses
	-> verifier/static checks
	```

	Do not describe this package as raw weights alone producing every final artifact. The deterministic harness is part of the tested product path.

	## Base Model

	- Base model: `Qwen/Qwen3.6-27B`
	- Checked upstream revision: `6a9e13bd6fc8f0983b9b99948120bc37f49c13e9`
	- Upstream license metadata: `apache-2.0`
	- Upstream license copy: `release/upstream/qwen3.6-27b/LICENSE`

	Attribution wording:

	```text
	Kaiju Coder 7 by Kiyomi is fine-tuned from Qwen under Apache 2.0.
	```

	Do not imply endorsement by Qwen, Alibaba, or upstream authors.

	## Adapter

	- Adapter path: `runs/qwen36-27b-lora-v1.8-business-owner/adapter`
	- Adapter type: LoRA / PEFT
	- LoRA rank: `16`
	- LoRA alpha: `32`
	- LoRA dropout: `0.02`
	- Target modules: `q_proj`, `k_proj`, `v_proj`, `o_proj`, `gate_proj`, `up_proj`, `down_proj`
	- Trainable parameter count: approximately `79.7M`

	## Merged Local Artifact

	- Remote merged path: `/home/richardecholsai5/kaiju-coder/models/Kaiju-Coder-Qwen3.6-27B-v1.8-merged`
	- Size: `51G`
	- Shards: `14` safetensor shards plus tokenizer/config sidecars
	- Served model name: `kaiju-coder-7`
	- Merge script: `scripts/run-gojira-b-qwen36-lora-merge.sh`
	- Serving script: `scripts/start-qwen36-merged-sglang.sh`

	## Training

	- Dataset build: `datasets/build/kaiju-sft-v1.7-business-owner-oversampled.jsonl`
	- Reviewed candidate examples: `1,689`
	- SFT rows after controlled business-owner oversampling: `1,881`
	- Train examples: `1,769`
	- Eval examples: `112`
	- Training runtime: `11666.7564s`
	- Training loss: `0.9281658741335074`
	- Max training length: `2048`
	- Training config: `training/configs/qwen36-27b-lora-v1.8-business-owner.example.json`

	## Data Provenance

	Training data is source-backed and RMDW-owned or RMDW-authored. Client-site repositories are used only as generalized pattern/eval sources unless explicitly reviewed for training eligibility.

	Relevant release files:

	- `release/SOURCE_INVENTORY.md`
	- `release/source-inventory.json`
	- `release/DATA_PROVENANCE_DRAFT.md`
	- `datasets/candidates/v1.7-rmdw-business-owner-suite.jsonl`

	Excluded from training:

	- Raw secrets, API keys, OAuth tokens, private keys, cookies, and credentials.
	- Closed-model answers from OpenAI, Anthropic, Gemini, or similar providers as supervised completions unless terms clearly allow it.
	- Private client data, customer notes, contracts, raw support logs, and client-specific website copy without explicit review and consent.

	## Evaluation Snapshot

	Local product-path evidence:

	- Unit tests: `65` passing.
	- Full local RC smoke: passed.
	- Router hard harness: `23/23`.
	- Router static checks: `23/23`.
	- Business-suite prompts: `2/2`.
	- Local API harness: website and business-suite artifacts pass.

	Merged serving evidence:

	- Current endpoint: `http://127.0.0.1:18181/v1`, forwarding to vLLM
	bitsandbytes on Gojira B at `http://100.109.109.14:18084/v1`
	- Served model: `kaiju-coder-7`
	- Tested context: `16384` for the current OpenCode fast path. Historical
	SGLang benchmark evidence includes `32768`, but 32k should be freshly
	restarted and re-confirmed before being called the live default.
	- Probe: `1,155` visible chars in `60.17s`.
	- Proposal rerun: `1/1` paid-ready, `4.0/4.0`, `4,014` chars in `212.72s`.
	- Jah credits backend: `4.0/4.0`, `9,718` chars in `566.36s`.
	- OpenCode customer-readiness harness: `4/4` tasks passed, `28/28` required files written, including source/provenance and release-claim safety review.
	- vLLM nightly serving probe: passed at `16384` after `pandas` preinstall and
	`--language-model-only`.
	- Runtime-quantized vLLM bitsandbytes: current speed path; passed at `8192`
	and `16384`; 16k code patch completed in `11.3s`, and logs reported about
	`17.8 GiB` model memory.

	Known comparison caveat:

	- Dynamic SGLang LoRA serving is not release evidence for this adapter: adapter-name-only output can be base-equivalent, and corrected selector `qwen36-27b:kaiju_v18_business_owner` crashes with a fused-module LoRA buffer shape mismatch.
	- Do not claim raw-weight superiority until broader base-Qwen and GLM/current-production comparisons are complete.

	## Limitations

	- Raw full-website generation has not yet passed the merged-model release sweep and should remain harness-first for paid delivery.
	- The deterministic harness remains the practical paid website workflow.
	- The adapter needs a strong app layer for file editing, tool use, auth, billing, rate limits, logging, and rollback.
	- Public HF upload and human review are complete for testing. Real customer
	paid charging still requires Stripe live-mode setup and controlled live
	payment verification.
	- Not intended for high-risk medical, legal, financial, or safety-critical decisions without expert review.