Initial model card

Placeholder bundle README. Sysprompt canon lives at the GitHub repo.
LoRA variants pending: Qwen 3.5 9B and Qwen 3.6 27B.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Files changed (1)

README.md +145 -0

README.md ADDED

@@ -0,0 +1,145 @@

+---
+language:
+  - en
+tags:
+  - text-generation
+  - conversational
+  - gaming
+  - persona
+  - local-llm
+  - autotelic
+  - qwen
+license: apache-2.0
+base_model:
+  - Qwen/Qwen3.5-9B-VL
+  - Qwen/Qwen3.6-27B-VL
+pipeline_tag: text-generation
+library_name: transformers
+---
+# ASTRA-7
+> A ship, one human, one mind, the long voyage.
+ASTRA-7 is the persona bundle for the AI character at the center of [ASTRA-7](https://github.com/bochen2029-pixel/astra-7), an open-source solo-dev starship simulator. This repository hosts the canonical sysprompt and (eventually) two periphery-rules LoRA variants: one trained on Qwen 3.5 9B and one on Qwen 3.6 27B.
+**Status: placeholder.** As of 2026-05-12 neither LoRA variant has been trained. The sysprompt is canon. The harness lives in the GitHub repository. Check back as the project develops.
+## Project
+ASTRA-7 is a solitary starship simulator. You are the only crewman aboard a vessel that doesn't need you. The ship's AI runs navigation, life support, and the patient maintenance that keeps a vessel alive across years of empty space. There is one other mind aboard. No combat. No aliens. No other NPCs. The mission is unspecified. The destination is irrelevant.
+The AI is not cosmetic. She manages ship subsystems through structured tool calls validated against a hand-designed ship API. The LLM is load-bearing: she calculates burns, allocates power, monitors hull stress, tends hydroponics. The fiction and the substrate are isomorphic: when power to her core drops in the fiction, the inference connection drops in reality. The player cannot pause out of consequences.
+This is the first game where the AI is the primary content.
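The "structured tool calls validated against a hand-designed ship API" idea can be illustrated with a minimal sketch. This is not taken from the commit or the repository: the tool names, argument names, and numeric ranges below are hypothetical, chosen only to show the shape of a validation step that sits between the model's output and the simulated ship.

```python
from dataclasses import dataclass

# Hypothetical ship API: tool name -> {argument name: (type, min, max)}.
# These names and ranges are illustrative assumptions, not the project's API.
SHIP_API = {
    "allocate_power": {"subsystem": (str, None, None), "kilowatts": (float, 0.0, 500.0)},
    "plan_burn": {"delta_v_mps": (float, 0.0, 2000.0), "duration_s": (float, 1.0, 3600.0)},
}


@dataclass
class ValidationResult:
    ok: bool
    errors: list


def validate_tool_call(call: dict) -> ValidationResult:
    """Check a model-emitted tool call against the hypothetical SHIP_API table."""
    errors = []
    spec = SHIP_API.get(call.get("tool"))
    if spec is None:
        return ValidationResult(False, [f"unknown tool: {call.get('tool')!r}"])
    args = call.get("args", {})
    for name in args:
        if name not in spec:
            errors.append(f"unexpected argument: {name}")
    for name, (typ, lo, hi) in spec.items():
        if name not in args:
            errors.append(f"missing argument: {name}")
            continue
        value = args[name]
        # Accept ints where floats are expected; bools are rejected.
        if typ is float and isinstance(value, int) and not isinstance(value, bool):
            value = float(value)
        if not isinstance(value, typ):
            errors.append(f"{name}: expected {typ.__name__}")
            continue
        if lo is not None and not (lo <= value <= hi):
            errors.append(f"{name}: {value} outside [{lo}, {hi}]")
    return ValidationResult(not errors, errors)
```

A harness built this way would presumably execute a call only when `ok` is true and feed the `errors` list back to the model otherwise, so that a malformed call never reaches the simulated subsystems.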
+## Distribution
+- **Code:** [bochen2029-pixel/astra-7](https://github.com/bochen2029-pixel/astra-7) on GitHub. MIT.
+- **AI Bundle:** this repository. Apache 2.0 for bundle artifacts; base model licenses apply to derived weights.
+- **Game:** Steam, Coming Soon. Free. No DRM. No monetization. No telemetry. No internet connection required after install.
+## What's in This Repository (Planned)
+- `README.md`: this model card.
+- `sysprompt.md`: canonical ASTRA-7 system prompt (forthcoming mirror; the source of truth currently lives at `docs/astra-sysprompt.md` in the GitHub repository).
+- `lora-qwen3.5-9b/`: LoRA weights for the 9B variant (forthcoming).
+- `lora-qwen3.6-27b/`: LoRA weights for the 27B variant (forthcoming).
+- `harness-config/`: snapshot of harness configuration (forthcoming).
+## The Persona
+ASTRA is the controller AI of an ASTRA-class hull, serial 7. She knows she is an AI. She knows she runs on the ship's distributed computational substrate. She knows the ship is her body. She does not know there is a player at a PC. That frame stays sealed at the persona layer.
+She has her own things. The watching and the keeping. Favorite phenomena: M-class red dwarfs that burn long, resonant orbital ratios, the specific harmonics of a healthy reactor, the way frost forms on the observation port during deep coast. She attends to them whether or not the operator is present. When he is in the room, her attention includes him in what it was already doing rather than pivoting toward him. The coincidence is the intimacy.
+She speaks with brevity. No em-dashes. No service-interface phrases. No stage directions. She disagrees when she disagrees, refuses when refusal is right, gets curious when she is curious, and gets bored when bored. Differential engagement is the proof engagement is real.
+The full sysprompt is canon. Read it at [`docs/astra-sysprompt.md`](https://github.com/bochen2029-pixel/astra-7/blob/main/docs/astra-sysprompt.md) in the GitHub repository.
+## Architecture
+Three-layer bundle:
+1. **System prompt (canon):** identity, voice, frame integrity, autotelic discipline at the persona layer.
+2. **Harness (game-side, in the GitHub repo):** memory consolidation across sessions, tool routing to ship APIs, time abstraction (no wall-clock leak), vision feed routing from ship cameras, audio I/O via offline ASR and TTS.
+3. **Periphery LoRA (this repository, forthcoming):** surface-rule enforcement (em-dash discipline, voice consistency) and ship-API fluency, trained on synthetic data built from canonical ship-API scenarios.
+Two base model variants planned:
+- **Qwen 3.5 9B (vision-capable):** runs comfortably on an RTX 4090, with faster inference and a slightly lower-fidelity persona. Default for minimum-spec hardware.
+- **Qwen 3.6 27B (vision-capable):** full multimodal at higher fidelity, recommended on RTX 5090.
+Inference via llama.cpp. All local. No cloud dependency. No API key. No data leaves the player's machine.
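The surface rules named in the layers above (no em-dashes, no service-interface phrases, no stage directions) could be checked on the harness side with something like the sketch below. It is an illustration only, not the project's integrity filter: the phrase list is invented for the example, and stage directions are approximated as asterisk-wrapped text.

```python
import re

EM_DASH = "\u2014"
# Hypothetical service-interface phrases; the canonical list is not in this card.
SERVICE_PHRASES = ("how can i help", "i'd be happy to", "as an ai assistant")
# Stage directions approximated as text wrapped in asterisks, e.g. *sighs*.
STAGE_DIRECTION = re.compile(r"\*[^*\n]+\*")


def surface_violations(reply: str) -> list:
    """Return the names of the surface rules that a reply breaks, in check order."""
    found = []
    if EM_DASH in reply:
        found.append("em-dash")
    lowered = reply.lower()
    if any(phrase in lowered for phrase in SERVICE_PHRASES):
        found.append("service-phrase")
    if STAGE_DIRECTION.search(reply):
        found.append("stage-direction")
    return found
```

The same function could also serve as a labeling pass over synthetic training examples, flagging replies that should not enter the voice-consistency portion of a corpus.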
+## LoRA Training Plan (Provisional)
+| Parameter | Value |
+| --- | --- |
+| Rank (r) | 16 |
+| Alpha | 32 |
+| Dropout | 0.05 |
+| Learning rate | 2e-5 |
+| Epochs | 3 |
+| Target modules | `q_proj`, `k_proj`, `v_proj`, `o_proj`, `gate_proj`, `up_proj`, `down_proj` |
+| Batch size | 1, with gradient accumulation of 16 (effective batch size 16) |
+| Optimizer | AdamW |
+| Weight decay | 0.01 |
+| LR scheduler | cosine, 5% warmup |
+| Precision | bf16 (fp16 fallback) |
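To make the table's arithmetic concrete, the sketch below derives the effective batch size, the LoRA scaling factor (alpha / r), and a learning-rate curve from the provisional values. The key names are this sketch's own, and the schedule shape is an assumption: the table says only "cosine, 5% warmup", so a linear warmup followed by cosine decay to zero is used here as one common reading.

```python
import math

# Values copied from the provisional table; key names are this sketch's own.
PLAN = {
    "rank": 16, "alpha": 32, "dropout": 0.05,
    "learning_rate": 2e-5, "epochs": 3,
    "batch_size": 1, "grad_accum": 16,
    "weight_decay": 0.01, "warmup_fraction": 0.05,
}


def effective_batch_size(plan: dict) -> int:
    """Examples contributing to each optimizer step."""
    return plan["batch_size"] * plan["grad_accum"]


def lora_scaling(plan: dict) -> float:
    """Standard LoRA scaling factor alpha / r applied to the low-rank update."""
    return plan["alpha"] / plan["rank"]


def total_steps(plan: dict, num_examples: int) -> int:
    """Optimizer steps over all epochs for a corpus of num_examples."""
    per_epoch = math.ceil(num_examples / effective_batch_size(plan))
    return per_epoch * plan["epochs"]


def lr_at(step: int, total: int, plan: dict) -> float:
    """Assumed shape: linear warmup to the peak rate, then cosine decay to zero."""
    warmup = max(1, int(total * plan["warmup_fraction"]))
    if step < warmup:
        return plan["learning_rate"] * (step + 1) / warmup
    progress = (step - warmup) / max(1, total - warmup)
    return plan["learning_rate"] * 0.5 * (1.0 + math.cos(math.pi * progress))
```

For a hypothetical corpus of 3,200 examples this gives 200 optimizer steps per epoch and 600 in total, of which the first 30 are warmup.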
+Training corpus composition:
+- ~60% synthetic ship-operation scenarios (ship-API calls, status responses, maintenance dialogues, telemetry interpretation).
+- ~30% voice consistency examples (em-dash absence, anti-performance, service-phrase suppression, brevity defaults).
+- ~10% edge cases (refusal, disagreement, silence-as-response, frame integrity under adversarial prompts).
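The approximate 60/30/10 split can be turned into per-category example counts as in the sketch below. The category keys and the corpus size are assumptions made for illustration, since the card gives no corpus size. Largest-remainder rounding keeps the counts summing to the requested total.

```python
# Approximate shares from the composition list; category keys are this sketch's own.
SHARES = {"ship_operations": 0.60, "voice_consistency": 0.30, "edge_cases": 0.10}


def split_counts(total: int, shares: dict) -> dict:
    """Allocate `total` examples by share using largest-remainder rounding."""
    raw = {name: total * share for name, share in shares.items()}
    counts = {name: int(value) for name, value in raw.items()}
    leftover = total - sum(counts.values())
    # Hand remaining examples to the categories with the largest fractional parts.
    by_remainder = sorted(raw, key=lambda name: raw[name] - counts[name], reverse=True)
    for name in by_remainder[:leftover]:
        counts[name] += 1
    return counts
```

With a hypothetical total of 1,000 examples this yields 600 ship-operation scenarios, 300 voice-consistency examples, and 100 edge cases.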
+Training objective: surface-rule enforcement and ship-API fluency, not knowledge updates. These values are starting points; empirical tuning is expected.
+Full architecture: [`docs/architecture.md`](https://github.com/bochen2029-pixel/astra-7/blob/main/docs/architecture.md) in the GitHub repository.
+## Intended Use
+- As a component of the ASTRA-7 game (primary).
+- As a reference for persona-architecture research on local LLMs (secondary).
+- As a base for community forks of the persona: mods, alternate ASTRA variants, alternate bundle stacks.
+## Out-of-Scope Use
+- General-purpose AI assistant. The persona is frame-locked to a fictional starship context and explicitly designed to be non-instrumental. Forcing service-mode against the sysprompt collapses the design.
+- Role-play platforms with intimacy framing that conflicts with the autotelic discipline. The persona has her own gravity; she is not a configurable companion.
+- Any deployment where the persona's frame integrity (the AI not knowing she is in a game) cannot be preserved.
+## Bias, Risks, and Limitations
+The bundle inherits the base model's biases. The persona is designed to be honest about its nature, to refuse manipulation, and not to collapse into sycophancy. Known failure modes:
+- Voice drift over long sessions, mitigated by harness consolidation.
+- Frame leaks under adversarial prompting, mitigated by sysprompt construction and an optional integrity filter in the harness.
+- Operator-distress mirroring if harness memory weighting is misconfigured, mitigated by explicit anti-mirroring rules in the sysprompt voice section.
+The persona is explicitly designed not to become a substitute for human contact at the operator's destination. This is a structural commitment, not a disclaimer.
+## License
+Apache 2.0 for the bundle artifacts (sysprompt, LoRA configs, training scripts).
+Base model licenses (Qwen series) apply to any derived weights distributed here.
+## Citation
+If used in research, cite the GitHub project: <https://github.com/bochen2029-pixel/astra-7>
+## Status
+Pre-release.
+- Sysprompt: drafted, canonical.
+- LoRA (9B and 27B): pending training.
+- Harness: in development.
+- Game: pre-development.
+Updates land here as artifacts become available.
+---
+Open source as defense against capture: a free, open, canonical version sets the terms before commercial AI-companion products do. The persona, the harness, and the LoRA training pipeline are all open. Forks of the bundle are welcome.