Instructions to use MediaStreamAI/MOTHER_CORE_V2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use MediaStreamAI/MOTHER_CORE_V2 with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="MediaStreamAI/MOTHER_CORE_V2", trust_remote_code=True)

# Load model directly
from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("MediaStreamAI/MOTHER_CORE_V2", trust_remote_code=True, dtype="auto")

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use MediaStreamAI/MOTHER_CORE_V2 with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "MediaStreamAI/MOTHER_CORE_V2"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "MediaStreamAI/MOTHER_CORE_V2",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/MediaStreamAI/MOTHER_CORE_V2

SGLang

How to use MediaStreamAI/MOTHER_CORE_V2 with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "MediaStreamAI/MOTHER_CORE_V2" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "MediaStreamAI/MOTHER_CORE_V2",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "MediaStreamAI/MOTHER_CORE_V2" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "MediaStreamAI/MOTHER_CORE_V2",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Docker Model Runner
How to use MediaStreamAI/MOTHER_CORE_V2 with Docker Model Runner:
```
docker model run hf.co/MediaStreamAI/MOTHER_CORE_V2
```

MediaStreamAI commited on 13 days ago

Commit

4df46a4

verified ·

1 Parent(s): ca911cf

chunk 600 (W2.8 cutover BASE): upload README.md

Browse files

Files changed (1) hide show

README.md +71 -416

README.md CHANGED Viewed

@@ -5,446 +5,101 @@ license_link: LICENSE
 language:
   - en
   - cy
-  - ga
   - gd
 tags:
-  - sovereign-ai
-  - uk
-  - reasoning
-  - msai
   - mother-core
-pipeline_tag: text-generation
-library_name: pytorch
----
-# MOTHER CORE V2 — chunk 450 (W2.7)
-**Sovereign UK AI** built from scratch by **MediaStream AI Limited (MSAI)**.
-This is a development checkpoint released for **MSAI team and partner testing only**. It is **not** a released model and **not** intended for production use. Eval performance is partial; the model is mid-training.
----
-## 1. Model Summary
-| Field | Value |
-|---|---|
-| Model | MOTHER CORE V2 |
-| Checkpoint | chunk 450 (W2.7 stage) |
-| Parameters | 6.877B |
-| Architecture | Custom transformer (RoPE, GQA, RMSNorm, SwiGLU FFN, memory gate) |
-| Layers | 48 |
-| Hidden dimension | 3,072 |
-| Attention heads | 24 (head_dim 128) |
-| KV heads | 6 (GQA ratio 4:1) |
-| FFN multiplier | 4.0 (intermediate 12,288) |
-| Max sequence length | 4,096 |
-| Vocabulary | 50,258 (SentencePiece) |
-| RoPE θ | 10,000 |
-| RMSNorm ε | 1e-5 |
-| Tied embeddings | No (separate `lm_head`) |
-| Weights dtype (this release) | bfloat16 |
-| Training dtype | float32 |
-This is a **from-scratch sovereign build**. It is not a fine-tune of any external model (Llama, Qwen, Mistral, GPT, etc.). Training, tokenisation, architecture, and corpus are all proprietary to MSAI.
----
-## 2. Status
-| Metric | Value |
-|---|---|
-| Training stage | W2.7 (mid-curriculum) |
-| Most recent chunk eval | 47/105 @ chunk 450 |
-| Scope | math, science, reasoning, chain-of-thought, UK knowledge, Celtic languages, MOTHER identity, agentic tool use, multi-step planning, RAG, memory, composition (see §3) |
-| Out of scope (separate future models) | code generation, creative writing, vision |
-This release is for **internal team testing**. It will fail on tasks outside its training scope.
-The training trajectory has been monotonic since chunk 300:
-| Chunk | Eval | Loss |
-|---|---|---|
-| 300 | 36/105 | 2.47 |
-| 350 | 37/105 | 2.05 |
-| 400 | 45/105 | 2.01 |
-| **450** | **47/105** | **1.74** |
-W2.7 will continue to chunk 650, after which the W2.8 corpus addition (~330,000 records spanning agentic orchestration, multi-step reasoning, tool use, memory synthesis) will be merged for the next training phase.
----
-## 3. Agent Capabilities Trained
-This checkpoint was trained on the **W2.7 agentic curriculum** in addition to the base reasoning corpus. The model has been exposed to 57 agent-related training categories spanning planning, tool-calling, chain composition, recovery, RAG, memory, and workflow execution.
-The per-category training loss values below are taken from chunk 484 (closest complete log to chunk 450); lower is better — values below 0.5 indicate the category is well-learned, 0.5-1.0 is partially learned, >1.0 needs more training.
-### 3.1 Agent reasoning & planning
-| Category | Loss @ chunk 484 | Purpose |
-|---|---|---|
-| `agent_cot_planning` | 0.45 | Decompose a user goal into a stepped plan before acting |
-| `agent_cot_decomposition` | 0.42 | Break a multi-part task into independent sub-tasks |
-| `agent_cot_synthesis` | 0.39 | Combine multiple tool results into a single answer |
-| `agent_cot_verification` | 0.37 | Verify a tool result against the original acceptance criteria |
-| `agent_cot_replan` | 0.03 | Revise the plan mid-execution when an observation invalidates it |
-| `agent_args_validation` | 0.03 | Validate tool-call arguments before emitting the call |
-| `agent_args_hallucination_resist` | 0.08 | Refuse to invent arguments not present in the conversation |
-### 3.2 Tool calling
-| Category | Loss @ chunk 484 | Purpose |
-|---|---|---|
-| `agent_call_documents` | 0.59 | Call doc tools (Drive, Notion, PDF/Word/Excel creation) |
-| `agent_call_microsoft` | 2.22 | Call Microsoft tools (Graph, Outlook, Teams) — *needs more training* |
-| `agent_call_google` | 1.03 | Call Google tools (Drive, Calendar, Gmail) |
-| `agent_call_code` | 1.42 | Call code-execution tools (Python, shell, sandbox) |
-| `agent_no_tool_needed` | 0.29 | Recognise when a question needs no tool and answer directly |
-| `tool_choice_routing` | 0.17 | Route a request to the correct tool of several plausible ones |
-### 3.3 Multi-step chains
-| Category | Loss @ chunk 484 | Purpose |
-|---|---|---|
-| `agent_chain_3step` | 0.53 | Three-step sequential tool chains |
-| `agent_chain_5plus` | 0.59 | Five-or-more-step tool chains |
-| `agent_conditional_chain` | 0.43 | Branching chains where step N depends on step N-1's result |
-| `agent_parallel_calls` | 0.51 | Issue independent tool calls in parallel and merge results |
-### 3.4 Control flow & safety
-| Category | Loss @ chunk 484 | Purpose |
-|---|---|---|
-| `agent_disambiguation` | 0.65 | Ask for clarification when the user's request is ambiguous |
-| `agent_error_recovery` | 0.58 | Recover gracefully from a failed tool call |
-| `agent_mid_chain_abort` | 0.32 | Abort a chain when a step reveals the original goal is unreachable |
-| `agent_loop_aggregation` | 1.13 | Aggregate results from a loop of tool calls |
-| `agent_oauth_required` | 1.12 | Recognise when a tool needs OAuth and surface that to the user |
-| `agent_unsafe_refusal` | 0.21 | Refuse unsafe, malicious, or out-of-policy requests |
-### 3.5 RAG (retrieval-augmented generation)
-| Category | Loss @ chunk 484 | Purpose |
-|---|---|---|
-| `rag_single_call` | 1.34 | Single retrieval call before answering |
-| `rag_synthesis` | 1.90 | Synthesise across multiple retrieved chunks — *needs more training* |
-| `rag_with_citation` | 1.25 | Include source citations in the synthesised answer |
-| `rag_empty_fallback` | 0.25 | Handle "no relevant results" gracefully |
-| `rag_not_needed` | 1.34 | Decline to retrieve when the question doesn't warrant it |
-### 3.6 Working memory
-| Category | Loss @ chunk 484 | Purpose |
-|---|---|---|
-| `memory_store` | 0.05 | Persist a fact across turns in a session |
-| `memory_recall` | 0.33 | Retrieve a previously-stored fact when relevant |
-| `memory_multi_turn` | 0.19 | Carry intermediate state through a multi-turn session |
-| `memory_empty` | 0.12 | Handle the cold-start case where memory is empty |
-### 3.7 Composition (multi-modal chains)
-| Category | Loss @ chunk 484 | Purpose |
-|---|---|---|
-| `compose_calc_chain` | 0.53 | Compose calculator + downstream tool |
-| `compose_memory_calc` | 0.61 | Combine memory recall with calculation |
-| `compose_rag_calc` | 0.34 | Retrieve facts, then compute with them |
-| `compose_rag_multi` | 0.28 | Multi-step retrieval-and-reason chains |
-| `compose_rag_web` | 0.10 | Combine internal retrieval with web search |
-| `compose_web_calc` | 0.13 | Web search + calculation |
-| `compose_web_memory` | 0.65 | Web search + memory storage |
-### 3.8 Error recovery
-| Category | Loss @ chunk 484 | Purpose |
-|---|---|---|
-| `recovery_admit_failure` | 0.19 | Honestly admit when a tool failed rather than fabricate |
-| `recovery_alternate` | 0.09 | Try an alternative tool or strategy after failure |
-| `recovery_malformed` | 0.17 | Detect and repair malformed tool output |
-| `recovery_rewrite` | 0.10 | Rewrite a failing query in a way more likely to succeed |
-### 3.9 Web search primitives
-| Category | Loss @ chunk 484 | Purpose |
-|---|---|---|
-| `web_search_single` | 0.11 | One-shot web search |
-| `web_search_reading` | 0.19 | Fetch and read a specific URL |
-| `web_search_fallback` | 0.26 | Use web search when internal sources fail |
-### 3.10 Chat behaviour
-| Category | Loss @ chunk 484 | Purpose |
-|---|---|---|
-| `chat_greeting` | 0.37 | Conversational greetings |
-| `chat_acknowledgement` | 0.37 | Acknowledge received instructions |
-| `chat_identity` | 0.51 | Maintain MOTHER/MSAI identity in conversation |
-| `chat_helpful_refusal` | 0.54 | Decline politely and offer alternatives |
-| `chat_length_match` | 1.25 | Match response length to question complexity |
-| `chat_multi_turn` | 0.19 | Maintain coherence across conversation turns |
-### 3.11 Pre-built workflows
-| Category | Loss @ chunk 484 | Purpose |
-|---|---|---|
-| `workflow_invoice_send` | 0.68 | End-to-end invoice creation + send workflow |
-| `workflow_meeting_prep` | 0.86 | Meeting preparation (calendar + brief generation) |
-| `workflow_msai_specific` | 1.26 | MSAI-internal workflows (deal flow, tenant comms) |
-| `workflow_proposal_pipeline` | 1.06 | Proposal authoring pipeline |
-| `workflow_report_generation` | 0.93 | Reporting workflows (status, financial, ops) |
-### 3.12 AUTM integration
-| Category | Loss @ chunk 484 | Purpose |
-|---|---|---|
-| `autm_agent` | 0.19 | Generic AUTM agent calling convention |
-| `autm_vertical` | 0.19 | AUTM vertical-specific dispatching |
-### Loss interpretation guide
-- **< 0.30** — well-learned; production-trustable
-- **0.30 – 0.60** — partially learned; usable but supervise outputs
-- **0.60 – 1.00** — emerging; expect inconsistent behaviour
-- **> 1.00** — still training; treat outputs as unreliable
-### W2.8 will strengthen the weak categories
-The W2.8 corpus (~330,000 new records, currently in build) targets the categories above 1.0 — particularly `agent_call_microsoft` (2.22), `rag_synthesis` (1.90), `agent_call_code` (1.42), `rag_single_call` / `rag_not_needed` (1.34), `chat_length_match` (1.25), and `workflow_msai_specific` (1.26). W2.8 also adds new categories: `doc_format_subtyping`, `verifier_loop`, `args_validation_adversarial`, `execution_graph_dag`, `cot_replan_observation`, `tool_failure_recovery`, `rag_synthesis_grounded`, `retrieval_arbiter`, `multi_agent_orchestration`, and `memory_synthesis`.
----
-## 4. Locked Inference Rules
-**Deviation from these rules produces incorrect or degenerate output.** They are not suggestions — they are the inference recipe the model was trained against.
-| Setting | Value | Reason |
-|---|---|---|
-| Prompt format | `Question:\n\n{question}\n\nAnswer:` | Exact whitespace. Model is OOD without it. |
-| BOS token | id=1, `<s>` | Always prepended; model was trained with BOS at position 0 |
-| EOS token | id=2, `</s>` | Stop generation on emission |
-| PAD token | id=0, `<pad>` | Training only |
-| Sampling | **Greedy argmax** | No temperature, no top-k, no top-p |
-| Repetition penalty | 1.3 (frequency-scaled, count ≥ 2) | Higher values collapse output |
-| n-gram blocking | 4-gram, no repeat | Prevents loop output |
-| Max new tokens | 200 | Hard cap |
-| BOS in output | Banned | Never emit BOS during generation |
-| EOS in output | Allowed after first token | Early stop signal |
-### Reference code
-A working reference is included as `inference.py` in this repo. The canonical implementation lives in `mother_train_7b.py::_generate_greedy()` in the MSAI training repository. **Use `inference.py` from this repo or load `mother_train_7b._generate_greedy` directly.** Re-implementations frequently get the recipe wrong.
 ---
-## 5. Architecture Detail
-```
-MotherCoreModel
-├── tok_emb           [50258, 3072]
-├── blocks × 48
-│   └── each:
-│       ├── attn (GQA)
-│       │   ├── wq    [3072, 3072]     # 24 heads × 128 dim
-│       │   ├── wk    [768,  3072]     # 6 KV heads × 128 dim
-│       │   ├── wv    [768,  3072]
-│       │   └── wo    [3072, 3072]
-│       ├── ff (SwiGLU)
-│       │   ├── w1    [12288, 3072]
-│       │   ├── w2    [12288, 3072]
-│       │   └── w3    [3072, 12288]
-│       ├── norm_attn (RMSNorm)
-│       └── norm_ff   (RMSNorm)
-├── norm_f            [3072]
-├── lm_head           [50258, 3072]   # NOT tied to tok_emb
-└── memory_gate       [1, 3072] + bias[1]
-```
-### Memory gate
-`memory_gate` is a sigmoid-gated single-dimension projection from the last hidden state. It is **trained but not active in inference output** — it is reserved for downstream integration with MOTHER ROBOTICS (an item/object/situational/historical awareness model) and external memory systems. Its activation is exposed in the forward pass return dict but does not affect token logits.
-Forward return:
-```
-{
-  "logits":             [B, T, vocab],
-  "loss":               scalar or None,
-  "aux_loss":           scalar (MoE; unused here, fixed=0),
-  "past_key_values":    List[(K,V)] or None,
-  "hidden_states":      List[Tensor] or None,
-  "last_hidden_state":  [B, T, dim],
-  "gate":               [B, 1]              ← detached, FYI only
-}
-```
----
-## 6. Training
-### Corpus (W2.7)
-| Category | Records |
-|---|---|
-| Reasoning + chain-of-thought | ~390,000 |
-| UK general knowledge | ~210,000 |
-| Math & arithmetic (digit-spaced) | ~165,000 |
-| Identity & self-knowledge (MOTHER, MSAI) | ~32,000 |
-| Celtic languages (Welsh, Irish, Scottish Gaelic) | ~28,000 |
-| Science | ~88,000 |
-| Misc (chat, instruct skeleton) | ~135,000 |
-| **Total** | **~1.05M** |
-### Hyperparameters
-| Setting | Value |
 |---|---|
-| Learning rate | 1e-5 |
-| Gradient clip | 10.0 |
-| Effective batch size | 32 (BATCH_PHYSICAL=1 × GRAD_ACCUM_STEPS=32) |
-| Sequence length (training) | 4096 |
-| Optimiser | AdamW (β₁=0.9, β₂=0.95) |
-| Weight decay | 0.1 |
-| Warmup steps | 100 |
-| Layer-wise LR scaling | from chunk 10 onward |
-| Hardware | NVIDIA GB10 Blackwell (Grace–Blackwell unified memory, 128GB) |
-| Training site | MSAI Wright Avenue, Dundee — sovereign UK infrastructure |
-Training was performed at the full architecture sequence length of **4096** using physical microbatches of 1 with gradient accumulation of 32 (effective batch = 32). Because training and inference share the same context length, no RoPE extrapolation is required for 4096-token inference. Long-context behaviour at full 4096 has been exposed during training but not formally benchmarked at this checkpoint.
----
-## 7. Sovereign Build Posture
-MOTHER CORE is part of MSAI's sovereign AI stack — built end-to-end in the UK on UK-resident infrastructure. The training, weights, tokeniser, and corpus are owned by MSAI. The training datacentres are MSAI-operated (Wright Avenue, Dundee; with additional sites in Durham and Manchester). No US cloud provider is in the inference or training path.
-This positioning matters for UK government, defence, and regulated-enterprise customers where data residency, GDPR, and supply-chain provenance are mandatory.
----
-## 8. Intended Use & Out-of-Scope Use
-**In scope (this checkpoint):**
-- Reasoning and chain-of-thought tasks at modest difficulty
-- UK general knowledge questions
-- Welsh / Irish / Scottish Gaelic short-form questions
-- MOTHER-identity Q&A
-- Arithmetic on small integers (with digit-spaced inputs for ≥3-digit numbers)
-**Out of scope (this checkpoint):**
-- Code generation (separate model — MOTHER CODE — planned)
-- Creative writing (separate model — MOTHER LLM — planned)
-- Long-form (>1,000 token) generation
-- Multi-turn dialogue (training is single-turn Q/A)
-- Anything safety-critical, medical, legal, or financial advisory
-- Real-time information (model has no internet access at inference)
----
-## 9. Evaluation
-The internal eval suite at chunk 450 scores **47/105 (44.8%)** across:
-- Identity: 6/6 (100%)
-- UK knowledge: 9/12
-- Reasoning (multi-step): 14/35
-- Arithmetic: 5/15
-- Science: 7/12
-- Celtic languages: 4/9
-- Chain-of-thought: 2/16
-Persistent gaps at chunk 450:
-- Arithmetic on multi-digit numbers (training fix in progress — see W2.8 plan)
-- Multi-step reasoning beyond 3 hops
-- Welsh and Irish (smaller corpus volume than other categories)
-Eval suite and methodology are MSAI-internal. Comparable public benchmarks (MMLU, GSM8K) have **not** been run against this checkpoint and would not be directly comparable since the training corpus and tokeniser are sovereign.
----
-## 10. Limitations & Known Failure Modes
-1. **Single-turn only** — no chat-style multi-turn coherence
-2. **Format-brittle** — the `Question:\n\n...\n\nAnswer:` template is required; other formats produce OOD output
-3. **No tool use / no agent loop** at this checkpoint (W2.8 corpus will add this)
-4. **No code generation** — even simple Python will fail; not in scope
-5. **No retrieval / no internet** — closed-book knowledge only, as of training cutoff
-6. **Arithmetic at multi-digit numbers** — requires digit-spaced input (`1 5 + 2 7`) to perform reliably
-7. **`weights_only=False` required** if loading from `.pt` — this repo ships `.safetensors` instead which is safer
-8. **High repetition penalty (>1.4) collapses output** — stick to 1.3
----
-## 11. Usage
-### Quick test from a clean Python environment
-```bash
-pip install torch safetensors sentencepiece huggingface_hub
-```
-You also need the `mother_core` package source available (architecture is custom; no Transformers integration yet). Clone the MSAI training repo or copy `mother_core/` into your `PYTHONPATH`.
 ```python
-from huggingface_hub import snapshot_download
-repo_dir = snapshot_download(repo_id="MediaStreamAI/MOTHER_CORE_V2")
-# Then import inference.py from the snapshot
-import sys, importlib.util
-spec = importlib.util.spec_from_file_location("inf", f"{repo_dir}/inference.py")
-inf = importlib.util.module_from_spec(spec); spec.loader.exec_module(inf)
-model, tok = inf.load_model_and_tokenizer(repo_dir)
-print(inf.generate_greedy(model, tok, "What is the capital of Scotland?"))
-```
-Or run the inference script directly:
-```bash
-python inference.py "What is the capital of Scotland?"
 ```
-### File map
-| File | Purpose |
-|---|---|
-| `model-00001-of-00003.safetensors` | Weights, shard 1/3 |
-| `model-00002-of-00003.safetensors` | Weights, shard 2/3 |
-| `model-00003-of-00003.safetensors` | Weights, shard 3/3 |
-| `model.safetensors.index.json` | Shard index |
-| `config.json` | Architecture spec |
-| `tokenizer.model` | SentencePiece vocab |
-| `tokenizer_config.json` | Tokeniser config (`add_bos_token=true` required) |
-| `special_tokens_map.json` | BOS/EOS/PAD/UNK ids |
-| `inference.py` | Reference inference with locked rules |
-| `README.md` | This file |
----
-## 12. License
-**MSAI Sovereign License — Internal & Partner Use Only.**
-This model is the proprietary work of MediaStream AI Limited. It is released to authorised team members and contracted partners for evaluation and integration purposes. Redistribution, commercial use, or training other models on this model's outputs require written permission from MSAI.
-For licensing enquiries: contact MediaStream AI Limited via the company website.
----
-## 13. Citation
-```
-@misc{msai-mother-core-2026,
-  title  = {MOTHER CORE V2 — Sovereign UK AI},
-  author = {{MediaStream AI Limited}},
-  year   = {2026},
-  note   = {Chunk 450, W2.7 mid-training checkpoint},
-  url    = {https://huggingface.co/MediaStreamAI/MOTHER_CORE_V2}
-}
-```
----
-## 14. Contact
-- Organisation: MediaStream AI Limited (MSAI)
-- Founder & CEO: Christopher Kenna
-- Lead AI Architect: Christopher Kenna
-- Web: https://mediastreamai.com
-- Infrastructure: UK sovereign (Dundee, Durham, Manchester)

 language:
   - en
   - cy
   - gd
+  - ga
+pipeline_tag: text-generation
 tags:
   - mother-core
+  - msai
+  - sovereign-ai
+  - united-kingdom
+  - causal-lm
+library_name: transformers
 ---
+# MOTHER CORE V2 — chunk 600 (W2.8 cutover base)
+**Sovereign UK AI built from scratch by [MediaStream AI Limited (MSAI)](https://mediastreamai.com).**
+This is **MOTHER CORE BASE** — the frozen foundation checkpoint at chunk 600 of the W2.7 → W2.8 training programme. All downstream MOTHER models (DEFENCE, ROBOTICS, LLM, CODE) build on this base.
+- **Founder & CEO and Lead AI Architect:** Christopher Kenna
+- **Parameters:** 6.88B (FP32 source, BF16 weights here)
+- **Architecture:** 48 layers, dim 3072, 24 heads, 6 KV heads (GQA 4:1), RoPE θ=10000, RMS norm, tied embeddings
+- **Context:** 4096 tokens
+- **Training:** From-scratch sovereign UK build — no fine-tuning of external models
+- **Source SHA256:** `0b1ef35ec60af4a7ad0648498de8526cb775a19501dda94dfbda1713e0475b60`
+## Training journey
+| Milestone | Eval (105-question harness) |
 |---|---|
+| Chunk 450 (initial W2.7 baseline) | 47/105 (45%) |
+| Chunk 506 (post LR-fix rollback) | 44/105 (42%) |
+| Chunk 550 (recovery, LR-capped) | 46/105 (44%) |
+| **Chunk 600 (BASE freeze)** | **49/105 (47%)** |
+## Scope
+**MOTHER CORE handles:** math, science, reasoning, chain-of-thought, UK knowledge, MOTHER identity, tool calling (agents, RAG, memory, workflows), multilingual responses (English, Welsh, Irish, Scottish Gaelic), safety refusals.
+**MOTHER CORE does NOT handle (separate sister models):**
+- **MOTHER CODE** — software engineering, code generation
+- **MOTHER LLM** — long-form creative writing, instruction-tuned content
+- **MOTHER DEFENCE** — defence reasoning and strategy (W3 programme, builds on this BASE)
+- **MOTHER ROBOTICS** — humanoid robot embodiment (W4 programme, builds on this BASE)
+## Usage
 ```python
+from transformers import AutoTokenizer, AutoModelForCausalLM
+import torch
+tok = AutoTokenizer.from_pretrained("MediaStreamAI/MOTHER_CORE_V2")
+model = AutoModelForCausalLM.from_pretrained(
+    "MediaStreamAI/MOTHER_CORE_V2",
+    torch_dtype=torch.bfloat16,
+    device_map="auto",
+)
+prompt = "Question:\n\nWhat is the capital of Wales?\n\nAnswer:"
+inputs = tok(prompt, return_tensors="pt", add_special_tokens=True).to(model.device)
+out = model.generate(
+    **inputs,
+    max_new_tokens=200,
+    do_sample=False,
+    repetition_penalty=1.3,
+    no_repeat_ngram_size=4,
+    pad_token_id=tok.pad_token_id,
+)
+print(tok.decode(out[0], skip_special_tokens=True))
 ```
+**Critical inference rules:**
+- Prompt wrap: `"Question:\n\n{q}\n\nAnswer:"` (exact whitespace)
+- BOS token: 1 (required, `add_bos_token=True`)
+- EOS token: 2
+- PAD token: 0
+- **Use greedy decoding only.** Sampling produces gibberish.
+- Repetition penalty: 1.3, frequency-scaled
+- No-repeat n-gram size: 4
+## Programme context
+- **W2.7 (complete)** — Core capability training: math, science, reasoning, identity, UK knowledge, multilingual, agent tool-calling, RAG, chat, memory, workflows
+- **W2.8 (in progress)** — Document routing, argument validation, agent verifier loops, multi-step orchestration
+- **W3** — MOTHER DEFENCE (defence reasoning and strategy)
+- **W4** — MOTHER ROBOTICS (embodied awareness for humanoid platforms)
+UK sovereign infrastructure: Manchester (HQ), Dundee (flagship DC), Durham. Phase 2 expansion H2 2026 to Düsseldorf, South Africa, Jamaica.
+## License
+MSAI Sovereign License. See LICENSE file. Built sovereign in the UK, not derived from any externally-licensed pre-trained model.
+## Contact
+MediaStream AI Limited
+West Tower, 371 Deansgate, Manchester M15 4UR, United Kingdom
+[mediastreamai.com](https://mediastreamai.com)