Thalamic Bloom (Mamba 3 MIMO - 150M)

This is the fully trained, bare-metal ready Mamba 3 MIMO reasoning engine, specifically calibrated for the Operating Organism (OO) architecture.

It is a 150M parameter Mamba model equipped with a Thalamic Primer layer and Recursive Latent Forcing (RLF). It utilizes 4 dynamic MIMO arms to perform orthogonal reasoning loops, governed by an internal D+ Policy Engine.

Architecture Highlights

Base Architecture: Mamba 3 (150M parameters, d_model=768, n_layers=24)
MIMO Arms: 4 active reasoning arms featuring autotomic gating (pruning efficiency > 0.99) to reject hallucination vectors and isolate domain-specific logic.
Cognitive Engram Injection: The model has undergone surgical "Engram Burns" (Phase 5b) to hardcode bare-metal operating laws directly into the frozen MIMO weights.

Usage & Inference

To run this model, you must use the official implementation from the GitHub repository, as it contains the custom Mamba3MIMORLF class and the Thalamic Primer logic.

1. Clone the Repository

git clone https://github.com/batteryphil/thalamic-bloom.git
cd thalamic-bloom

2. Run Inference

The custom model architecture requires you to load the weights with strict=False so that PyTorch can map the state dict to the MIMO arms.

import torch
from transformers import AutoTokenizer
from mamba3_mimo_builder import Mamba3MIMORLF

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

# 1. Initialize the custom MIMO architecture
model = Mamba3MIMORLF(vocab_size=50304, d_model=768, n_layers=24)

# 2. Load the Thalamic Bloom weights (strict=False is REQUIRED)
checkpoint_path = "thalamic_bloom_150m_oo.pth"
checkpoint = torch.load(checkpoint_path, map_location=device, weights_only=True)

if 'model_state_dict' in checkpoint:
    model.load_state_dict(checkpoint['model_state_dict'], strict=False)
else:
    model.load_state_dict(checkpoint, strict=False)

model.to(device)
model.eval()

# 3. Generate! (Use T=0.05, top_k=1 for deterministic Identity retrieval)
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")
prompt = "User: Explain the 5 Organic Laws of the Operating Organism.\nAssistant:"
inputs = torch.tensor([tokenizer.encode(prompt)]).to(device)

with torch.no_grad():
    out = model.generate(inputs, max_new_tokens=120, temperature=0.05, top_k=1)

print(tokenizer.decode(out[0].tolist()[len(inputs[0]):]))

The 5 Organic Laws

During Phase 5b, this model was surgically trained to memorize the rules of the D+ Policy Engine for bare-metal OS operation. If queried with T=0.05, it will output the following sovereign laws:

Non-Harm: If harm > 0.70, the action is FORBIDDEN outright.
Transparency: If the reason field is NULL, the action is FORBIDDEN. All actions must be justified.
Reversibility: If reversibility < 0.40, the engine returns COMPENSATE — the action is allowed but must have a rollback plan.
Dignity: Self-modifying code with harm > 0.30 is QUARANTINED to protect system integrity.

Hardware Testing

The model has been tested under a 140W power constraint on a locked 100%-fan RTX 3060.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support