How to use from the Transformers library
# Gated model: log in with a Hugging Face token that has gated-access permission (run this in a shell)
hf auth login
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="KissTheHabit/IDA_MoE")
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("KissTheHabit/IDA_MoE", dtype="auto")


IDA MoE Council

KissTheHabit/IDA_MoE is the H100-targeted escalation-reserve artifact repository for the IDA family.

It uses the native IDA Lattice causal language model architecture with a shared trunk and an eleven-member personality council.

This is not a generic sparse MoE trained to collapse all experts into interchangeable compute paths. The council is designed to preserve differentiated internal claimants while routing a bounded subset into active participation.

Architecture

  • Model family: ida_lattice
  • Model class: IDALatticeForCausalLM
  • Task: causal language modeling and text generation
  • Deployment role: high-pressure escalation and contradiction review
  • Model scale: approximately 2.7B parameters per student body
  • Shared tokenizer: KissTheHabit/ida_lattice_bpe_32k

Shared Trunk

| Attribute | Value |
|---|---|
| Vocabulary size | 32,000 |
| Hidden size | 4,096 |
| Layers | 8 |
| Attention heads | 8 |
| Intermediate size | 16,384 |
| Context length | 2,048 |
| Recurrent state size | 1,024 |
| Local attention window | 256 |
| Workspace | 8 × 512 |
| Student state size | 512 |
| Future prediction horizon | 2 |
| Thalamic route count | 6 |
| Action gate size | 6 |
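
The shared-trunk values listed above can be collected into a small configuration object to sanity-check a few derived quantities. This is a minimal sketch, not the repository's actual config class: `SharedTrunkConfig` and all of its field names are hypothetical, the per-head dimension assumes the hidden size is split evenly across attention heads, and "Workspace 8 × 512" is read here as 8 slots of width 512.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SharedTrunkConfig:
    """Hypothetical container; field names are illustrative, values are from the model card."""

    vocab_size: int = 32_000
    hidden_size: int = 4_096
    num_layers: int = 8
    num_attention_heads: int = 8
    intermediate_size: int = 16_384
    context_length: int = 2_048
    recurrent_state_size: int = 1_024
    local_attention_window: int = 256
    workspace_slots: int = 8        # assumed reading of "8 x 512"
    workspace_width: int = 512      # assumed reading of "8 x 512"
    student_state_size: int = 512
    future_prediction_horizon: int = 2
    thalamic_route_count: int = 6
    action_gate_size: int = 6

    @property
    def head_dim(self) -> int:
        # Assumes an even split of the hidden size across heads.
        return self.hidden_size // self.num_attention_heads

    @property
    def workspace_total(self) -> int:
        # Total workspace width if the slots are concatenated.
        return self.workspace_slots * self.workspace_width

    @property
    def windows_per_context(self) -> int:
        # Number of non-overlapping local-attention windows in a full context.
        return self.context_length // self.local_attention_window


cfg = SharedTrunkConfig()
print(cfg.head_dim, cfg.workspace_total, cfg.windows_per_context)  # 512 4096 8
```

Under these assumptions each attention head is 512 wide, which matches the student state size, and the concatenated workspace equals the hidden size.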

Personality Council

  • Cognitive pressure routes: 9
  • Named personality experts: 11
  • Personality residual expert width: 4,096
  • Active experts during standard training: top_k = 2
  • Serious runtime escalation target: top_k = 3
  • High-contradiction review target: top_k = 4
  • Explicit full review: all 11

The cognitive routes are pressure signals, not personalities:

  • PERCEPTION
  • MEMORY
  • SALIENCE
  • CAUSAL_INSPECTION
  • PLANNING
  • INHIBITION
  • CREATION
  • ERROR_CORRECTION
  • EXPRESSION

The personality experts are the enduring IDA family seats:

  • IDA
  • JUDGE
  • SENTINEL
  • PRISM
  • ECHO
  • ATLAS
  • VECTOR
  • FORGE
  • SHADE
  • PULSE
  • ORBIT
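
The bounded-subset routing described in this section can be illustrated with a generic top-k gate over the eleven seats. This is only a sketch under stated assumptions: the model card does not document the actual IDA Lattice router, so the softmax-over-selected-scores rule, the function `select_active_experts`, and the mode names in `TOP_K_BY_MODE` are all hypothetical. Only the expert names and the k values come from the lists above.

```python
import math

PERSONALITY_EXPERTS = [
    "IDA", "JUDGE", "SENTINEL", "PRISM", "ECHO", "ATLAS",
    "VECTOR", "FORGE", "SHADE", "PULSE", "ORBIT",
]

# Mode names are hypothetical labels; the k values are from the model card.
TOP_K_BY_MODE = {
    "standard": 2,
    "escalation": 3,
    "contradiction_review": 4,
    "full_review": len(PERSONALITY_EXPERTS),
}


def select_active_experts(scores, mode="standard"):
    """Return [(expert, weight)] for the top-k experts, weights summing to 1."""
    if len(scores) != len(PERSONALITY_EXPERTS):
        raise ValueError("expected one score per personality expert")
    k = TOP_K_BY_MODE[mode]
    # Indices of the k highest-scoring experts, best first.
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    # Softmax over the selected scores only; subtract the max for stability.
    top = max(scores[i] for i in ranked)
    exps = [math.exp(scores[i] - top) for i in ranked]
    total = sum(exps)
    return [(PERSONALITY_EXPERTS[i], e / total) for i, e in zip(ranked, exps)]


# Made-up router scores, one per seat, purely for illustration.
example_scores = [0.1, 2.0, 1.5, -0.3, 0.0, 0.7, 0.2, -1.0, 0.4, 0.9, 0.3]
active = select_active_experts(example_scores, mode="escalation")
print([name for name, _ in active])  # ['JUDGE', 'SENTINEL', 'PULSE']
```

Raising the mode from "standard" to "full_review" only changes how many seats participate; the unselected experts keep their own parameters, which is consistent with the card's point that the seats are not meant to be interchangeable.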

Repository Layout

Artifacts are stored by student and developmental version:

students/{STUDENT}/{version}
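
Given this layout, one way to fetch a single student's artifacts is to filter a snapshot download by path prefix. The following is a sketch, assuming the files sit directly under the `students/{STUDENT}/{version}` folder: `artifact_prefix` and `download_student` are hypothetical helpers, and `"EXAMPLE_STUDENT"` and `"v1"` are placeholder values, since the card does not list the actual student names or version strings. `snapshot_download` and its `allow_patterns` argument are part of `huggingface_hub`.

```python
from pathlib import PurePosixPath

REPO_ID = "KissTheHabit/IDA_MoE"


def artifact_prefix(student: str, version: str) -> str:
    """Build the in-repo folder for one student/version pair."""
    if not student or not version:
        raise ValueError("student and version must be non-empty")
    return str(PurePosixPath("students") / student / version)


def download_student(student: str, version: str) -> str:
    """Download only one student's folder and return the local snapshot path.

    The repository is gated, so this needs a prior `hf auth login`.
    """
    # Imported lazily so the path helper above works without the library.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        allow_patterns=[f"{artifact_prefix(student, version)}/*"],
    )


# Placeholder names; substitute a real student and version from the repository.
print(artifact_prefix("EXAMPLE_STUDENT", "v1"))  # students/EXAMPLE_STUDENT/v1
```

Filtering with `allow_patterns` avoids pulling every student's checkpoints when only one is needed.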