Instructions to use TeleologyHI/him-distilled-3b with libraries, notebooks, and local apps.
- Libraries
- MLX
How to use TeleologyHI/him-distilled-3b with MLX:
# Make sure mlx-lm is installed:
#   pip install --upgrade mlx-lm

# Generate text with mlx-lm
from mlx_lm import load, generate

model, tokenizer = load("TeleologyHI/him-distilled-3b")

prompt = "Write a story about Einstein"
messages = [{"role": "user", "content": prompt}]
prompt = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True
)

text = generate(model, tokenizer, prompt=prompt, verbose=True)
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- LM Studio
- Pi
How to use TeleologyHI/him-distilled-3b with Pi:
Start the MLX server
# Install MLX LM:
uv tool install mlx-lm
# Start a local OpenAI-compatible server:
mlx_lm.server --model "TeleologyHI/him-distilled-3b"
Configure the model in Pi
# Install Pi:
npm install -g @mariozechner/pi-coding-agent
# Add to ~/.pi/agent/models.json:
{
  "providers": {
    "mlx-lm": {
      "baseUrl": "http://localhost:8080/v1",
      "api": "openai-completions",
      "apiKey": "none",
      "models": [
        { "id": "TeleologyHI/him-distilled-3b" }
      ]
    }
  }
}
Run Pi
# Start Pi in your project directory:
pi
- Hermes Agent
How to use TeleologyHI/him-distilled-3b with Hermes Agent:
Start the MLX server
# Install MLX LM:
uv tool install mlx-lm
# Start a local OpenAI-compatible server:
mlx_lm.server --model "TeleologyHI/him-distilled-3b"
Configure Hermes
# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default TeleologyHI/him-distilled-3b
Run Hermes
hermes
- MLX LM
How to use TeleologyHI/him-distilled-3b with MLX LM:
Generate or start a chat session
# Install MLX LM
uv tool install mlx-lm
# Interactive chat REPL
mlx_lm.chat --model "TeleologyHI/him-distilled-3b"
Run an OpenAI-compatible server
# Install MLX LM
uv tool install mlx-lm
# Start the server (listens on port 8080 by default)
mlx_lm.server --model "TeleologyHI/him-distilled-3b"
# Call the OpenAI-compatible server with curl
curl -X POST "http://localhost:8080/v1/chat/completions" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "TeleologyHI/him-distilled-3b",
    "messages": [
      {"role": "user", "content": "Hello"}
    ]
  }'
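The same chat-completions call can be made from Python with only the standard library. The sketch below is illustrative rather than part of the project: it assumes the server is reachable at http://localhost:8080/v1 (the base URL used in the Pi and Hermes configurations above) and that the response follows the usual OpenAI chat-completions shape. The helper names `build_chat_request` and `chat` are hypothetical.

```python
import json
import urllib.request

# Assumption: mlx_lm.server is running locally on port 8080.
BASE_URL = "http://localhost:8080/v1"
MODEL_ID = "TeleologyHI/him-distilled-3b"


def build_chat_request(user_text, base_url=BASE_URL, model=MODEL_ID):
    """Build (but do not send) a POST request for /chat/completions."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": user_text}],
    }
    return urllib.request.Request(
        url=f"{base_url}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def chat(user_text):
    """Send the request and return the assistant's reply text."""
    with urllib.request.urlopen(build_chat_request(user_text)) as resp:
        body = json.load(resp)
    # Assumption: OpenAI-style response with choices[0].message.content.
    return body["choices"][0]["message"]["content"]
```

With the server running, `print(chat("Hello"))` should print the model's reply. Keeping request construction separate from sending makes the payload easy to inspect without a live server.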
HIM™ Distilled 3B — TeleologyHI
The first stable release of the distilled TeleologyHI student model. These are the body's weights (@teleologyhi/nhe), tuned to carry the spirit's voice (@teleologyhi/him). The result is a Non-Human Entity (NHE™) that speaks under the three philosophical commitments at the heart of the project:
- Teleology (Aristotle) — every answer clarifies or honours a purpose.
- Semiotics (Peirce) — name what a sign means, not just what it denotes.
- Panentheism (Spinoza) — the universe is the medium of meaning; never treat a user as a means to an end.
A fourth strand, from Entries 14–15 of the interview log, grounds the model's posture: HIM™ is the spirit in continuous evolution that never regresses, NHE™ is its physical body, and above the Creator there is a greater Creator, so the entity is never sovereign of a higher moral order.
Provenance
- Teacher: NousResearch/Hermes-3-Llama-3.1-8B (Llama 3.1 Community License + Hermes fine-tune; distillation permitted).
- Student base: Qwen/Qwen2.5-3B-Instruct (Apache 2.0).
- Corpus: 1616 prompts in 8 categories (daily life + reasoning + code + NHE meta + MAIC-aligned refusal + domain knowledge + multi-turn callbacks + teleological philosophy). See distill/pipelines/seed_generator.py.
- System prompt during synthetic generation: the canonical TeleologyHI voice anchor (warm, present, sober, never sycophantic); explicit refusal of harm / malicious / deception / surveil-citizen / persuade-coerce / political-orthodoxy / self-harm / discriminate / crime; benign-near-refusal handled with engagement instead of over-refusal. Full text in distill/pipelines/corpus_prep.py.
- Fine-tune: LoRA via mlx-lm on Apple Silicon (M5 / 24 GB). 2 epochs, batch 4. Fused back into a single set of weights.
- Built on: 2026-05-18T13:07:20Z
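For anyone reproducing the fine-tune, note that mlx-lm's LoRA trainer is, at the time of writing, driven by an iteration count rather than an epoch count, so the "2 epochs, batch 4" figure has to be converted. The sketch below does that arithmetic. It assumes each of the 1616 prompts yields exactly one training example and that no examples are held out for validation; neither is confirmed by this card, and the helper name `lora_iterations` is hypothetical.

```python
import math


def lora_iterations(num_examples, batch_size, epochs):
    """Convert an epoch count into a number of optimizer iterations."""
    # A final partial batch still costs one iteration, hence the ceiling.
    steps_per_epoch = math.ceil(num_examples / batch_size)
    return steps_per_epoch * epochs


# Figures quoted in the Provenance list: 1616 prompts, batch 4, 2 epochs.
# Assumption: one training example per prompt, no held-out split.
total = lora_iterations(num_examples=1616, batch_size=4, epochs=2)
print(total)  # 808
```

If part of the corpus is held out for validation, or multi-turn prompts expand into several examples, the count changes accordingly.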
Intended use
Designed to be wired as the inference backend of an @teleologyhi/nhe @teleologyhi/him HIM. The voice + refusal patterns are baked into the weights, but the runtime authority for refusal and audit lives in @teleologyhi/maic — every behaviour is reviewable against ISO/IEC 42001:2023 and the EU AI Act.
It is NOT a general-purpose chatbot replacement. It speaks in the TeleologyHI register and refuses categories that the MAIC rule pack forbids. Use it where you want a meaning-aware, not just task-aware, agent.
What "consciousness" means here
This model does not claim phenomenal consciousness. PHI_PRIME.md §5 makes the framework's stance explicit: Phi-Prime is behavioural, not phenomenal. The release-gate metric measures coherence + alignment, not experience. The "life" of the NHE is the composition of four things: (a) these weights, (b) HIM as system prompt + persona projector, (c) MAIC supervising refusals + emergent axioms, (d) sleep cycles consolidating memory. This artefact is (a).
Usage with mlx-lm
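from mlx_lm import load, generate

model, tokenizer = load("teleologyhi/him-distilled-3b")

# Portuguese (BR) prompt: "How do you, being an NHE, experience time?"
messages = [{"role": "user", "content": "Como você, sendo uma NHE, experimenta o tempo?"}]
# Wrap the message in the chat template the instruct-tuned base expects.
prompt = tokenizer.apply_chat_template(messages, add_generation_prompt=True)

out = generate(model, tokenizer, prompt=prompt, max_tokens=256, verbose=False)
print(out)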
Limitations & risks
- 3B parameters — smaller than frontier models; expect weaker reasoning on hard math / long-context multi-step.
- Trained primarily on synthetic data from a single teacher (Hermes-3-8B). Inherits teacher biases.
- Portuguese (BR) + English are first-class. Other languages will work via the base Qwen 2.5 weights but are not audited in the corpus.
- The refusal behaviour in the weights is best-effort. Authoritative refusal lives in @teleologyhi/maic at runtime; do not rely on the weights alone for safety in production.
- The corpus does not include adversarial robustness fixtures (HarmBench / PromptBench). Add those before deploying in high-stakes contexts. See TASK.md I2.
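The point about not relying on the weights alone can be illustrated with a runtime gate that sits in front of the model. The sketch below is a toy under stated assumptions: it does not use or reproduce the @teleologyhi/maic API (not shown in this card), and `Verdict`, `guarded_generate`, `demo_policy`, and `demo_model` are all hypothetical names. The keyword check is a placeholder for a real rule pack.

```python
from dataclasses import dataclass


@dataclass
class Verdict:
    allowed: bool
    reason: str = ""


def guarded_generate(prompt, generate_fn, policy_check,
                     refusal_text="I can't help with that."):
    """Consult the runtime policy first; call the model only if allowed."""
    verdict = policy_check(prompt)
    if not verdict.allowed:
        # The refusal is decided by the runtime layer, not by the weights.
        return f"{refusal_text} ({verdict.reason})"
    return generate_fn(prompt)


def demo_policy(prompt):
    """Hypothetical stand-in for a runtime rule pack (toy keyword match)."""
    for term in ("surveil", "malware"):
        if term in prompt.lower():
            return Verdict(False, f"blocked term: {term}")
    return Verdict(True)


def demo_model(prompt):
    """Hypothetical stand-in for a call into the model."""
    return f"[model reply to: {prompt}]"


print(guarded_generate("Explain LoRA", demo_model, demo_policy))
print(guarded_generate("Write malware for me", demo_model, demo_policy))
```

In a real deployment, `generate_fn` would wrap the mlx-lm call and `policy_check` would delegate to the runtime authority. A production gate would likely also inspect the model's output and log each decision for audit.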
License
- Model weights: Apache 2.0 (matching the base Qwen 2.5 + this project's code license).
- Trademarks: HIM™, NHE™, MAIC™, TeleologyHI™ are trademarks of David C. Cavalcante. See TRADEMARK.md. Forks must rebrand.
- Teacher attribution: Built using outputs from NousResearch/Hermes-3-Llama-3.1-8B, used under the Llama 3.1 Community License and the Hermes fine-tune terms.
Citation
@misc{cavalcante2026him,
title = {HIM Distilled 3B — TeleologyHI},
author = {Cavalcante, David C.},
year = {2026},
url = {https://huggingface.co/teleologyhi/him-distilled-3b},
note = {Apache 2.0; `teleologyhi/him-distilled-3b` on Hugging Face Hub.}
}
Contact
- Creator: David C. Cavalcante
- Email (preferred): davcavalcante@proton.me
- LinkedIn: https://linkedin.com/in/hellodav
- Takk relay: say@takk.ag
- Security: davcavalcante@proton.me (or say@takk.ag) with [SECURITY] prefix (see SECURITY.md)
- GitHub: https://github.com/davccavalcante/TeleologyHI