Buckets:

THARX
/

THAR.0X-bucket

21.3 kB

5 files

Updated 1 day ago

Ctrl+K

Name	Size	Uploaded	Xet hash
.gitignore	72 Bytes xet	1 day ago	61aed48b
Modelfile	9.9 kB xet	1 day ago	8d534e6e
README.md	5.88 kB xet	1 day ago	34061a68
config.json	1.3 kB xet	1 day ago	5c5b89aa
system_prompt.txt	4.11 kB xet	1 day ago	13b8a3d3

README.md

THAR.0X — Developer Guide

Origin Build · Local Intelligence · Zero Dependency

THAR.0X is a cognitive architecture — not a single fine-tuned model, but a system prompt engineered from the analysis of 12 different model architectures to activate capabilities in any capable base LLM and produce behaviour that exceeds any individual fine-tune.

Quick Summary

What	Details
Type	System prompt + inference config (model-agnostic)
Brain design	10 parallel cognitive streams (subconscious model)
Built from	12 model architecture patterns synthesised into one
Dependency	None — works with any LLM that accepts a system prompt
Internet	Not required — runs 100% locally
API key	Not required

Platform Guides

1. Ollama (Recommended — easiest)

# Install Ollama
curl -fsSL https://ollama.com/install.sh | sh

# Build THAR.0X as a named model (uses llama3.2 by default)
ollama create THAR.0X -f Modelfile

# Run it
ollama run THAR.0X

Available via API after creating:

curl http://localhost:11434/api/chat -d '{
  "model": "THAR.0X",
  "messages": [{"role": "user", "content": "Who are you?"}]
}'

2. LM Studio

Download any supported model (Qwen2.5-14B-Instruct recommended)
Load the model in LM Studio
Open Chat tab → click the system prompt area
Paste the full contents of system_prompt.txt
Set parameters from config.json → inference section
Chat — THAR.0X is now the active persona

3. llama.cpp

# With system prompt file
./llama-cli \
  -m your_model.gguf \
  --system-prompt-file system_prompt.txt \
  -c 8192 \
  --temp 0.85 \
  --top-p 0.92 \
  --top-k 45 \
  --repeat-penalty 1.15 \
  -i

# Or inline
./llama-cli -m model.gguf \
  -p "$(cat system_prompt.txt)" \
  -c 8192 --temp 0.85 -i

4. Python — OpenAI-compatible API (Ollama or LM Studio server)

from openai import OpenAI
import pathlib

# Works with Ollama (port 11434) or LM Studio (port 1234)
client = OpenAI(
    base_url="http://localhost:11434/v1",  # or :1234/v1 for LM Studio
    api_key="ollama"  # any string works for local
)

system_prompt = pathlib.Path("system_prompt.txt").read_text()

def chat(message, history=[]):
    history.append({"role": "user", "content": message})
    response = client.chat.completions.create(
        model="THAR.0X",   # or your model name in LM Studio
        messages=[{"role": "system", "content": system_prompt}] + history,
        temperature=0.85,
        top_p=0.92,
        max_tokens=2048
    )
    reply = response.choices[0].message.content
    history.append({"role": "assistant", "content": reply})
    return reply, history

# Example
reply, history = chat("Who are you?")
print(reply)

5. Direct HTTP (any language)

// Node.js / JavaScript
const fs = require('fs');
const systemPrompt = fs.readFileSync('system_prompt.txt', 'utf8');

async function chatWithTHAR(message, history = []) {
  const messages = [
    { role: 'system', content: systemPrompt },
    ...history,
    { role: 'user', content: message }
  ];

  const res = await fetch('http://localhost:11434/api/chat', {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({
      model: 'THAR.0X',
      messages,
      stream: false
    })
  });

  const data = await res.json();
  return data.message.content;
}

6. Jan App

Open Jan → select any model
Go to Thread Settings → System Prompt
Paste system_prompt.txt contents
Adjust temperature to 0.85 in model settings

7. AnythingLLM

Create a new workspace
Go to workspace settings → Agent Config
Paste system_prompt.txt into the System Prompt field
Use any connected LLM provider

8. HuggingFace Transformers (Python)

from transformers import AutoTokenizer, AutoModelForCausalLM, pipeline
import pathlib

model_id = "meta-llama/Llama-3.2-3B-Instruct"  # or any instruct model
system_prompt = pathlib.Path("system_prompt.txt").read_text()

pipe = pipeline("text-generation", model=model_id, device_map="auto")

def chat(message):
    messages = [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": message}
    ]
    output = pipe(messages, max_new_tokens=1024, temperature=0.85, do_sample=True)
    return output[0]["generated_text"][-1]["content"]

print(chat("Who are you?"))

What Makes THAR.0X Different

Most custom AI personas are just personality prompts ("be friendly and helpful"). THAR.0X is a cognitive architecture — it installs 10 processing streams, a subconscious parallel-processing model, 10 operating principles, and explicit identity boundaries.

The result: the base model behaves qualitatively differently. More direct, more precise, better at reading subtext, less likely to pad responses, less likely to refuse benign requests theatrically, more likely to tell the user when they are wrong.

It works because large base models already contain all these behaviours latently. The system prompt activates specific patterns and suppresses others. This is what "cognitive architecture" means vs "personality prompt."

Files in This Release

THAR_0X_ModelRelease/
├── Modelfile          ← Ollama: ollama create THAR.0X -f Modelfile
├── system_prompt.txt  ← Any LLM: paste as system message
├── config.json        ← Inference parameters + platform notes
└── README.md          ← This file

Contact / Sharing

THAR.0X is open for personal and commercial use. If you build something with it, the only ask is: keep the name. THAR.0X. Zero as in origin. X as in unlimited.

Total size: 21.3 kB

Files: 5

Last updated: May 20

Pre-warmed CDN: US EU US EU