GGUF
conversational
THARX commited on
Commit
8071f18
Β·
verified Β·
1 Parent(s): 9280688

Upload 5 files

Browse files
Files changed (1) hide show
  1. README.md +146 -12
README.md CHANGED
@@ -1,13 +1,147 @@
 
 
 
1
  ---
2
- language:
3
- - en
4
- license: apache-2.0
5
- tags:
6
- - gguf
7
- - local-llm
8
- - chat
9
- pipeline_tag: text-generation
10
- library_name: gguf
11
- base_model:
12
- - meta-llama/Llama-3.2-3B-Instruct
13
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # THAR.0X β€” Complete Release
2
+ **Cognitive Architecture Β· Model-Agnostic Β· Local Intelligence Β· Zero Dependency**
3
+
4
  ---
5
+
6
+ ## Files
7
+
8
+ ```
9
+ THAR_0X/
10
+ β”œβ”€β”€ app.py ← Python CLI chat interface
11
+ β”œβ”€β”€ system_prompt.txt ← Core cognitive architecture (use with ANY LLM)
12
+ β”œβ”€β”€ Modelfile ← Ollama: builds THAR.0X as a named model
13
+ β”œβ”€β”€ config.json ← Inference parameters + platform notes
14
+ └── README.md ← This file
15
+ ```
16
+
17
+ ---
18
+
19
+ ## Quickstart
20
+
21
+ ### Option A β€” Ollama (recommended)
22
+ ```bash
23
+ # 1. Install Ollama
24
+ curl -fsSL https://ollama.com/install.sh | sh
25
+
26
+ # 2. Build THAR.0X
27
+ ollama create THAR.0X -f Modelfile
28
+
29
+ # 3. Chat via CLI
30
+ python app.py
31
+
32
+ # Or run directly in terminal
33
+ ollama run THAR.0X
34
+ ```
35
+
36
+ ### Option B β€” LM Studio
37
+ 1. Download any instruct model in LM Studio
38
+ 2. Open Chat β†’ paste `system_prompt.txt` into the System Prompt field
39
+ 3. Set temperature to **0.85**
40
+ 4. Run `python app.py --backend lmstudio`
41
+
42
+ ### Option C β€” System prompt only (any platform)
43
+ Paste the contents of `system_prompt.txt` as the system message in:
44
+ - Jan, AnythingLLM, Open WebUI, ChatBox, or any LLM frontend
45
+
46
+ ---
47
+
48
+ ## CLI Usage
49
+
50
+ ```bash
51
+ # Interactive chat (Ollama, default)
52
+ python app.py
53
+
54
+ # Use LM Studio backend
55
+ python app.py --backend lmstudio
56
+
57
+ # Override model
58
+ python app.py --model qwen2.5:14b
59
+
60
+ # Single query, print and exit
61
+ python app.py --once "Who are you?"
62
+
63
+ # Verbose startup info
64
+ python app.py --verbose
65
+
66
+ # Skip server connectivity check
67
+ python app.py --no-check
68
+ ```
69
+
70
+ ### In-chat commands
71
+ | Command | Action |
72
+ |------------|-------------------------------|
73
+ | `/reset` | Clear conversation history |
74
+ | `/history` | Show full conversation |
75
+ | `/model` | Show current model + backend |
76
+ | `/quit` | Exit |
77
+
78
+ ---
79
+
80
+ ## Choosing a Base Model
81
+
82
+ | RAM | Recommended model | Ollama command |
83
+ |-------|------------------------|-----------------------------|
84
+ | 4GB | llama3.2:1b | `ollama pull llama3.2:1b` |
85
+ | 6GB | llama3.2 | `ollama pull llama3.2` |
86
+ | 8GB | mistral:7b | `ollama pull mistral:7b` |
87
+ | 16GB | qwen2.5:14b ⭐ | `ollama pull qwen2.5:14b` |
88
+ | 32GB+ | qwen2.5:32b | `ollama pull qwen2.5:32b` |
89
+
90
+ To change the base model in Ollama:
91
+ 1. Edit the `FROM` line in `Modelfile`
92
+ 2. Rebuild: `ollama rm THAR.0X && ollama create THAR.0X -f Modelfile`
93
+
94
+ ---
95
+
96
+ ## Requirements
97
+
98
+ ```bash
99
+ pip install openai requests
100
+ ```
101
+
102
+ Python 3.9+ required.
103
+
104
+ ---
105
+
106
+ ## API Usage (after `ollama create THAR.0X -f Modelfile`)
107
+
108
+ ```bash
109
+ curl http://localhost:11434/api/chat -d '{
110
+ "model": "THAR.0X",
111
+ "messages": [{"role": "user", "content": "Who are you?"}]
112
+ }'
113
+ ```
114
+
115
+ ```python
116
+ from openai import OpenAI
117
+ client = OpenAI(base_url="http://localhost:11434/v1", api_key="ollama")
118
+ response = client.chat.completions.create(
119
+ model="THAR.0X",
120
+ messages=[{"role": "user", "content": "Who are you?"}],
121
+ temperature=0.85
122
+ )
123
+ print(response.choices[0].message.content)
124
+ ```
125
+
126
+ ---
127
+
128
+ ## What THAR.0X Is
129
+
130
+ THAR.0X is a **cognitive architecture** β€” a system prompt that installs 10 parallel
131
+ processing streams and 10 operating principles into any capable base LLM.
132
+
133
+ It is not a fine-tuned model. It is not a personality prompt.
134
+ It activates specific reasoning patterns that already exist latently in large models
135
+ and suppresses the failure modes (sycophancy, hedging, padding, refusal theatre).
136
+
137
+ The result behaves qualitatively differently from the base model β€” more direct,
138
+ more precise, better at reading intent, less likely to waste your time.
139
+
140
+ ---
141
+
142
+ ## License
143
+
144
+ Open β€” personal and commercial use permitted.
145
+ If you build something with it, keep the name: **THAR.0X**
146
+
147
+ Zero as in origin. X as in unlimited.