THARX commited on
Commit
3067009
Β·
1 Parent(s): 8e6dc17

docs: clean up all external model listings for pure standalone THAR.0X identity

Browse files
Files changed (3) hide show
  1. Modelfile +0 -8
  2. README.md +1 -29
  3. config.json +0 -9
Modelfile CHANGED
@@ -6,14 +6,6 @@
6
  # β•‘ 1. Install Ollama: curl -fsSL https://ollama.com/install.sh | sh β•‘
7
  # β•‘ 2. Build model: ollama create THAR.0X -f Modelfile β•‘
8
  # β•‘ 3. Run: ollama run THAR.0X β•‘
9
- # β•‘ β•‘
10
- # β•‘ Change the FROM line to use a different base model: β•‘
11
- # β•‘ Best quality: FROM qwen2.5:32b β•‘
12
- # β•‘ Recommended: FROM qwen2.5:14b β•‘
13
- # β•‘ Default/Fast: FROM llama3.2 β•‘
14
- # β•‘ Creative: FROM mistral β•‘
15
- # β•‘ Coding: FROM qwen2.5-coder:14b β•‘
16
- # β•‘ Ultra-light: FROM llama3.2:1b β•‘
17
  # β•šβ•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•
18
 
19
  FROM llama3.2
 
6
  # β•‘ 1. Install Ollama: curl -fsSL https://ollama.com/install.sh | sh β•‘
7
  # β•‘ 2. Build model: ollama create THAR.0X -f Modelfile β•‘
8
  # β•‘ 3. Run: ollama run THAR.0X β•‘
 
 
 
 
 
 
 
 
9
  # β•šβ•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•
10
 
11
  FROM llama3.2
README.md CHANGED
@@ -44,12 +44,9 @@ ollama create THAR.0X -f Modelfile
44
 
45
  # Run it
46
  ollama run THAR.0X
47
-
48
- # Use a more powerful base:
49
- # Edit the first line of Modelfile to: FROM qwen2.5:14b
50
- # Then rebuild: ollama create THAR.0X -f Modelfile
51
  ```
52
 
 
53
  **Available via API after creating:**
54
  ```bash
55
  curl http://localhost:11434/api/chat -d '{
@@ -69,12 +66,6 @@ curl http://localhost:11434/api/chat -d '{
69
  5. Set parameters from `config.json` β†’ inference section
70
  6. Chat β€” THAR.0X is now the active persona
71
 
72
- **Best models to use in LM Studio:**
73
- - `Qwen2.5-14B-Instruct-Q5_K_M.gguf` β€” best balance
74
- - `Qwen2.5-32B-Instruct-Q4_K_M.gguf` β€” highest quality
75
- - `Llama-3.2-3B-Instruct-Q8_0.gguf` β€” fastest
76
- - `Mistral-7B-Instruct-v0.3-Q5_K_M.gguf` β€” creative tasks
77
-
78
  ---
79
 
80
  ### 3. llama.cpp
@@ -203,25 +194,6 @@ def chat(message):
203
 
204
  print(chat("Who are you?"))
205
  ```
206
-
207
- ---
208
-
209
- ## Recommended Base Models
210
-
211
- | Model | Size | Best For | Speed |
212
- |---|---|---|---|
213
- | `qwen2.5:32b` | 32B | Highest quality reasoning | Slow |
214
- | `qwen2.5:14b` | 14B | Best balance | Medium |
215
- | `llama3.2` | 3B | Fast, always available | Fast |
216
- | `mistral:7b` | 7B | Creative + conversational | Medium |
217
- | `qwen2.5-coder:14b` | 14B | Code + technical | Medium |
218
- | `llama3.2:1b` | 1B | Minimal hardware (4GB RAM) | Very fast |
219
-
220
- **Rule of thumb:** Use the largest model your hardware can run at full context (8192 tokens).
221
- - 8GB RAM β†’ llama3.2 or mistral:7b
222
- - 16GB RAM β†’ qwen2.5:14b
223
- - 32GB+ RAM β†’ qwen2.5:32b
224
-
225
  ---
226
 
227
  ## What Makes THAR.0X Different
 
44
 
45
  # Run it
46
  ollama run THAR.0X
 
 
 
 
47
  ```
48
 
49
+
50
  **Available via API after creating:**
51
  ```bash
52
  curl http://localhost:11434/api/chat -d '{
 
66
  5. Set parameters from `config.json` β†’ inference section
67
  6. Chat β€” THAR.0X is now the active persona
68
 
 
 
 
 
 
 
69
  ---
70
 
71
  ### 3. llama.cpp
 
194
 
195
  print(chat("Who are you?"))
196
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
197
  ---
198
 
199
  ## What Makes THAR.0X Different
config.json CHANGED
@@ -24,15 +24,6 @@
24
  "eos_token": "<|end_of_text|>"
25
  },
26
 
27
- "recommended_base_models": [
28
- { "model": "qwen2.5:32b", "reason": "Best reasoning, most powerful" },
29
- { "model": "qwen2.5:14b", "reason": "Best speed/quality balance" },
30
- { "model": "llama3.2", "reason": "Default, always available" },
31
- { "model": "mistral", "reason": "Rich language generation" },
32
- { "model": "qwen2.5-coder:14b", "reason": "Technical and coding tasks" },
33
- { "model": "llama3.2:1b", "reason": "Minimal hardware" }
34
- ],
35
-
36
  "lm_studio": {
37
  "preset": "custom",
38
  "notes": "Paste contents of system_prompt.txt into the System Prompt field in LM Studio. Use the inference parameters above in the model settings."
 
24
  "eos_token": "<|end_of_text|>"
25
  },
26
 
 
 
 
 
 
 
 
 
 
27
  "lm_studio": {
28
  "preset": "custom",
29
  "notes": "Paste contents of system_prompt.txt into the System Prompt field in LM Studio. Use the inference parameters above in the model settings."