johnsonchromia commited on
Commit
e0858da
·
verified ·
1 Parent(s): 252cc13

README: redirect Ollama users to ollama.com registry (sharded-GGUF issue 5245)

Browse files
Files changed (1) hide show
  1. README.md +9 -3
README.md CHANGED
@@ -66,12 +66,18 @@ root. See **Vision** below.
66
  - llama.cpp: pass `--jinja`. Gemma 4 thinking mode is on by default; set
67
  `enable_thinking: false` in chat-template kwargs for shorter replies.
68
 
 
 
 
 
 
 
69
  ## Run
70
 
71
  ```bash
72
- # Ollama (defaults to Q4_K_M)
73
- ollama pull hf.co/evalengine/unbound-e4b-GGUF
74
- ollama run hf.co/evalengine/unbound-e4b-GGUF
75
  ```
76
 
77
  ```bash
 
66
  - llama.cpp: pass `--jinja`. Gemma 4 thinking mode is on by default; set
67
  `enable_thinking: false` in chat-template kwargs for shorter replies.
68
 
69
+ For Ollama specifically, pull from the **Ollama Registry** —
70
+ `ollama pull hf.co/...` [doesn't yet support sharded GGUFs](https://github.com/ollama/ollama/issues/5245).
71
+ The registry version is a single-file Q4_K_M with a bundled Modelfile
72
+ (`temperature=0.6, top_p=0.95, top_k=64, repeat_penalty=1.05, num_ctx=8192`
73
+ and an identity-grounding system prompt).
74
+
75
  ## Run
76
 
77
  ```bash
78
+ # Ollama Registry (single-file Q4_K_M, identity-grounded Modelfile)
79
+ ollama pull evalengine/unbound-e4b
80
+ ollama run evalengine/unbound-e4b
81
  ```
82
 
83
  ```bash