johnsonchromia commited on
Commit
019c702
·
verified ·
1 Parent(s): 1a2b188

README: redirect Ollama users to ollama.com registry (sharded-GGUF issue 5245)

Browse files
Files changed (1) hide show
  1. README.md +8 -6
README.md CHANGED
@@ -50,17 +50,19 @@ root — load it alongside any LM quant for image input. See **Vision** below.
50
  - llama.cpp: pass `--jinja`. Gemma 4 thinking mode is on by default; set
51
  `enable_thinking: false` in chat-template kwargs for shorter replies.
52
 
53
- Ollama: `ollama pull hf.co/...` uses a bundled Modelfile with
54
- `temperature=0.6, top_p=0.95, top_k=64, repeat_penalty=1.05, num_ctx=8192`
55
- and an identity-grounding system prompt. Override per-session with
 
 
56
  `/set parameter temperature 0.3` etc.
57
 
58
  ## Run
59
 
60
  ```bash
61
- # Ollama (defaults to Q4_K_M)
62
- ollama pull hf.co/evalengine/unbound-e2b-GGUF
63
- ollama run hf.co/evalengine/unbound-e2b-GGUF
64
  ```
65
 
66
  ```bash
 
50
  - llama.cpp: pass `--jinja`. Gemma 4 thinking mode is on by default; set
51
  `enable_thinking: false` in chat-template kwargs for shorter replies.
52
 
53
+ For Ollama specifically, pull from the **Ollama Registry**
54
+ `ollama pull hf.co/...` [doesn't yet support sharded GGUFs](https://github.com/ollama/ollama/issues/5245).
55
+ The registry version is a single-file Q4_K_M with a bundled Modelfile
56
+ (`temperature=0.6, top_p=0.95, top_k=64, repeat_penalty=1.05, num_ctx=8192`
57
+ and an identity-grounding system prompt). Override per-session with
58
  `/set parameter temperature 0.3` etc.
59
 
60
  ## Run
61
 
62
  ```bash
63
+ # Ollama Registry (single-file Q4_K_M, identity-grounded Modelfile)
64
+ ollama pull evalengine/unbound-e2b
65
+ ollama run evalengine/unbound-e2b
66
  ```
67
 
68
  ```bash