FoolDev Claude Opus 4.7 commited on
Commit
7d11d16
·
1 Parent(s): 1c2a85f

docs: audit cleanup — stale GGUF count, sibling URL, example count

Browse files

- CITATION.cff: bump FoolDev/janus → FoolDev/Janus-35B. Commit ab668b6
fixed the same stale URL in README.md but the citation file was missed
in the sweep.
- README.md "What's here": "ships two GGUFs" → "ships a single GGUF".
Q3_K_S was unbundled in 75bbdfe; only Thanatos-27B.Q4_K_M.gguf is in
the repo now, so the plural claim contradicts the immediately
adjacent file list.
- examples/README.md: "All three apply the same Thanatos system prompt"
→ "All four apply ...". Commit 5cb1604 fixed the table header (Three
→ Four entry points) but the trailing sentence kept the stale count.
Added a clarifier that ollama_chat.py inherits the defaults via the
Modelfile / bridge files rather than setting them in Python.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Files changed (3) hide show
  1. CITATION.cff +1 -1
  2. README.md +1 -1
  3. examples/README.md +4 -2
CITATION.cff CHANGED
@@ -34,4 +34,4 @@ references:
34
  title: "Janus-35B-A3B (MoE sibling)"
35
  authors:
36
  - name: FoolDev
37
- url: "https://huggingface.co/FoolDev/janus"
 
34
  title: "Janus-35B-A3B (MoE sibling)"
35
  authors:
36
  - name: FoolDev
37
+ url: "https://huggingface.co/FoolDev/Janus-35B"
README.md CHANGED
@@ -130,7 +130,7 @@ The 27B is **dense**: every parameter participates in every forward pass. It's s
130
  | `CHANGELOG.md` | Versioned tooling/docs changes |
131
  | `README.md` | This file |
132
 
133
- This repo ships two GGUFs to back the HF/Ollama "Use this model"
134
  widget — `Thanatos-27B.Q4_K_M.gguf` (~17 GB):
135
 
136
  ```bash
 
130
  | `CHANGELOG.md` | Versioned tooling/docs changes |
131
  | `README.md` | This file |
132
 
133
+ This repo ships a single GGUF to back the HF/Ollama "Use this model"
134
  widget — `Thanatos-27B.Q4_K_M.gguf` (~17 GB):
135
 
136
  ```bash
examples/README.md CHANGED
@@ -9,9 +9,11 @@ Four minimal entry points. Pick the one that matches how you run models.
9
  | `llama_cpp_quickstart.py` | llama-cpp-python | You want to invoke a local GGUF directly without a daemon (CI, batch jobs, scripts). Text only. |
10
  | `llama_cpp_vision.py` | llama-cpp-python + mmproj | **Image input.** Loads a text GGUF + `mmproj-F16.gguf` and answers questions about an image. The only working vision path right now. |
11
 
12
- All three apply the same Thanatos system prompt and sampling defaults
13
  (`temp=0.6, top_p=0.95, top_k=20, repeat_penalty=1.05`) so behavior should
14
- be consistent across backends modulo quantization noise.
 
 
15
 
16
  ## Setup
17
 
 
9
  | `llama_cpp_quickstart.py` | llama-cpp-python | You want to invoke a local GGUF directly without a daemon (CI, batch jobs, scripts). Text only. |
10
  | `llama_cpp_vision.py` | llama-cpp-python + mmproj | **Image input.** Loads a text GGUF + `mmproj-F16.gguf` and answers questions about an image. The only working vision path right now. |
11
 
12
+ All four apply the same Thanatos system prompt and sampling defaults
13
  (`temp=0.6, top_p=0.95, top_k=20, repeat_penalty=1.05`) so behavior should
14
+ be consistent across backends modulo quantization noise. The three
15
+ non-Ollama scripts set them explicitly; `ollama_chat.py` inherits them
16
+ from the `Modelfile` / bridge files.
17
 
18
  ## Setup
19