FoolDev commited on
Commit
70c2f62
·
1 Parent(s): 5cb1604

Fix stale references to pre-bundled-GGUF era

Browse files

examples/README.md:
- 'Three minimal entry points' -> 'Four' (table lists 4)
- 'Or build locally with a different upstream quant' snippet was stale:
it told users to hf-download Q4_K_M (now bundled) and manually edit
../Modelfile FROM (bundled file already points at ./Janus-27B.Q4_K_M.gguf).
Replaced with the canonical 'make build' flow plus a note that
'make build QUANT=...' handles non-bundled quants by patching FROM
into a temp Modelfile copy automatically.

CITATION.cff:
- Abstract claimed 'weights ... rather than redistributed' but commits
b0d8482 (Q4_K_M) and 4c20ab5 (Q3_K_S) added bundled GGUFs. Updated
to acknowledge the two redistributed quants while noting other quants
+ safetensors are still pulled on demand.

CHANGELOG: log all three fixes.

Files changed (3) hide show
  1. CHANGELOG.md +15 -0
  2. CITATION.cff +5 -3
  3. examples/README.md +13 -5
CHANGELOG.md CHANGED
@@ -33,6 +33,21 @@ and documentation**, not the underlying base model.
33
  the repo now ships, so the no-edit flow `ollama create janus-27b
34
  -f ../Modelfile` works out of the box. Replaced the four-step list
35
  with two paths (bundled-GGUF or pull-from-HF) covering both flows.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
36
 
37
  ### Added
38
  - `Modelfile` hardware notes: log a measured data point for the
 
33
  the repo now ships, so the no-edit flow `ollama create janus-27b
34
  -f ../Modelfile` works out of the box. Replaced the four-step list
35
  with two paths (bundled-GGUF or pull-from-HF) covering both flows.
36
+ - `examples/README.md`: header said "Three minimal entry points" but
37
+ the table lists four (`ollama_chat.py`, `transformers_quickstart.py`,
38
+ `llama_cpp_quickstart.py`, `llama_cpp_vision.py`).
39
+ - `examples/README.md` "Or build locally with a different upstream
40
+ quant" snippet was stale: it told users to `hf download` Q4_K_M (now
41
+ bundled) and manually edit `../Modelfile` `FROM` (the bundled file
42
+ already points at `./Janus-27B.Q4_K_M.gguf`). Replaced with the
43
+ canonical `make build` flow, plus a note that `make build QUANT=...`
44
+ handles a non-bundled quant by patching `FROM` in a temp Modelfile
45
+ copy automatically (matches what `scripts/build.sh` actually does).
46
+ - `CITATION.cff` abstract: claimed "weights are pulled from upstream …
47
+ rather than redistributed" but commits b0d8482 (Q4_K_M) and 4c20ab5
48
+ (Q3_K_S) added bundled GGUFs to the repo. Updated to acknowledge the
49
+ two redistributed quants while noting other quants + safetensors are
50
+ still pulled on demand.
51
 
52
  ### Added
53
  - `Modelfile` hardware notes: log a measured data point for the
CITATION.cff CHANGED
@@ -10,9 +10,11 @@ url: "https://huggingface.co/FoolDev/janus-27b"
10
  abstract: >-
11
  Janus-27B is a personal repackaging of the dense Qwen 3.6 27B base model
12
  with Claude Opus 4.7 in the reasoning teacher slot. The repository ships
13
- an Ollama Modelfile, sampling defaults, and usage examples; weights are
14
- pulled from upstream (Qwen/Qwen3.6-27B safetensors or
15
- unsloth/Qwen3.6-27B-GGUF quants) rather than redistributed.
 
 
16
  keywords:
17
  - qwen
18
  - qwen3.6
 
10
  abstract: >-
11
  Janus-27B is a personal repackaging of the dense Qwen 3.6 27B base model
12
  with Claude Opus 4.7 in the reasoning teacher slot. The repository ships
13
+ an Ollama Modelfile, sampling defaults, usage examples, and two
14
+ ready-to-run GGUFs (Q4_K_M ~17 GB and Q3_K_S ~12 GB) so the HF "Use
15
+ this model" widget surfaces a one-liner Ollama snippet. Other quants
16
+ and the upstream safetensors (Qwen/Qwen3.6-27B) are pulled from
17
+ upstream on demand rather than redistributed.
18
  keywords:
19
  - qwen
20
  - qwen3.6
examples/README.md CHANGED
@@ -1,6 +1,6 @@
1
  # Janus-27B examples
2
 
3
- Three minimal entry points. Pick the one that matches how you run models.
4
 
5
  | File | Backend | When to use |
6
  |---|---|---|
@@ -26,12 +26,20 @@ pip install requests
26
  MODEL=hf.co/FoolDev/janus-27b python ollama_chat.py
27
  ```
28
 
29
- Or build locally with a different upstream quant:
 
30
 
31
  ```bash
32
- hf download unsloth/Qwen3.6-27B-GGUF Qwen3.6-27B-Q4_K_M.gguf --local-dir .
33
- # edit ../Modelfile -> FROM ./Qwen3.6-27B-Q4_K_M.gguf
34
- ollama create janus-27b -f ../Modelfile
 
 
 
 
 
 
 
35
  python ollama_chat.py
36
  ```
37
 
 
1
  # Janus-27B examples
2
 
3
+ Four minimal entry points. Pick the one that matches how you run models.
4
 
5
  | File | Backend | When to use |
6
  |---|---|---|
 
26
  MODEL=hf.co/FoolDev/janus-27b python ollama_chat.py
27
  ```
28
 
29
+ Or build locally from this repo (uses the bundled `Janus-27B.Q4_K_M.gguf`,
30
+ no edits required):
31
 
32
  ```bash
33
+ cd .. && make build && cd examples
34
+ python ollama_chat.py
35
+ ```
36
+
37
+ For a quant the repo doesn't bundle (e.g. Q5_K_M), `make build` will
38
+ fetch it from `unsloth/Qwen3.6-27B-GGUF` and patch the `Modelfile`
39
+ `FROM` line into a temp copy automatically:
40
+
41
+ ```bash
42
+ cd .. && make build QUANT=Q5_K_M && cd examples
43
  python ollama_chat.py
44
  ```
45