FoolDev committed
Commit 3d2e907 · 1 Parent(s): b20f7c9

Modelfile: point FROM at bundled Janus-27B.Q4_K_M.gguf

The repo now ships Janus-27B.Q4_K_M.gguf and Janus-27B.Q3_K_S.gguf
(commits b0d8482 and 4c20ab5), so a fresh clone with LFS smudge on
already has a working GGUF on disk. The previous default
(./Qwen3.6-27B-Q4_K_M.gguf) was a file that didn't exist after a
clone, forcing users to either run make build or hand-edit the
Modelfile before ollama create would work.
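
As a rough illustration of the "LFS smudge on" caveat, the sketch below checks whether the file named on the Modelfile's FROM line is a real GGUF or just a Git LFS pointer stub. The helper names `modelfile_from` and `gguf_status` are hypothetical and not part of this repo; the check assumes only that GGUF files begin with the ASCII magic `GGUF` and that LFS pointer files begin with a `version https://git-lfs...` line.

```shell
#!/bin/sh
# Hypothetical helpers (not shipped in this repo).

# Print the path given on the first FROM line of a Modelfile.
# Comment lines are skipped because their first field is "#".
modelfile_from() {
    awk '$1 == "FROM" { print $2; exit }' "$1"
}

# Classify a file as "missing", "gguf", "lfs-pointer", or "unknown".
gguf_status() {
    if [ ! -f "$1" ]; then
        echo missing
    elif [ "$(head -c 4 "$1")" = "GGUF" ]; then
        # Real GGUF files start with the four ASCII bytes "GGUF".
        echo gguf
    elif head -n 1 "$1" | grep -q '^version https://git-lfs'; then
        # A clone made without LFS smudge leaves a small text pointer here.
        echo lfs-pointer
    else
        echo unknown
    fi
}
```

Running `gguf_status "$(modelfile_from Modelfile)"` from the repo root should print `gguf` on a clone where smudge ran. If it prints `lfs-pointer`, `git lfs pull` fetches the actual weights before `ollama create` is attempted.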

Comment block refreshed: removed the stale 'does not redistribute
weights' line (contradicted by the bundled GGUFs) and added a
pointer to make build QUANT=Q3_K_S for the smaller quant.

scripts/build.sh keeps patching FROM in a temp Modelfile copy, so
existing make build flows are unchanged regardless of which path
the FROM points at on disk.
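
For readers unfamiliar with the temp-copy approach, here is a minimal sketch of the idea. It is not the actual contents of scripts/build.sh; the function name `patched_modelfile` and the `GGUF_TARGET` variable are assumptions made for illustration.

```shell
#!/bin/sh
# Hypothetical sketch (not the repo's scripts/build.sh).
# Write a copy of a Modelfile whose first FROM line points at a different
# GGUF, leave the original untouched, and print the copy's path.
patched_modelfile() {
    src=$1
    gguf=$2
    tmp=$(mktemp) || return 1
    # Pass the path through the environment so awk does not interpret
    # backslash escapes in it, as it would with -v assignments.
    GGUF_TARGET=$gguf awk '
        !seen && $1 == "FROM" { print "FROM " ENVIRON["GGUF_TARGET"]; seen = 1; next }
        { print }
    ' "$src" > "$tmp"
    echo "$tmp"
}
```

A caller could then run something like `ollama create janus-27b -f "$(patched_modelfile Modelfile /abs/path/to/model.gguf)"`. Because the copy lives in a temp directory, an absolute GGUF path is the safer choice here. Since only the copy is rewritten, the FROM line committed in the Modelfile has no effect on this flow, which is the property the commit message relies on.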

Files changed (2)
  1. CHANGELOG.md +12 -0
  2. Modelfile +9 -8
CHANGELOG.md CHANGED
@@ -7,6 +7,18 @@ and documentation**, not the underlying base model.
 
  ## [Unreleased]
 
+ ### Fixed
+ - `Modelfile`: change FROM from `./Qwen3.6-27B-Q4_K_M.gguf` to
+ `./Janus-27B.Q4_K_M.gguf`. The repo now ships GGUFs (commit b0d8482
+ added Q4_K_M, commit 4c20ab5 added Q3_K_S), so the no-script path
+ `git clone && ollama create janus-27b -f Modelfile` works
+ out-of-the-box for users with LFS smudge enabled. Comment block
+ refreshed: removed the stale "does not redistribute weights" line
+ (contradicted by the bundled GGUFs) and added a pointer to
+ `make build QUANT=Q3_K_S` for the smaller quant. `scripts/build.sh`
+ still patches FROM in a temp Modelfile copy, so existing build
+ flows are unchanged.
+
  ### Added
  - `examples/ollama_chat.py`: respect `MODEL` and `HOST` env vars
  (matches the convention scripts/smoke_test.sh and scripts/bench.sh
Modelfile CHANGED
@@ -5,20 +5,21 @@
  # missing the qwen35 arch entries). Use llama.cpp directly for image
  # input, or wait for the fix. See the Vision section in README.md.
  #
- # This repo does not redistribute weights. Edit the FROM line below to
- # point at a local Qwen 3.6 27B GGUF, then:
+ # This repo bundles two GGUFs: Janus-27B.Q4_K_M.gguf (~17 GB, default)
+ # and Janus-27B.Q3_K_S.gguf (~12 GB, smaller-footprint option). The FROM
+ # line below points at the bundled Q4_K_M, so a fresh clone (with LFS
+ # smudge enabled) supports the no-script path:
  #
  # ollama create janus-27b -f Modelfile && ollama run janus-27b
  #
- # Recommended GGUF source:
- # https://huggingface.co/unsloth/Qwen3.6-27B-GGUF
+ # To use the smaller quant, edit FROM to ./Janus-27B.Q3_K_S.gguf, or run
+ # `make build QUANT=Q3_K_S` which patches FROM in a temp Modelfile copy.
  #
- # Or a community Opus-distilled variant:
+ # Other GGUF sources (use with `make build GGUF_PATH=...`):
+ # https://huggingface.co/unsloth/Qwen3.6-27B-GGUF
  # https://huggingface.co/rico03/Qwen3.6-27B-Claude-Opus-Reasoning-Distilled-GGUF
- #
- # Replace the path below with wherever you keep the GGUF.
 
- FROM ./Qwen3.6-27B-Q4_K_M.gguf
+ FROM ./Janus-27B.Q4_K_M.gguf
 
  # Chat template — Qwen 3.6 ChatML in Ollama Go-template form, with the
  # tool-calling blocks Ollama's capability detector looks for. Without a