Commit History

card: note tool-call support via lmstudio-community jinja template
1fc8705
verified

3ndetz commited on

same template fix for bf16
03f5d5b
verified

3ndetz commited on

gguf: replace chat template with lmstudio-community version that works in minijinja-based clients (Mac LM Studio). Fixes tool-call rendering error: Cannot call something that is not a function: got UndefinedValue
e239a3f
verified

3ndetz commited on

add: standalone LoRA-f16 GGUF (10.7 MB) for use with --lora flag against any Gemma 4 E2B base GGUF
ca87de5
verified

3ndetz commited on

fix: re-merge bf16 with restored tied weights (601 tensors)
7aecb10
verified

3ndetz commited on

fix: re-merge + re-quantize Q4 β€” restored 60 missing tensors that peft merge_and_unload dropped (Gemma 4 MatFormer KV-sharing). Fixes Mac LM Studio load error
a3997a0
verified

3ndetz commited on

fix: re-merge + re-quantize Q4 β€” restored 60 missing tensors (attn_k/v/k_norm for layers 15-34) that peft merge_and_unload dropped due to Gemma 4 MatFormer KV-sharing. Fixes Mac LM Studio load error
525a76e
verified

3ndetz commited on

card: imatrix-aware Q4 + drop misleading sampler discussion
2bd9855
verified

3ndetz commited on

Q4_K_M: re-quantized WITH imatrix (unsloth calibration, 141 chunks/275 entries) β€” fixes prior version which lost importance-aware quantization
51e662b
verified

3ndetz commited on

card: document --reasoning-budget 0 + GGUF metadata audit
7ca8087
verified

3ndetz commited on

remove deprecated bf16 mmproj β€” use f16 instead
3f79fa8
verified

3ndetz commited on

gguf: same patch for bf16 (sampling defaults + thinking-off chat template)
5594581
verified

3ndetz commited on

gguf: patch sampling defaults (T=0.7, top_p=0.9, top_k=0) + chat template (thinking off by default unless caller passes enable_thinking=True)
92505b8
verified

3ndetz commited on

add eval_vision_v3.json
9b74484
verified

3ndetz commited on

add eval_gguf_q4_retry.json
a2ebd95
verified

3ndetz commited on

add eval_gguf_q4.json
b6ae095
verified

3ndetz commited on

add F16 mmproj (standard format) alongside legacy bf16
1cb22a7
verified

3ndetz commited on

add Q4_K_M quantization (3.2 GB, recommended for end users)
2e354b3
verified

3ndetz commited on

card: Q4 quant impact + math/tool/vision capability tests
683c897
verified

3ndetz commited on

card: document mmproj sidecar + multimodal usage examples
bee691c
verified

3ndetz commited on

add mmproj (vision + audio projectors) bf16 β€” required for multimodal inference in llama.cpp
6d6e7eb
verified

3ndetz commited on

eval: raw json results
cd7cddf
verified

3ndetz commited on

eval: full base vs SFT vs final side-by-side
8d8265f
verified

3ndetz commited on

card: 3-stage eval (base/SFT/final) + ppl tables + honest failure modes
f748766
verified

3ndetz commited on

Upload eval_examples.md with huggingface_hub
9f303d9
verified

3ndetz commited on

Upload zoomerlm-gemma4-e2b-bf16.gguf with huggingface_hub
eb0af9a
verified

3ndetz commited on

Upload folder using huggingface_hub
1eccef1
verified

3ndetz commited on

Upload README.md with huggingface_hub
985f2bc
verified

3ndetz commited on

initial commit
5fdfb77
verified

3ndetz commited on