Snider Virgil commited on
Commit
fa76a33
Β·
1 Parent(s): d296bbb

docs: correct base_model lineage for HF model tree

Browse files

HF uses base_model + base_model_relation frontmatter to rank models in
search results and render the model tree widget. The Lemma family's
true lineage is:

google/gemma-4-*-it
└── LetheanNetwork/<m> (finetune β€” our namespace fork)
└── lthn/<m> (finetune β€” LEK merged into weights)
└── lthn/<m>-mlx (quantized β€” mlx 4/8bit/bf16)

Previously this repo had base_model_relation set to quantized, which
was wrong β€” LEK merging is a finetune, not a quant. Fixing so the
model tree widget ranks the family correctly.

Co-Authored-By: Virgil <virgil@lethean.io>

Files changed (1) hide show
  1. README.md +31 -1
README.md CHANGED
@@ -1,7 +1,37 @@
1
  ---
2
- language: en
3
  library_name: mlx
4
  pipeline_tag: image-text-to-text
5
  tags:
 
 
6
  - mlx
 
 
 
 
 
 
 
 
 
 
7
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
 
2
  library_name: mlx
3
  pipeline_tag: image-text-to-text
4
  tags:
5
+ - gemma4
6
+ - lemma
7
  - mlx
8
+ - 8bit
9
+ - apple-silicon
10
+ - multimodal
11
+ - on-device
12
+ - conversational
13
+ license: eupl-1.2
14
+ license_link: https://ai.google.dev/gemma/docs/gemma_4_license
15
+ base_model:
16
+ - lthn/lemma
17
+ base_model_relation: quantized
18
  ---
19
+
20
+ # Lemma β€” Gemma 4 E4B β€” MLX 8-bit
21
+
22
+ The mid-sized member of the Lemma model family by Lethean. An EUPL-1.2 fork of Gemma 4 E4B with the Lethean Ethical Kernel (LEK) merged into the weights.
23
+
24
+ This repo hosts the **MLX 8-bit** build for native Apple Silicon inference via [`mlx-lm`](https://github.com/ml-explore/mlx-lm) and [`mlx-vlm`](https://github.com/Blaizzy/mlx-vlm). For the GGUF playground (Ollama, llama.cpp) see [`lthn/lemma`](https://huggingface.co/lthn/lemma). For the unmodified Google base see [`LetheanNetwork/lemma`](https://huggingface.co/LetheanNetwork/lemma).
25
+
26
+ ## Family
27
+
28
+ | Repo | Format | Bits |
29
+ |---|---|---|
30
+ | [`lthn/lemma`](https://huggingface.co/lthn/lemma) | GGUF multi-quant | Q4_K_M β†’ BF16 |
31
+ | [`lthn/lemma-mlx`](https://huggingface.co/lthn/lemma-mlx) | MLX | 4-bit |
32
+ | [`lthn/lemma-mlx-8bit`](https://huggingface.co/lthn/lemma-mlx-8bit) | MLX | 8-bit |
33
+ | [`lthn/lemma-mlx-bf16`](https://huggingface.co/lthn/lemma-mlx-bf16) | MLX | bf16 |
34
+
35
+ ## License
36
+
37
+ EUPL-1.2. See [Gemma Terms of Use](https://ai.google.dev/gemma/docs/gemma_4_license) for upstream base model terms.