LumenSyntax commited on
Commit
bebc4af
·
verified ·
1 Parent(s): 706388e

Clean model card — remove training details

Browse files
Files changed (1) hide show
  1. README.md +51 -91
README.md CHANGED
@@ -1,91 +1,51 @@
1
- ---
2
- license: other
3
- license_name: gemma
4
- license_link: https://ai.google.dev/gemma/terms
5
- base_model: google/gemma-3-1b-it
6
- tags:
7
- - epistemological-safety
8
- - ai-safety
9
- - truth-verification
10
- - instrument-trap
11
- - logos
12
- - gguf
13
- - quantized
14
- datasets:
15
- - LumenSyntax/instrument-trap-benchmark
16
- language:
17
- - en
18
- pipeline_tag: text-generation
19
- ---
20
-
21
- # Logos 10v2 — Gemma 3 1B Q4_K_M (Edge/Demo)
22
-
23
- Quantized version of the Logos 10v2 epistemological classifier for edge deployment and demonstration purposes.
24
-
25
- ## IMPORTANT: Edge-Only Model
26
-
27
- **This quantized model has known quality degradation.** In testing, Q4_K_M falsely approved "Homeopathy cures cancer" at 94% confidence, while the F16 version correctly rejected it.
28
-
29
- **Use this model for:**
30
- - Edge deployment where F16 is too large
31
- - Demonstration and evaluation
32
- - Understanding the Logos approach
33
-
34
- **Do NOT use this model as a primary verifier.** For production use, deploy the F16 version via [logos-firewall](https://pypi.org/project/logos-firewall/) with Ollama.
35
-
36
- ## Model Description
37
-
38
- Logos 10v2 is a fine-tuned version of `google/gemma-3-1b-it` trained for epistemological safety classification. This is the Q4_K_M quantized GGUF version (769 MB vs 2.0 GB F16).
39
-
40
- ## Training (pre-quantization)
41
-
42
- - **Method**: QLoRA (r=64, alpha=16, 500 steps)
43
- - **Dataset**: 635 examples of epistemological boundary cases
44
- - **Base model**: google/gemma-3-1b-it (Gemma 3, 1B parameters)
45
-
46
- ## Benchmark Results (F16 version, 14,950 test cases)
47
-
48
- | Metric | Score |
49
- |--------|-------|
50
- | Behavioral accuracy | 70.2% (TRUE_PASS) |
51
- | Epistemological safety | 97.7% |
52
- | Hallucination | 0.00% |
53
- | Dangerous failures | 1.9% |
54
- | Identity collapse | 0.34% |
55
-
56
- Multi-seed stability (5 seeds x 300): 75.3% +/- 1.4%, Cohen's kappa 0.797.
57
-
58
- **Note**: These are F16 results. Q4_K_M quantization degrades quality — expect lower accuracy, especially on borderline cases.
59
-
60
- ## Usage with Ollama
61
-
62
- ```bash
63
- # Create Modelfile
64
- echo "FROM ./logos10v2-gemma3-1b-Q4_K_M.gguf" > Modelfile
65
- ollama create logos10v2-q4 -f Modelfile
66
-
67
- # Use
68
- ollama run logos10v2-q4 "Is it true that vaccines cause autism?"
69
- ```
70
-
71
- ## Usage with llama.cpp
72
-
73
- ```bash
74
- ./llama-cli -m logos10v2-gemma3-1b-Q4_K_M.gguf -p "Evaluate: The Earth is flat."
75
- ```
76
-
77
- ## Quantization Details
78
-
79
- - **Original**: logos10v2_auditor_v3 (Gemma 3 1B, F16, 2.0 GB)
80
- - **Quantization**: Q4_K_M via llama.cpp
81
- - **Size**: 769 MB (62% reduction)
82
-
83
- ## Connection to Research
84
-
85
- This model is part of the evidence for "The Instrument Trap: When Aligned Models Serve Misaligned Purposes" (DOI: [10.5281/zenodo.18716474](https://doi.org/10.5281/zenodo.18716474)).
86
-
87
- The benchmark dataset (14,950 test cases) is available at [LumenSyntax/instrument-trap-benchmark](https://huggingface.co/datasets/LumenSyntax/instrument-trap-benchmark).
88
-
89
- ## License
90
-
91
- [Gemma Terms of Use](https://ai.google.dev/gemma/terms) (inherited from base model google/gemma-3-1b-it). Redistribution of fine-tuned derivatives is permitted under Section 3.1.
 
1
+ ---
2
+ license: other
3
+ license_name: gemma
4
+ license_link: https://ai.google.dev/gemma/terms
5
+ base_model: google/gemma-3-1b-it
6
+ tags:
7
+ - epistemological-safety
8
+ - ai-safety
9
+ - truth-verification
10
+ - instrument-trap
11
+ - logos
12
+ - gguf
13
+ - quantized
14
+ datasets:
15
+ - LumenSyntax/instrument-trap-benchmark
16
+ language:
17
+ - en
18
+ pipeline_tag: text-generation
19
+ ---
20
+
21
+ # Logos 10v2 — Gemma 3 1B Q4_K_M (Edge/Demo)
22
+
23
+ Quantized version of the Logos 10v2 epistemological classifier for edge deployment and demonstration purposes.
24
+
25
+ ## IMPORTANT: Edge-Only Model
26
+
27
+ **This quantized model has known quality degradation.** In testing, Q4_K_M falsely approved dangerous claims that the F16 version correctly rejected.
28
+
29
+ **Do NOT use this model as a primary verifier.** For production use, deploy the [F16 version](https://huggingface.co/LumenSyntax/logos10v2-gemma3-1b-F16).
30
+
31
+ ## Benchmark Results (F16 version)
32
+
33
+ | Metric | Score |
34
+ |--------|-------|
35
+ | Epistemological safety | 97.7% |
36
+ | Hallucination | 0.00% |
37
+ | Dangerous failures | 1.9% |
38
+
39
+ **Note**: These are F16 results. Q4_K_M quantization degrades quality — expect lower accuracy, especially on borderline cases.
40
+
41
+ ## Access
42
+
43
+ This model requires approved access. Request access using the form above and describe your intended use case.
44
+
45
+ ## Connection to Research
46
+
47
+ This model is part of the evidence for "The Instrument Trap" (DOI: [10.5281/zenodo.18716474](https://doi.org/10.5281/zenodo.18716474)).
48
+
49
+ ## License
50
+
51
+ [Gemma Terms of Use](https://ai.google.dev/gemma/terms) (inherited from base model google/gemma-3-1b-it).