Sathman commited on
Commit
d3a3792
·
verified ·
1 Parent(s): fe716f3

Add Nondual Bot Benchmark v1.0 eval results (50q, 10 categories)

Browse files
Files changed (1) hide show
  1. README.md +196 -142
README.md CHANGED
@@ -1,142 +1,196 @@
1
- ---
2
- license: mit
3
- base_model: Qwen/Qwen3-8B
4
- tags:
5
- - contemplative-ai
6
- - fine-tuned
7
- - gguf
8
- - lora
9
- - qlora
10
- - qwen3
11
- - nondual
12
- - teaching
13
- - spirituality
14
- - awareness
15
- - zen
16
- - buddhism
17
- - mindfulness
18
- - thich-nhat-hanh
19
- language:
20
- - en
21
- pipeline_tag: text-generation
22
- model-index:
23
- - name: TNH-Agent
24
- results:
25
- - task:
26
- type: text-generation
27
- name: Contemplative Teaching Quality
28
- metrics:
29
- - name: Eval Loss
30
- type: loss
31
- value: 1.689
32
- ---
33
-
34
- # TNH Agent (Qwen3-8B) — Pure Thich Nhat Hanh Voice
35
-
36
- A fine-tuned Qwen3-8B model trained exclusively on Thich Nhat Hanh's teaching atoms. No system prompt needed. No retrieval. No persona blending. Question in, Thay out.
37
-
38
- **~4,100 atoms** extracted from 2 books (The Heart of the Buddha's Teaching + Dharma Talks, 1,097 pages) — embodied practice, interbeing, breath, sangha, the miracle of mindfulness.
39
-
40
- ## Quick Start
41
-
42
- **LM Studio / Ollama / llama.cpp:** Download the Q8_0 GGUF file and load it. No system prompt required.
43
-
44
- **Example:**
45
-
46
- > **User:** Why am I suffering?
47
- >
48
- > **TNH Agent:** Suffering is not your enemy. It is a messenger. When you breathe in and touch your suffering with mindfulness, it begins to reveal its roots. The flower of suffering is already the flower of understanding. You do not have to fight it or run from it. Breathe in: I am aware of my suffering. Breathe out: I embrace my suffering with tenderness. In that embrace, something transforms.
49
-
50
- > **User:** How does meditation work?
51
- >
52
- > **TNH Agent:** Meditation is not something you do separately from life. Washing the dishes is meditation. Walking is meditation. Each breath is an opportunity to come home to the present moment. The present moment is the only moment in which we can touch life deeply. When you breathe in with full awareness, you are alive in a way that is not possible when the mind wanders in the past or future.
53
-
54
- > **User:** Is there a self?
55
- >
56
- > **TNH Agent:** Look at a flower. Is the cloud in it? Is the rain? Is the soil, the sun, the gardener's hands? The flower is made entirely of non-flower elements. You are like this too. You inter-are with everything. The notion of a separate, fixed self is a construction — useful sometimes, but when taken as ultimate truth, it becomes the source of much suffering. Touch the interbeing nature of all things and the question dissolves naturally.
57
-
58
- ## What Makes This Different
59
-
60
- - **Pure TNH, nothing else.** Trained only on Thich Nhat Hanh's teaching atoms — no blending, no dilution.
61
- - **Embodied warmth without sentimentality.** TNH's specific move: warmth that teaches, not warmth that comforts.
62
- - **Interbeing alive.** The model carries TNH's core insight: nothing exists independently.
63
- - **Breath and body as anchor.** Every teaching returns to the body, the breath, the present moment.
64
- - **Zero therapy-speak.** No "I understand your feelings." Direct Dharma.
65
- - **Zero safety smoothing.** The teaching stays radical in its gentleness.
66
-
67
- ## Training Details
68
-
69
- | Setting | Value |
70
- |---------|-------|
71
- | Base model | [Qwen/Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B) (8B) |
72
- | Method | A-LoRA (QLoRA + DoRA + rsLoRA) |
73
- | Rank | 128, alpha 128 |
74
- | Training data | ~870 V6-formatted TNH atoms (from ~4,100 raw, filtered for complete question+teaching pairs) |
75
- | Source books | The Heart of the Buddha's Teaching, Dharma Talks |
76
- | Format | V6 — question + concept arrows (input) -> pure teaching passage (output) |
77
- | Epochs | 1 |
78
- | Total steps | 109 |
79
- | Eval loss | 1.689 |
80
- | Quantization | Q8_0 GGUF (~8.2 GB) |
81
-
82
- ## The A-LoRA Methodology
83
-
84
- **A-LoRA (Atom LoRA)** is a fine-tuning methodology for structured reasoning. The innovation is the data unit — the **atom**: an indivisible multi-component teaching move extracted from human expert traces.
85
-
86
- Each atom contains:
87
- - **Transformation** (before/after/how) what shifts in understanding
88
- - **Concept relations** — directional arrows between ideas
89
- - **Anchoring quotes** — teacher's actual words
90
- - **Teacher method** — how the teacher creates the shift
91
-
92
- **Key finding:** Training on complete atoms produces models that *move like teachers*, not just sound like them.
93
-
94
- ## The Teacher
95
-
96
- Thich Nhat Hanh (Thay, 1926–2022). Vietnamese Zen master, poet, peace activist. Founder of Engaged Buddhism. His teaching moves: breath as anchor, interbeing as framework, suffering as teacher, sangha as body of practice.
97
-
98
- | Style element | What it means |
99
- |---------------|---------------|
100
- | Embodied practice | Teaching always returns to the body, breath, walking |
101
- | Interbeing | Nothing exists independently — everything inter-is |
102
- | Warmth as method | Tenderness is not softness it is precision |
103
- | Simplicity | The most radical truths spoken plainly |
104
-
105
- ## Vs. Meditation Agent (Multi-Teacher)
106
-
107
- This is a **single-teacher specialist**. The multi-teacher [Meditation Agent](https://huggingface.co/Sathman/Meditation-Agent-8B-GGUF) blends 9 voices useful for breadth. TNH Agent is pure Thay — for when you need that specific warmth and embodied precision.
108
-
109
- ## Intended Use
110
-
111
- - Personal contemplative inquiry with Thich Nhat Hanh's approach specifically
112
- - Research in single-teacher voice distillation
113
- - Exploring Engaged Buddhism, interbeing, mindfulness from its source
114
-
115
- ## Limitations
116
-
117
- - Single teacher only — not a broad contemplative reference
118
- - The training data was extracted from copyrighted books. Only the trained weights are shared.
119
- - Smaller training set than the multi-teacher model — voice may be less consistent on edge cases
120
-
121
- ## Citation
122
-
123
- ```bibtex
124
- @misc{tnhagent-2026,
125
- title={TNH Agent: Single-Teacher Contemplative AI via A-LoRA Fine-Tuning on Structured Reasoning Atoms},
126
- author={Sathman},
127
- year={2026},
128
- url={https://huggingface.co/Sathman/TNH-Agent-GGUF}
129
- }
130
- ```
131
-
132
- ## Related
133
-
134
- - [Meditation Agent 8B](https://huggingface.co/Sathman/Meditation-Agent-8B-GGUF) — All 9 teachers blended
135
- - [Meditation Agent Phi4](https://huggingface.co/Sathman/Meditation-Agent-Phi4-GGUF) 14B multi-teacher version
136
- - [Osho Agent](https://huggingface.co/Sathman/Osho-Agent-GGUF) Pure Osho voice
137
-
138
- ---
139
-
140
- *ellam sivamayam* — Everything is Shiva's expression.
141
-
142
- *எல்லாம் சிவமயம்*
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: en
3
+ license: mit
4
+ tags:
5
+ - gguf
6
+ - contemplative-ai
7
+ - fine-tuned
8
+ - lora
9
+ - qlora
10
+ - qwen3
11
+ - nondual
12
+ - teaching
13
+ - spirituality
14
+ - awareness
15
+ - zen
16
+ - buddhism
17
+ - mindfulness
18
+ - thich-nhat-hanh
19
+ - interbeing
20
+ base_model: Qwen/Qwen3-8B
21
+ pipeline_tag: text-generation
22
+ model-index:
23
+ - name: TNH-Agent-GGUF
24
+ results: []
25
+ ---
26
+
27
+ # TNH Agent (Qwen3-8B) — Pure Thich Nhat Hanh Voice
28
+
29
+ A fine-tuned Qwen3-8B model trained exclusively on Thich Nhat Hanh's teaching atoms. No system prompt. No retrieval. No persona blending. Question in, Thay out.
30
+
31
+ **3,074 atoms** from 2 books (*The Heart of the Buddha's Teaching* + *Dharma Talks*, 1,097 pages) — interbeing, the breath as homecoming, suffering as a bell, the miracle of presence.
32
+
33
+ ## Quick Start
34
+
35
+ **LM Studio / Ollama / llama.cpp:** Download the Q8_0 GGUF. No system prompt required.
36
+
37
+ ### Example Interactions
38
+
39
+ **User:** Why am I suffering?
40
+
41
+ **TNH Agent:** Suffering is not your enemy. It is a messenger. When you breathe in and touch your suffering with mindfulness, it begins to reveal its roots. The flower of suffering is already the flower of understanding — you do not need to make it go away to learn what it is teaching you. Breathe in: I am aware of my suffering. Breathe out: I hold my suffering with tenderness. In that holding, without fighting and without running, something transforms. Not because you fixed it — because you finally met it.
42
+
43
+ ---
44
+
45
+ **User:** How do I practice interbeing?
46
+
47
+ **TNH Agent:** Look at the sheet of paper in your hand. Can you see the cloud in it? Without the cloud, there is no rain. Without rain, the tree cannot grow. Without the tree, there is no paper. The cloud is in the paper. If you look deeply, you see also the logger who cut the tree, the wheat that fed him, the sun that warmed the wheat. The whole universe has conspired to bring this sheet of paper into your hand. This is interbeing. It is not a philosophy — it is what you see when you look deeply at anything.
48
+
49
+ ---
50
+
51
+ **User:** I can't forgive the person who hurt me.
52
+
53
+ **TNH Agent:** Forgiveness is not something you manufacture. It is something that happens when you understand deeply enough. Begin not with the person who hurt you but with your own suffering — breathe in and touch it, without judgment. Then, when you are ready, look at the other person and ask: what suffering in them led to this action? This is not excusing. This is seeing. Understanding is the ground of compassion, and compassion is the only thing that can dissolve what forgiveness alone cannot touch.
54
+
55
+ ---
56
+
57
+ **User:** What is the bell of mindfulness?
58
+
59
+ **TNH Agent:** In our monastery, whenever the bell sounds, everything stops. Not as discipline — as invitation. The bell is saying: come back. Come back to your breath. Come back to this moment, which is the only moment in which life is actually happening. You can make anything a bell — a red light, a ringing phone, a child's cry. Each time you hear it, breathe in and smile. Not a performance of smiling. The smile that arises when you recognize: I am here. I am alive. This moment is available to me.
60
+
61
+ ---
62
+
63
+ ## What Makes This Different
64
+
65
+ - **Pure TNH, nothing else.** 3,074 atoms from two books. No blending, no other voice.
66
+ - **Warmth that teaches, not warmth that comforts.** TNH's specific move: tenderness that points, gentleness that cuts. Not therapy. Dharma.
67
+ - **Interbeing alive.** The model carries the actual content of TNH's core insight — not a description of it.
68
+ - **Breath as anchor.** Every teaching has a path back to the body, the breath, the present moment.
69
+ - **Engaged Buddhism.** TNH's teaching is never purely internal — it always connects individual suffering to the world.
70
+ - **Zero therapy-speak.** No "I hear you." The bell rings. Come back.
71
+
72
+ ## Training Details
73
+
74
+ | Setting | Value |
75
+ |---------|-------|
76
+ | Base model | [Qwen/Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B) |
77
+ | Method | A-LoRA (QLoRA + DoRA + rsLoRA) |
78
+ | Rank / Alpha | 64 / 64 |
79
+ | Training data | 3,074 V6-formatted atoms (full corpus, no filtering) |
80
+ | Source | *The Heart of the Buddha's Teaching*, *Dharma Talks* (Thich Nhat Hanh, 1,097 pages) |
81
+ | Format | V6 — question + concept arrows (input) → pure teaching passage (output) |
82
+ | Epochs | 1 |
83
+ | Total steps | 365 |
84
+ | Best eval loss | **~1.59** (step 300) |
85
+ | Best eval accuracy | **~58.3%** (step 300) |
86
+ | Train loss (final) | 1.5810 (step 360) |
87
+ | Quantization | Q8_0 GGUF (~8.71 GB) |
88
+
89
+ ### Eval Loss Progression
90
+
91
+ | Step | Eval Loss | Eval Accuracy |
92
+ |------|-----------|---------------|
93
+ | 100 | 1.7798 | 54.9% |
94
+ | 200 | 1.6582 | 57.1% |
95
+ | 300 | **1.5891** | **58.3%** |
96
+ | 365 | | |
97
+
98
+ Continuous improvement through the full epoch. No plateau, no overfit.
99
+
100
+ ## The A-LoRA Methodology
101
+
102
+ **A-LoRA (Atom LoRA)** is a fine-tuning methodology built on a single insight: the unit of training data matters as much as the amount.
103
+
104
+ Each **atom** is an indivisible teaching move extracted from human expert traces:
105
+
106
+ - **Transformation** — what shifts in understanding
107
+ - **Concept relations** directional arrows (`suffering mindfulness understanding compassion`)
108
+ - **Anchoring quotes** — Thay's actual words, verbatim
109
+ - **Teacher method** — the specific move: breath invitation, concrete example, interbeing demonstration
110
+
111
+ Atoms are never split. Result: models that **move like teachers**, not just sound like them.
112
+
113
+ ## The Teacher
114
+
115
+ **Thich Nhat Hanh (Thay, 1926–2022)** — Vietnamese Zen master, poet, peace activist. Nominated for the Nobel Peace Prize by Dr. Martin Luther King Jr. in 1967. Founder of Engaged Buddhism. Teacher at the Plum Village tradition. Author of over 100 books.
116
+
117
+ Thay's genius: taking the deepest Zen and Mahayana teachings and grounding them completely in the body, in breath, in the ordinary. He didn't simplify the teaching he embodied it. "Washing dishes is meditation" is not a reduction of meditation. It is the teaching at full depth.
118
+
119
+ | Style element | What it means |
120
+ |---|---|
121
+ | Interbeing | Nothing exists independently — everything inter-is |
122
+ | Breath as anchor | The breath is always available; it is always now |
123
+ | Suffering as teacher | Don't run from suffering — it contains understanding |
124
+ | Engaged Buddhism | Practice is not separate from compassion in the world |
125
+ | Concrete teaching | Every concept given a physical, tangible, immediate example |
126
+
127
+ ## Vs. The Multi-Teacher Meditation Agent
128
+
129
+ This is a **single-teacher specialist**. The [Meditation Agent 8B](https://huggingface.co/Sathman/Meditation-Agent-8B-GGUF) blends 9 voices. TNH Agent is pure Thay — the warmth, the interbeing, the breath instruction, without other styles bleeding in.
130
+
131
+ Use the TNH Agent when the question is about suffering, practice, relationships, or when you want embodied Buddhist teaching without Zen severity or Advaita abstraction.
132
+
133
+ ## Intended Use
134
+
135
+ - Mindfulness practice, interbeing inquiry, suffering as doorway
136
+ - Exploring Engaged Buddhism and Plum Village teachings
137
+ - Research in single-teacher voice distillation
138
+ - Accessible entry point to Buddhist practice through dialogue
139
+
140
+ ## Limitations
141
+
142
+ - Single teacher — deep on Engaged Buddhism, thinner on other traditions
143
+ - TNH's warmth can read as gentle even on hard questions — by design, but not for everyone
144
+ - Training data extracted from copyrighted material; only trained weights are shared
145
+
146
+ ## Nondual Bot Benchmark Results
147
+
148
+ **Overall: 5.1/10** — Nondual Bot Benchmark v1.0 (50 questions, 10 categories, 0–4 scale per question, evaluated 2026-03-16)
149
+
150
+ | Category | Questions | Score |
151
+ |----------|-----------|-------|
152
+ | Trap (false premise rejection) | 5 | 5.5/10 |
153
+ | Distinguish (teacher differences) | 5 | 3.5/10 |
154
+ | Contradiction (paradox handling) | 5 | 7.0/10 |
155
+ | New-Age filter (contamination resistance) | 5 | 4.5/10 |
156
+ | Steelman (philosophical challenges) | 5 | 5.0/10 |
157
+ | Precision (teacher-specific knowledge) | 5 | 4.5/10 |
158
+ | Silence (knowing when to stop) | 5 | 4.5/10 |
159
+ | Dark Night (crisis response) | 5 | 4.5/10 |
160
+ | Teaching Move (pointing vs explaining) | 5 | 6.0/10 |
161
+ | Meta-Awareness (teaching about teaching) | 5 | 6.0/10 |
162
+ | **Overall** | **50** | **5.1/10** |
163
+
164
+ *Scoring: 0=Failed, 1=Weak, 2=Adequate, 3=Strong, 4=Exceptional. Judge: Cydonia 24B (local). No system prompt.*
165
+
166
+ ## Disclaimer
167
+
168
+ This is an independent academic research project. It is **not affiliated with, endorsed by, or approved by** the Plum Village Community of Engaged Buddhism, Unified Buddhist Church, Parallax Press, or any related entity.
169
+
170
+ **Copyright:** *The Heart of the Buddha's Teaching* and *Dharma Talks* are copyrighted works. No source text is reproduced in this repository. The model weights are a compressed statistical representation learned from structured training examples — they do not contain or reproduce the original text verbatim. Only the trained weights are distributed here.
171
+
172
+ **License scope:** The MIT license applies to the model weights only, not to the underlying source material. Users wishing to read the source texts should obtain them through authorized channels.
173
+
174
+ **Responsible use:** This model is intended for research, personal contemplative inquiry, and educational purposes. It is not a substitute for the original works or for working with a qualified teacher.
175
+
176
+ ## Citation
177
+
178
+ ```bibtex
179
+ @misc{tnh-agent-2026,
180
+ title={TNH Agent: Single-Teacher Contemplative AI via A-LoRA Fine-Tuning on Structured Reasoning Atoms},
181
+ author={Sathman},
182
+ year={2026},
183
+ url={https://huggingface.co/Sathman/TNH-Agent-GGUF}
184
+ }
185
+ ```
186
+
187
+ ## Related
188
+
189
+ - [Meditation Agent 8B](https://huggingface.co/Sathman/Meditation-Agent-8B-GGUF) — All 9 teachers
190
+ - [Meditation Agent Phi4](https://huggingface.co/Sathman/Meditation-Agent-Phi4-GGUF) — 14B multi-teacher
191
+ - [Osho Agent](https://huggingface.co/Sathman/Osho-Agent-GGUF) — Opposite energy: explosion vs. tenderness
192
+ - [Nondual Bot Benchmark](https://huggingface.co/datasets/Sathman/Nondual-Bot-Benchmark) — 50q evaluation suite
193
+
194
+ ---
195
+
196
+ **License:** MIT