Kenny Wang commited on
Commit
cd4a477
·
1 Parent(s): f3e7b7e

Clarify model family and public dataset scope

Browse files
Files changed (1) hide show
  1. README.md +48 -1
README.md CHANGED
@@ -10,4 +10,51 @@ app_file: app.py
10
  pinned: false
11
  ---
12
 
13
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  pinned: false
11
  ---
12
 
13
+ # Dialectic Reasoning
14
+
15
+ Interactive demo for the **dialectic LoRA model family**, with the **Qwen3-8B variant** as the primary model.
16
+
17
+ This Space is meant to demonstrate a specific capability:
18
+
19
+ - better **crux identification**
20
+ - stronger **conditional commitment**
21
+ - deeper **integrative resolution**
22
+
23
+ It is **not** just a “balanced conversation” bot and it is **not** intended as evidence by itself. The supporting evaluation artifacts live in the associated dataset/model repos.
24
+
25
+ ## What This Demo Represents
26
+
27
+ The strongest current result in the family is the **8B LoRA**:
28
+
29
+ - base model: `Qwen/Qwen3-8B`
30
+ - trained on **408 examples** drawn from a larger **510-trace internal corpus**
31
+ - evaluated on held-out prompts with a rubric focused on real synthesis behavior
32
+
33
+ Smaller family members also exist, but they should be treated as exploratory variants rather than equivalent peers.
34
+
35
+ ## Main Result
36
+
37
+ On a held-out rubric evaluation, the fine-tuned 8B model improved substantially over base Qwen3-8B on:
38
+
39
+ - **Conditional commitment**
40
+ - **Actionability**
41
+ - **Resolution depth**
42
+ - **Crux clarity**
43
+
44
+ It also reduced weak and bad outputs, although generic hedge language is still too common.
45
+
46
+ ## Read This As A Demo, Not The Whole Claim
47
+
48
+ Use the Space to get a feel for the behavior.
49
+
50
+ For the actual methodology and published reports, see:
51
+
52
+ - model: `hikewa/dialectic-qwen3-8b-lora`
53
+ - dataset + eval artifacts: `hikewa/dialectic-reasoning-traces`
54
+
55
+ ## Limitations
56
+
57
+ - The Space is a demo wrapper, not a research paper
58
+ - Public dataset release is smaller than the full internal corpus used for the 8B model
59
+ - The model can still sound diplomatic or over-general on some prompts
60
+ - Stronger evidence comes from held-out evaluation, not from an isolated chat impression