Spaces:
Sleeping
Sleeping
Kenny Wang commited on
Commit ·
cd4a477
1
Parent(s): f3e7b7e
Clarify model family and public dataset scope
Browse files
README.md
CHANGED
|
@@ -10,4 +10,51 @@ app_file: app.py
|
|
| 10 |
pinned: false
|
| 11 |
---
|
| 12 |
|
| 13 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 10 |
pinned: false
|
| 11 |
---
|
| 12 |
|
| 13 |
+
# Dialectic Reasoning
|
| 14 |
+
|
| 15 |
+
Interactive demo for the **dialectic LoRA model family**, with the **Qwen3-8B variant** as the primary model.
|
| 16 |
+
|
| 17 |
+
This Space is meant to demonstrate a specific capability:
|
| 18 |
+
|
| 19 |
+
- better **crux identification**
|
| 20 |
+
- stronger **conditional commitment**
|
| 21 |
+
- deeper **integrative resolution**
|
| 22 |
+
|
| 23 |
+
It is **not** just a “balanced conversation” bot and it is **not** intended as evidence by itself. The supporting evaluation artifacts live in the associated dataset/model repos.
|
| 24 |
+
|
| 25 |
+
## What This Demo Represents
|
| 26 |
+
|
| 27 |
+
The strongest current result in the family is the **8B LoRA**:
|
| 28 |
+
|
| 29 |
+
- base model: `Qwen/Qwen3-8B`
|
| 30 |
+
- trained on **408 examples** drawn from a larger **510-trace internal corpus**
|
| 31 |
+
- evaluated on held-out prompts with a rubric focused on real synthesis behavior
|
| 32 |
+
|
| 33 |
+
Smaller family members also exist, but they should be treated as exploratory variants rather than equivalent peers.
|
| 34 |
+
|
| 35 |
+
## Main Result
|
| 36 |
+
|
| 37 |
+
On a held-out rubric evaluation, the fine-tuned 8B model improved substantially over base Qwen3-8B on:
|
| 38 |
+
|
| 39 |
+
- **Conditional commitment**
|
| 40 |
+
- **Actionability**
|
| 41 |
+
- **Resolution depth**
|
| 42 |
+
- **Crux clarity**
|
| 43 |
+
|
| 44 |
+
It also reduced weak and bad outputs, although generic hedge language is still too common.
|
| 45 |
+
|
| 46 |
+
## Read This As A Demo, Not The Whole Claim
|
| 47 |
+
|
| 48 |
+
Use the Space to get a feel for the behavior.
|
| 49 |
+
|
| 50 |
+
For the actual methodology and published reports, see:
|
| 51 |
+
|
| 52 |
+
- model: `hikewa/dialectic-qwen3-8b-lora`
|
| 53 |
+
- dataset + eval artifacts: `hikewa/dialectic-reasoning-traces`
|
| 54 |
+
|
| 55 |
+
## Limitations
|
| 56 |
+
|
| 57 |
+
- The Space is a demo wrapper, not a research paper
|
| 58 |
+
- Public dataset release is smaller than the full internal corpus used for the 8B model
|
| 59 |
+
- The model can still sound diplomatic or over-general on some prompts
|
| 60 |
+
- Stronger evidence comes from held-out evaluation, not from an isolated chat impression
|