Spaces:

hikewa
/

dialectic-reasoning

Sleeping

App Files Files Community

dialectic-reasoning / README.md

hikewa

Upload README.md with huggingface_hub

be2e047 verified about 2 months ago

preview code

raw

history blame contribute delete

1.76 kB

	---
	title: Dialectic Reasoning
	emoji: 😻
	colorFrom: blue
	colorTo: purple
	sdk: gradio
	sdk_version: 6.10.0
	python_version: '3.12'
	app_file: app.py
	pinned: false
	---

	# Dialectic Reasoning

	Interactive demo for the dialectic LoRA model family, fine-tuned to identify genuine tensions, make conditional commitments, and reach integrative resolutions instead of hedging.

	## Current Best: 4B v3

	The strongest model in the family is the [Qwen3-4B v3 LoRA](https://huggingface.co/hikewa/dialectic-qwen3-4b-v3-lora):

	- Trained on 507 examples (408 original + 99 domain-diverse traces from 3 model families)
	- Rubric avg: 9.8/10 — all 14 held-out prompts score "strong"
	- generic_hedge: 0.00 (eliminated)

	The earlier 8B model (6.6/10 on 408 traces) demonstrated that data diversity matters more than model size.

	## What This Demo Shows

	- Crux identification — finding the real decision point
	- Conditional commitment — "if X, then Y; if Z, then W"
	- Integrative resolution — not "both sides have merit" but concrete synthesis

	This is not a balanced conversation bot. It is a demo of a specific trained capability.

	## Evidence

	For methodology and evaluation:

	- Best model: [hikewa/dialectic-qwen3-4b-v3-lora](https://huggingface.co/hikewa/dialectic-qwen3-4b-v3-lora)
	- 8B model: [hikewa/dialectic-qwen3-8b-lora](https://huggingface.co/hikewa/dialectic-qwen3-8b-lora)
	- Dataset + eval artifacts: [hikewa/dialectic-reasoning-traces](https://huggingface.co/datasets/hikewa/dialectic-reasoning-traces)

	## Limitations

	- The Space is a demo wrapper, not a research paper
	- Training data is synthetic (multi-model generated)
	- English-only
	- Stronger evidence comes from held-out evaluation, not from chat impressions