Spaces:

hikewa
/

dialectic-reasoning

Sleeping

Kenny Wang commited on Apr 4

Commit

f463310

1 Parent(s): 219170b

Add Qwen3-8B v7 as default model (8.9 Mistral-strict)

Files changed (1) hide show

app.py CHANGED Viewed

@@ -7,7 +7,11 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
 from peft import PeftModel
 MODELS = {
-    "Qwen3-4B v7 (best-of-N, latest)": {
         "base": "Qwen/Qwen3-4B",
         "adapter": "hikewa/dialectic-qwen3-4b-v7-lora",
     },
@@ -122,15 +126,15 @@ demo = gr.ChatInterface(
     additional_inputs=[
         gr.Dropdown(
             choices=list(MODELS.keys()),
-            value="Qwen3-4B v7 (best-of-N, latest)",
             label="Model",
         ),
     ],
     title="Dialectic Reasoning Models",
     description=(
         "Fine-tuned on dialectic reasoning traces with falsifiability-based quality filtering. "
-        "v7 (113 best-of-N traces from 6 providers) scores 8.3 Mistral-strict with 13/14 strong verdicts. "
-        "Pick a model and ask a question involving competing perspectives."
     ),
     examples=[
         ["Should AI systems be transparent about their reasoning, even when transparency reduces performance?"],

 from peft import PeftModel
 MODELS = {
+    "Qwen3-8B v7 (best-of-N, 8.9 strict)": {
+        "base": "Qwen/Qwen3-8B",
+        "adapter": "hikewa/dialectic-qwen3-8b-v7-lora",
+    },
+    "Qwen3-4B v7 (best-of-N, 8.3 strict)": {
         "base": "Qwen/Qwen3-4B",
         "adapter": "hikewa/dialectic-qwen3-4b-v7-lora",
     },
     additional_inputs=[
         gr.Dropdown(
             choices=list(MODELS.keys()),
+            value="Qwen3-8B v7 (best-of-N, 8.9 strict)",
             label="Model",
         ),
     ],
     title="Dialectic Reasoning Models",
     description=(
         "Fine-tuned on dialectic reasoning traces with falsifiability-based quality filtering. "
+        "8B v7 scores 8.9 Mistral-strict (up from 4B's 8.3) with deeper resolution and less hedging. "
+        "113 best-of-N traces from 6 providers. Pick a model and ask a question involving competing perspectives."
     ),
     examples=[
         ["Should AI systems be transparent about their reasoning, even when transparency reduces performance?"],