Spaces:

gr0010
/

CustomThinker-Demo

Running on Zero

gr0010 commited on about 1 month ago

Commit

d89decd

verified ·

1 Parent(s): 4c447a2

Update app.py

Files changed (1) hide show

app.py CHANGED Viewed

@@ -34,13 +34,13 @@ def generate_and_parse(messages: list, temperature: float = 0.6,
     and parses it into thinking and answer parts.
     Decorated with @spaces.GPU for Zero GPU allocation.
     """
-    # Apply chat template WITHOUT enable_thinking to preserve thinking tags in history
-    prompt_text = tokenizer.apply_chat_template(
-        messages,
-        tokenize=False,
-        add_generation_prompt=True,
-        enable_thinking=False  # Changed to False to preserve <think> tags in context
-    )
     # --- CONSOLE DEBUG OUTPUT ---
     print("\n" + "="*50)

     and parses it into thinking and answer parts.
     Decorated with @spaces.GPU for Zero GPU allocation.
     """
+    # Build prompt manually to preserve <think> tags in context
+    prompt_text = ""
+    for msg in messages:
+        role = msg["role"]
+        content = msg["content"]
+        prompt_text += f"<|im_start|>{role}\n{content}<|im_end|>\n"
+    prompt_text += "<|im_start|>assistant\n"
     # --- CONSOLE DEBUG OUTPUT ---
     print("\n" + "="*50)