feat: Implement dual-context workflow extraction
GEMINI.md
CHANGED

@@ -10,26 +10,37 @@
 
 # Project Goals
 ## Incomplete
-- [ ]
+- [ ] Build an Agent application with workflow extraction and execution capabilities.
 
-##
--
+## In Progress
+- [x] Build a workflow application that makes integrated use of `Ring-mini-2.0`.
 
 ---
 
 # Sub-Goals
 ## Incomplete
-- [ ] **(In progress)**
+- [ ] **(In progress)** Implement a dual-LLM context architecture (chat + workflow extraction).
+- [ ] Rework the Gradio UI to display the results of both contexts.
 - [ ] Implement an automated deployment and verification pipeline.
 
 ## Completed
+- [x] Distinguish "thinking" and "body" tokens in the Gradio UI.
 - [x] Fixed deployment failures caused by excessive model size.
 - [x] Built a chat web app that can route between two models using LangGraph.
 
 ---
 
 # Todolist
+## To Do
+(none at the moment)
+
 ## Completed
+- [x] Read the current code in `app.py`.
+- [x] In `app.py`, change the UI from a single chat window to a top-and-bottom "chat + workflow" layout.
+- [x] In `app.py`, implement two independent chat states (`gr.State`).
+- [x] Implement the logic that passes the "chat context" conversation history to the "workflow extraction context".
+- [x] Design and integrate a system prompt for the "workflow extraction context".
+- [x] Update the project goals and sub-goals in `GEMINI.md`.
 - [x] Use Markdown to improve the display of the thinking process.
 - [x] Render "thinking" and "body" tokens in different colors.
 - [x] Implement a debug mode to observe the difference between "thinking" and "body" tokens.

@@ -71,3 +82,59 @@
 - **Subscription:** HuggingFace Pro
 - **Inference resources:** ZeroGPU is available
 - **Documentation reference:** When necessary, proactively search the online API docs for HuggingFace and Gradio.
+
+---
+
+# Project Requirements Document: Workflow Extraction & Execution Agent
+
+## 1. Overall Goal
+
+Build an AI application with dual-context capability. It converses with the user in natural language while automatically extracting and structuring the user's task intent and execution steps in the background, forming a dynamic workflow.
+
+## 2. Core Features
+
+### 2.1. Dual LLM Context Architecture
+
+The application maintains two independent LLM contexts:
+
+1. **Chat Context:**
+    * **Responsibility:** Interact directly with the user.
+    * **Capabilities:** Understand and respond to the user's instructions and questions across multi-turn dialogue.
+    * **Characteristics:** No preset system prompt; behavior is driven entirely by the user.
+
+2. **Workflow Extraction Context:**
+    * **Responsibility:** "Observe" the conversation in the chat context and analyze it.
+    * **Data flow:** The chat context's complete conversation record (user inputs and model outputs) is fed to this context in real time or near-real time.
+    * **Capabilities:**
+        * **Task identification:** Accurately identify and distill the user's current core task or intent from the conversation.
+        * **Step extraction:** Break the user's interaction with the chat context down into a series of clear, actionable steps.
+        * **Task state tracking:** Determine whether the user's task is starting, in progress, or finished.
+    * **Characteristics:** Carries a dedicated system prompt that guides the analysis and extraction described above.
+
+### 2.2. Gradio User Interface (UI) Rework
+
+To clearly show the working state of both contexts, the existing UI needs a new layout.
+
+* **Remove:** the old `[System Prompt]` input box.
+* **New layout:**
+    1. **`[Chatbot Interface]`:**
+        * **Backed by:** the chat context.
+        * **Function:** The user types questions here and sees the chat model's direct replies.
+    2. **`[Separator]`:**
+        * **Function:** Visually separates the two functional areas.
+    3. **`[Task Intent Display]`:**
+        * **Form:** read-only Textbox.
+        * **Backed by:** the workflow extraction context.
+        * **Content:** Shows, in real time, the user's current task intent as identified by that context.
+    4. **`[Extracted Steps Display]`:**
+        * **Form:** read-only Textbox.
+        * **Backed by:** the workflow extraction context.
+        * **Content:** Shows, in real time, the structured steps that context has extracted from the conversation.
+
+## 3. Technical Implementation Points
+
+* **Context management:** Design a mechanism in `app.py` that manages and maintains two independent conversation histories (`history`) simultaneously.
+* **Data synchronization:** Ensure every update to the chat context is captured by the workflow extraction context.
+* **UI updates:** Bind Gradio's UI elements to the state of both contexts with partial refreshes so analysis results appear live.
+
+---
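The requirements document describes the dual-context mechanism only in prose. Below is a minimal sketch of the context-management idea from Section 3, with Gradio and streaming stripped out; `call_llm` is a hypothetical stand-in for `generate_response` in `comp.py`, and the prompt text is illustrative rather than the one used in `app.py`.

```python
# Minimal sketch of the dual-context mechanism (Section 3). `call_llm` is a
# hypothetical placeholder for the real model call in comp.py.

WORKFLOW_PROMPT = "Identify the user's intent and list the actionable steps."  # illustrative

def call_llm(messages):
    """Placeholder for comp.generate_response; returns the model's reply text."""
    raise NotImplementedError

def chat_turn(chat_history, user_message):
    """One turn: answer in the chat context, then let the workflow context observe it."""
    # Context 1 (chat): driven by the user, no extraction prompt.
    chat_history.append({"role": "user", "content": user_message})
    reply = call_llm(chat_history)
    chat_history.append({"role": "assistant", "content": reply})

    # Context 2 (workflow extraction): sees the full chat transcript, but under
    # its own system prompt, so the two histories never mix.
    workflow_messages = [{"role": "system", "content": WORKFLOW_PROMPT}, *chat_history]
    analysis = call_llm(workflow_messages)
    return reply, analysis
```

The point the sketch makes explicit, and that the `app.py` diff below implements with chained `.then()` events, is that the workflow context receives a copy of the chat transcript under its own system prompt instead of sharing the chat context's message list.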
app.py
CHANGED

@@ -1,13 +1,46 @@
 import gradio as gr
 from comp import generate_response
+import re
+
+# --- Constants ---
+WORKFLOW_SYSTEM_PROMPT = """You are an expert in analyzing conversations and extracting user workflows.
+Based on the provided chat history, identify the user's main goal or intent.
+Then, break down the conversation into a series of actionable steps that represent the workflow to achieve that goal.
+The output should be in two parts, clearly separated:
+**Intent**: [A concise description of the user's goal]
+**Steps**:
+[A numbered list of steps]
+"""
+
+# --- Helper Functions ---
+def parse_workflow_response(response):
+    intent_match = re.search(r"\*\*Intent\*\*:\s*(.*)", response, re.IGNORECASE)
+    steps_match = re.search(r"\*\*Steps\*\*:\s*(.*)", response, re.DOTALL | re.IGNORECASE)
+
+    intent = intent_match.group(1).strip() if intent_match else "Could not determine intent."
+    steps = steps_match.group(1).strip() if steps_match else "Could not determine steps."
+
+    return intent, steps
 
 # --- Gradio UI ---
 
 with gr.Blocks() as demo:
     gr.Markdown("# Ling Playground")
-
-
-
+
+    with gr.Row():
+        with gr.Column(scale=2):
+            gr.Markdown("## Chat")
+            chat_chatbot = gr.Chatbot(label="Chat", bubble_full_width=False)
+            chat_msg = gr.Textbox(label="Your Message")
+
+        with gr.Column(scale=1):
+            gr.Markdown("## Workflow Extraction")
+            intent_textbox = gr.Textbox(label="Task Intent", interactive=False)
+            steps_textbox = gr.Textbox(
+                label="Extracted Steps", interactive=False, lines=15
+            )
+
+    chat_clear = gr.ClearButton([chat_msg, chat_chatbot, intent_textbox, steps_textbox])
 
     def user(user_message, history):
         return "", history + [[user_message, None]]

@@ -15,22 +48,44 @@ with gr.Blocks() as demo:
     def bot(history):
         user_message = history[-1][0]
         history[-1][1] = ""
+        # Main chat model call (uses default system prompt)
         for response in generate_response(user_message, history[:-1]):
             if "</think>" in response:
                 parts = response.split("</think>", 1)
                 thinking_text = parts[0].replace("<think>", "")
                 body_text = parts[1]
-
+
                 md_output = f"**Thinking...**\n```\n{thinking_text}\n```\n\n{body_text}"
                 history[-1][1] = md_output
             else:
                 history[-1][1] = response
             yield history
 
-
-
+    def update_workflow(history):
+        if not history or not history[-1][0]:
+            return "", ""
+
+        # The last user message is the main prompt for the workflow agent
+        user_message = history[-1][0]
+        # The rest of the conversation is the history
+        chat_history_for_workflow = history[:-1]
+
+        # Call the model with the workflow system prompt
+        full_response = ""
+        for response in generate_response(
+            user_message,
+            chat_history_for_workflow,
+            system_prompt=WORKFLOW_SYSTEM_PROMPT
+        ):
+            full_response = response
+
+        intent, steps = parse_workflow_response(full_response)
+        return intent, steps
+
+    ( chat_msg.submit(user, [chat_msg, chat_chatbot], [chat_msg, chat_chatbot], queue=False)
+      .then(bot, chat_chatbot, chat_chatbot)
+      .then(update_workflow, chat_chatbot, [intent_textbox, steps_textbox])
     )
-    clear.click(lambda: None, None, chatbot, queue=False)
 
 if __name__ == "__main__":
-    demo.launch()
+    demo.launch(share=True)
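As a sanity check on the parsing step: assuming the model honors the `**Intent**`/`**Steps**` format that `WORKFLOW_SYSTEM_PROMPT` requests, `parse_workflow_response` splits a reply as in the sketch below (the sample reply is invented for illustration).

```python
import re

def parse_workflow_response(response):
    # Same parsing logic as the helper added in app.py above.
    intent_match = re.search(r"\*\*Intent\*\*:\s*(.*)", response, re.IGNORECASE)
    steps_match = re.search(r"\*\*Steps\*\*:\s*(.*)", response, re.DOTALL | re.IGNORECASE)
    intent = intent_match.group(1).strip() if intent_match else "Could not determine intent."
    steps = steps_match.group(1).strip() if steps_match else "Could not determine steps."
    return intent, steps

# Hypothetical model reply in the requested format:
sample = "**Intent**: Deploy the app to a Space\n**Steps**:\n1. Fix the model size\n2. Push to HuggingFace"
intent, steps = parse_workflow_response(sample)
assert intent == "Deploy the app to a Space"
assert steps == "1. Fix the model size\n2. Push to HuggingFace"
```

One caveat: the intent pattern is compiled without `re.DOTALL`, so an intent that spans multiple lines is silently truncated at the first newline.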
comp.py
CHANGED

@@ -4,6 +4,7 @@ import spaces
 
 # Model and tokenizer initialization
 MODEL_NAME = "inclusionAI/Ring-mini-2.0"
+DEFAULT_SYSTEM_PROMPT = "你是 Ring,蚂蚁集团开发的智能助手,致力于为用户提供有用的信息和帮助,用中文回答用户的问题。"
 
 tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME, trust_remote_code=True)
 model = AutoModelForCausalLM.from_pretrained(

@@ -14,26 +15,29 @@ model = AutoModelForCausalLM.from_pretrained(
 )
 
 @spaces.GPU(duration=120)
-def generate_response(message, history):
-    # (msg, history) -> str: stream response (yielding partial responses)
+def generate_response(message, history, system_prompt=None):
+    # (msg, history, system_prompt) -> str: stream response (yielding partial responses)
 
+    # Determine the system prompt to use
+    prompt_to_use = system_prompt if system_prompt is not None else DEFAULT_SYSTEM_PROMPT
+
     # To construct the 'chat', we start with system prompt
     # then append user and assistant messages from history
     messages = [
-        {"role": "system", "content":
+        {"role": "system", "content": prompt_to_use}
    ]
 
     # Add conversation history
     # history is a list of (human, assistant) tuples
     for human, assistant in history:
         messages.append({"role": "user", "content": human})
-
+        if assistant:  # Ensure assistant message is not None
+            messages.append({"role": "assistant", "content": assistant})
 
     # Add current message from user
     messages.append({"role": "user", "content": message})
 
     # Apply chat template
-    # Doc: https://github.com/huggingface/transformers/blob/main/src/transformers/tokenization_utils_base.py#L1510
     text = tokenizer.apply_chat_template(
         messages,
         tokenize=False,