Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -25,7 +25,7 @@ tags:
  <!-- Provide a longer summary of what this model is. -->
 
  - **Base Model:** `Qwen3-VL-32B-Thinking`
- - **Fine-tuning Strategy:** Hierarchical Multi-Task SFT followed by Online Agentic RL with a Hybrid Reward mechanism.
+ - **Fine-tuning Strategy:** Online Agentic RL with a Hybrid Reward mechanism.
  - **Primary Task:** Autonomous web navigation and task execution.
  - **Input:** A combination of a natural language task description and a webpage screenshot.
  - **Output:** A JSON-formatted action (e.g., `click`, `type`, `scroll`) or a final answer.