Update Readme
#1
by
zhugekongkong - opened
README.md
CHANGED
|
@@ -25,7 +25,7 @@ tags:
|
|
| 25 |
<!-- Provide a longer summary of what this model is. -->
|
| 26 |
|
| 27 |
- **Base Model:** `Qwen3-VL-32B-Thinking`
|
| 28 |
-
- **Fine-tuning Strategy:**
|
| 29 |
- **Primary Task:** Autonomous web navigation and task execution.
|
| 30 |
- **Input:** A combination of a natural language task description and a webpage screenshot.
|
| 31 |
- **Output:** A JSON-formatted action (e.g., `click`, `type`, `scroll`) or a final answer.
|
|
|
|
| 25 |
<!-- Provide a longer summary of what this model is. -->
|
| 26 |
|
| 27 |
- **Base Model:** `Qwen3-VL-32B-Thinking`
|
| 28 |
+
- **Fine-tuning Strategy:** Online Agentic RL with a Hybrid Reward mechanism.
|
| 29 |
- **Primary Task:** Autonomous web navigation and task execution.
|
| 30 |
- **Input:** A combination of a natural language task description and a webpage screenshot.
|
| 31 |
- **Output:** A JSON-formatted action (e.g., `click`, `type`, `scroll`) or a final answer.
|