Commit · 379c2c4
1 Parent(s): 0fe41f7
update README.md
README.md
CHANGED
|
@@ -25,27 +25,6 @@ UI-TARS is a next-generation native GUI agent model designed to interact seamles
|
|
| 25 |
|
| 26 |
<!--  -->
|
| 27 |
|
| 28 |
-
## Core Features
|
| 29 |
-
### Perception
|
| 30 |
-
- **Comprehensive GUI Understanding**: Processes multimodal inputs (text, images, interactions) to build a coherent understanding of interfaces.
|
| 31 |
-
- **Real-Time Interaction**: Continuously monitors dynamic GUIs and responds accurately to changes in real-time.
|
| 32 |
-
|
| 33 |
-
### Action
|
| 34 |
-
- **Unified Action Space**: Standardized action definitions across platforms (desktop, mobile, and web).
|
| 35 |
-
- **Platform-Specific Actions**: Supports additional actions like hotkeys, long press, and platform-specific gestures.
|
| 36 |
-
|
| 37 |
-
### Reasoning
|
| 38 |
-
- **System 1 & System 2 Reasoning**: Combines fast, intuitive responses with deliberate, high-level planning for complex tasks.
|
| 39 |
-
- **Task Decomposition & Reflection**: Supports multi-step planning, reflection, and error correction for robust task execution.
|
| 40 |
-
|
| 41 |
-
### Memory
|
| 42 |
-
- **Short-Term Memory**: Captures task-specific context for situational awareness.
|
| 43 |
-
- **Long-Term Memory**: Retains historical interactions and knowledge for improved decision-making.
|
| 44 |
-
|
| 45 |
-
## Capabilities
|
| 46 |
-
- **Cross-Platform Interaction**: Supports desktop, mobile, and web environments with a unified action framework.
|
| 47 |
-
- **Multi-Step Task Execution**: Trained to handle complex tasks through multi-step trajectories and reasoning.
|
| 48 |
-
- **Learning from Synthetic and Real Data**: Combines large-scale annotated and synthetic datasets for improved generalization and robustness.
|
| 49 |
|
| 50 |
## Performance
|
| 51 |
**Perception Capability Evaluation**
|
|
|
|
| 25 |
|
| 26 |
<!--  -->
|
| 27 |
|
| 28 |
|
| 29 |
## Performance
|
| 30 |
**Perception Capability Evaluation**
|