---
license: apache-2.0
datasets:
- Bofeee5675/GUI-Net-1M
language:
- en
base_model:
- Qwen/Qwen2.5-VL-3B-Instruct
tags:
- VLM
- Computer-Use
---

# TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for Generalized GUI Agents

This model was trained on the [GUI-Net dataset](https://huggingface.co/datasets/Bofeee5675/GUI-Net-1M).

See our [Project Page](https://github.com/TongUI-agent/TongUI-agent) for details.

## Model Details

The base model is `Qwen/Qwen2.5-VL-3B-Instruct`. We fine-tuned the base model with LoRA.
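
For readers unfamiliar with LoRA, the following is a minimal sketch of the general idea: the base weight stays frozen and only a low-rank update is learned, which is then scaled and added to it. The shapes, values, `alpha`, and the helper names `matmul` and `lora_merge` are hypothetical and chosen for illustration only; they are not the settings or code used to train this model.

```python
# Illustrative LoRA update: W_eff = W + (alpha / r) * (B @ A).
# All shapes and values below are hypothetical, not TongUI's settings.


def matmul(x, y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*y)] for row in x]


def lora_merge(w, a, b, alpha):
    """Return the frozen weight w plus the scaled low-rank update b @ a."""
    r = len(a)  # rank = number of rows of A
    delta = matmul(b, a)
    scale = alpha / r
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


# Frozen 3x3 base weight (identity, for readability).
W = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
# Rank-1 adapter: A is (r x d_in), B is (d_out x r).
A = [[1.0, 2.0, 3.0]]
B = [[0.5], [0.0], [-0.5]]

W_eff = lora_merge(W, A, B, alpha=2.0)
```

In this toy case the adapter holds 6 trainable values against 9 in the full matrix; the saving grows quickly with larger layers, which is the usual reason for choosing LoRA over full fine-tuning.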