adnankhan-11
/

VisionNav-3B

Model card Files Files and versions

VisionNav-3B / README.md

adnankhan-11's picture

Upload README.md with huggingface_hub

26f05b6 verified 11 days ago

|

history blame contribute delete

516 Bytes

	---
	license: apache-2.0
	datasets:
	- Bofeee5675/GUI-Net-1M
	language:
	- en
	base_model:
	- Qwen/Qwen2.5-VL-3B-Instruct
	tags:
	- VLM
	- Computer-Use
	---
	# TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for Generalized GUI Agents

	Model trained from [GUI-Net Dataset](https://huggingface.co/datasets/Bofeee5675/GUI-Net-1M)

	See detail at our [Project Page](https://github.com/TongUI-agent/TongUI-agent)


	## Model Details

	The base model is `Qwen/Qwen2.5-VL-3B-Instruct`. We fine-tuned base model by Lora.