Add model card for OpenMobile

#1
by nielsr HF Staff - opened
Files changed (1)
  1. README.md +30 -0
README.md CHANGED
@@ -0,0 +1,30 @@
+ ---
+ library_name: transformers
+ pipeline_tag: image-text-to-text
+ ---
+
+ # OpenMobile Qwen3-VL
+
+ This repository contains the fine-tuned Qwen3-VL model presented in the paper [OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis](https://huggingface.co/papers/2604.15093).
+
+ OpenMobile is an open-source framework for synthesizing high-quality task instructions and agent trajectories for mobile agents. It addresses the shortage of open training data for mobile agents by introducing:
+ 1. **Scalable Task Synthesis**: A pipeline that builds a global environment memory from exploration and uses it to generate diverse, grounded instructions.
+ 2. **Policy-Switching Strategy**: A trajectory rollout method that alternates between learner and expert models to capture essential error-recovery data.
+
+ ## Resources
+ - **Paper:** [OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis](https://huggingface.co/papers/2604.15093)
+ - **Project Page:** [https://njucckevin.github.io/openmobile/](https://njucckevin.github.io/openmobile/)
+ - **Code:** [https://github.com/njucckevin/OpenMobile-Code](https://github.com/njucckevin/OpenMobile-Code)
+
+ ## Performance
+ Agents trained using the OpenMobile framework achieve competitive results across dynamic mobile agent benchmarks. Notably, this fine-tuned Qwen3-VL checkpoint reaches a **64.7%** success rate on AndroidWorld, surpassing existing open-data approaches.
+
+ ## Citation
+ ```bibtex
+ @article{openmobile2025,
+ title={OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis},
+ author={Wu, Zhiyong and others},
+ journal={arXiv preprint arXiv:2604.15093},
+ year={2026}
+ }
+ ```
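
Reviewer note: the policy-switching strategy described in the model card (alternating between learner and expert models during rollout so that error-recovery steps are recorded) can be illustrated with a toy sketch. This is not the OpenMobile implementation. The fixed-interval switching schedule, the one-dimensional toy environment, and every name here (`rollout_with_policy_switching`, `switch_every`, `toy_learner`, `toy_expert`) are assumptions made for illustration only; the paper and code repository define the real switching rule.

```python
from typing import Callable, List, Tuple

State = int
Action = int
Policy = Callable[[State], Action]
# One recorded step: (state seen, action taken, name of the policy that acted).
Step = Tuple[State, Action, str]


def rollout_with_policy_switching(
    learner: Policy,
    expert: Policy,
    step_env: Callable[[State, Action], State],
    is_done: Callable[[State], bool],
    start: State,
    switch_every: int = 2,  # hypothetical schedule: swap the acting policy every N steps
    max_steps: int = 20,
) -> Tuple[List[Step], State]:
    """Roll out one episode, alternating control between learner and expert."""
    trajectory: List[Step] = []
    state = start
    for t in range(max_steps):
        if is_done(state):
            break
        # Even-numbered blocks of steps go to the learner, odd-numbered ones to the expert.
        learner_turn = (t // switch_every) % 2 == 0
        policy, name = (learner, "learner") if learner_turn else (expert, "expert")
        action = policy(state)
        trajectory.append((state, action, name))
        state = step_env(state, action)
    return trajectory, state


# Toy setup (assumption): walk along the integers from 0 to the goal at 5.
GOAL = 5


def toy_expert(state: State) -> Action:
    # Always moves toward the goal.
    return 1 if state < GOAL else -1


def toy_learner(state: State) -> Action:
    # Imperfect policy: steps the wrong way whenever the state is odd.
    return 1 if state % 2 == 0 else -1


trajectory, final_state = rollout_with_policy_switching(
    learner=toy_learner,
    expert=toy_expert,
    step_env=lambda s, a: s + a,
    is_done=lambda s: s == GOAL,
    start=0,
)

for state, action, actor in trajectory:
    print(f"state={state:+d} action={action:+d} actor={actor}")
```

In this toy run the expert takes over right after the learner has stepped away from the goal, so the recorded expert steps start from states the learner's mistakes produced. That is the kind of error-recovery data the model card says the strategy is meant to capture.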