UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience
Paper: arXiv:2603.24533
UI-Voyager is a novel two-stage self-evolving mobile GUI agent fine-tuned from Qwen3-VL-4B-Instruct. Our 4B model achieves an 81.0% success rate on the AndroidWorld benchmark, outperforming numerous recent baselines and exceeding human-level performance.
Figure: Overview of UI-Voyager performance on AndroidWorld.
| Attribute | Detail |
|---|---|
| Base Model | Qwen3-VL-4B-Instruct |
| Parameters | ~4B |
| License | MIT |
| Task | Mobile GUI Agent / Image-Text-to-Text |
| Benchmark | AndroidWorld |
| Success Rate | 81.0% |
For full evaluation instructions using AndroidWorld with parallel emulators, please refer to our GitHub repository.
```shell
# Start parallel evaluation (4 emulators)
NUM_WORKERS=4 CONFIG_NAME=UI-Voyager MODEL_NAME=UI-Voyager ./run_android_world.sh
```
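When changing the number of emulators, it can help to validate the worker count before launching anything. The following is a minimal dry-run sketch, not part of the repository: the helper names `is_positive_int` and `build_eval_command` are hypothetical, and only the environment variable names and the script name are taken from the command above. How `run_android_world.sh` actually consumes these variables is documented in the GitHub repository, not here.

```shell
# Hypothetical helpers for illustration; they are not shipped with UI-Voyager.

# Succeed only if the argument is a positive integer.
is_positive_int() {
  case "$1" in
    ''|*[!0-9]*) return 1 ;;   # empty, or contains a non-digit character
  esac
  [ "$1" -gt 0 ]
}

# Print the launch command for a given worker count and config/model name.
# This is a dry run: it only echoes the command and starts no emulators.
build_eval_command() {
  local workers="${1:-4}"
  local name="${2:-UI-Voyager}"
  if ! is_positive_int "$workers"; then
    echo "NUM_WORKERS must be a positive integer, got: '$workers'" >&2
    return 1
  fi
  echo "NUM_WORKERS=$workers CONFIG_NAME=$name MODEL_NAME=$name ./run_android_world.sh"
}

# Example: show the command for 4 parallel emulators.
build_eval_command 4
```

Printing the command instead of executing it keeps the sketch safe to run on a machine without emulators; once the output looks right, the printed line can be run as-is from the repository root.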
If you find this work useful, please consider starring the repository ⭐ and citing the paper:
```bibtex
@misc{lin2026uivoyager,
  title={UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience},
  author={Zichuan Lin and Feiyu Liu and Yijun Yang and Jiafei Lyu and Yiming Gao and Yicheng Liu and Zhicong Lu and Yangbin Yu and Mingyu Yang and Junyou Li and Deheng Ye and Jie Jiang},
  year={2026},
  eprint={2603.24533},
  archivePrefix={arXiv},
  primaryClass={cs.LG},
  url={https://arxiv.org/abs/2603.24533},
}
```