| license: mit | |
| language: | |
| - en | |
| base_model: | |
| - Qwen/Qwen2.5-VL-3B-Instruct | |
| - Qwen/Qwen2.5-VL-7B-Instruct | |
| pipeline_tag: visual-question-answering | |
| This repository contains the model presented in [GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents](https://huggingface.co/papers/2504.10458). | |
| Project page: https://github.com/ritzz-ai/GUI-R1 |