Improve model card metadata and content

Hi! I'm Niels from the community science team at Hugging Face. I'm opening this PR to improve the model card for Mobile-Agent-v3.5 (GUI-Owl-1.5).

The improvements include:
- Adding the `image-text-to-text` pipeline tag for better discoverability.
- Adding `library_name: transformers` as the model architecture is compatible with the Transformers library.
- Moving the arXiv reference from the YAML metadata to the markdown section as per our best practices.
- Adding a model description and links to the paper and code.

These changes help users understand the model's capabilities and find the associated paper and code.
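The metadata changes listed above can be sanity-checked with a short script. The following is a minimal sketch, not part of this PR: `read_front_matter` is a hypothetical helper that only understands flat `key: value` pairs and simple `- item` lists, so it is not a full YAML parser, and the `card` string is a trimmed copy of the updated README header used purely for illustration.

```python
# Hypothetical helper: a simplified front-matter reader, NOT a full YAML parser.
# It handles only top-level "key: value" pairs and "- item" lists.
def read_front_matter(readme_text: str) -> dict:
    """Return the top-level keys found between the leading '---' delimiters."""
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter at the top of the file
    metadata = {}
    current_key = None
    for line in lines[1:]:
        if line.strip() == "---":  # closing delimiter ends the metadata block
            break
        if line.startswith("- ") and isinstance(metadata.get(current_key), list):
            # list item belonging to the most recently seen key
            metadata[current_key].append(line[2:].strip())
        elif ":" in line:
            key, _, value = line.partition(":")
            current_key = key.strip()
            value = value.strip()
            # a key with no inline value is assumed to start a list
            metadata[current_key] = value if value else []
    return metadata


# Trimmed copy of the updated model card header (illustrative only).
card = """---
language:
- en
license: mit
pipeline_tag: image-text-to-text
library_name: transformers
---
# Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents (GUI-Owl-1.5)
"""

metadata = read_front_matter(card)
for key in ("license", "pipeline_tag", "library_name"):
    print(f"{key}: {metadata.get(key)}")
```

For real model cards, a proper YAML parser is the safer choice, since front matter can contain nested or quoted values that this sketch does not handle.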

Files changed (1)

README.md +19 -4

@@ -1,15 +1,29 @@
 ---
-license: mit
 language:
 - en
-tags:
-- arxiv:2602.16855
+license: mit
+pipeline_tag: image-text-to-text
+library_name: transformers
 ---
+# Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents (GUI-Owl-1.5)
+Mobile-Agent-v3.5 (also known as **GUI-Owl-1.5**) is a family of native multi-platform GUI agent foundation models. It supports automation across desktop, mobile, and browser environments, enabling cloud-edge collaboration and real-time interaction.
+The model is built on the Qwen3-VL architecture and achieves state-of-the-art results on over 20 GUI benchmarks, excelling in tasks such as GUI automation (OSWorld, AndroidWorld, WebArena), grounding (ScreenSpotPro), and tool-calling.
+- **Paper:** [Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents](https://huggingface.co/papers/2602.16855)
+- **Repository:** [GitHub - X-PLUG/MobileAgent](https://github.com/X-PLUG/MobileAgent)
+- **Demo:** [ModelScope online demo](http://modelscope.cn/studios/MobileAgentTest/computer_use)
+## Key Features
+- **Multi-platform Support:** Native support for desktop, mobile, and browser automation.
+- **Unified Capability:** Combines UI understanding, reasoning, and trajectory generation.
+- **Enhanced Reasoning:** Incorporates a thought-synthesis pipeline to improve decision-making and memory.
 ## Citation
-If you find this model useful, please cite our paper:
+If you find this model useful, please cite the paper:
 ```bibtex
 @article{MobileAgentv3.5,
@@ -18,3 +32,4 @@ If you find this model useful, please cite our paper:
   journal={arXiv preprint arXiv:2602.16855},
   year={2026}
 }
+```