Attention-driven GUI Grounding: Leveraging Pretrained Multimodal Large Language Models without Fine-Tuning Paper • 2412.10840 • Published Dec 14, 2024 • 1
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents Paper • 2506.03143 • Published Jun 3, 2025 • 53
LlavaGuard Collection This collection contains the original repos of the LlavaGuard releases • 19 items • Updated May 12, 2025 • 7