OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows Paper β’ 2510.24411 β’ Published Oct 28, 2025 β’ 73
Jailbreaking as a Reward Misspecification Problem Paper β’ 2406.14393 β’ Published Jun 20, 2024 β’ 13
Running Featured 561 Vision Arena (Testing VLMs side-by-side) πΌ 561 Explore AI-powered visual tasks in Vision Arena
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models Paper β’ 2404.12387 β’ Published Apr 18, 2024 β’ 40