PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records Paper • 2601.09636 • Published Jan 14 • 9
$π$-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows Paper • 2605.14678 • Published May 19 • 108 • 4
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V Paper • 2310.11441 • Published Oct 17, 2023 • 29
IDEA-Research/grounding-dino-base Zero-Shot Object Detection • 0.2B • Updated May 12, 2024 • 1.94M • 193
ShapeLLM-Omni 🏢 A Native Multimodal LLM for 3D Generation and Understanding • Runtime error • Agents • 23
Wan2.1 Fast 🎥 Animate a still image into a short video using a prompt • Running on Zero • MCP • Featured • 1.61k