FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection Paper • 2601.03928 • Published 23 days ago • 17
ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands Paper • 2512.24965 • Published about 1 month ago • 41
Factorized Learning for Temporally Grounded Video-Language Models Paper • 2512.24097 • Published Dec 30, 2025 • 7
SlideTailor: Personalized Presentation Slide Generation for Scientific Papers Paper • 2512.20292 • Published Dec 23, 2025 • 9
Real-time Multi-person Eyeblink Detection in the Wild for Untrimmed Video Paper • 2303.16053 • Published Mar 28, 2023
End-to-end Video Gaze Estimation via Capturing Head-face-eye Spatial-temporal Interaction Context Paper • 2310.18131 • Published Oct 27, 2023
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs Paper • 2508.16153 • Published Aug 22, 2025 • 160