Benchmarking Visual State Tracking in Multimodal Video Understanding Paper • 2606.03920 • Published 24 days ago • 50
Dexterous Point Policy: Learning Point-based Dexterous Hand Policies from Human Demonstrations Paper • 2606.10614 • Published 17 days ago • 25
The Ultra-Scale Playbook 🌌 • Running • 3.9k • The ultimate guide to training LLM on large GPU Clusters
Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off Paper • 2508.04825 • Published Aug 6, 2025 • 60