Translation as a Bridging Action: Transferring Manipulation Skills from Humans to Robots Paper • 2606.28133 • Published 7 days ago • 37
UnityShots: Memory-Driven Multi-Shot Audio-Video Generation with Boundary-Aware Gating Paper • 2606.21661 • Published 14 days ago • 27
Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models Paper • 2606.25041 • Published 10 days ago • 112
RoPE-Aware Bit Allocation for KV-Cache Quantization Paper • 2606.24033 • Published 10 days ago • 8 • 2
OmniGameArena: A Unified UE5 Benchmark for VLM Game Agents with Improvement Dynamics Paper • 2606.09826 • Published 25 days ago • 19
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published May 26 • 145
On-Policy Adversarial Flow Distillation for Autoregressive Video Generation Paper • 2605.26105 • Published May 25 • 19