FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning Paper • 2601.11141 • Published 11 days ago • 20
Running on Zero 111 Music Flamingo 🎵 111 Upload music or YouTube videos and ask detailed questions about them
End-to-End Video Character Replacement without Structural Guidance Paper • 2601.08587 • Published 14 days ago • 8
Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion Paper • 2512.23709 • Published 28 days ago • 49
InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion Paper • 2512.17504 • Published Dec 19, 2025 • 97
Openly licensed large image datasets Collection Openly licensed dataset with allowed commercial usage • 3 items • Updated Jul 1, 2024 • 1