SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer Paper • 2605.30409 • Published May 28 • 41
Cosmos3 Collection Omnimodal World Models for Physical AI • 16 items • Updated 2 days ago • 132
facebook/dinov3-vitl16-pretrain-sat493m Image Feature Extraction • 0.3B • Updated Aug 19, 2025 • 13.2k • 45
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 15 items • Updated Mar 10 • 674
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 7.41M • 3.12k