view article Article TimeScope: How Long Can Your Video Large Multimodal Model Go? +2 Jul 23, 2025 • 48
ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation Paper • 2510.04290 • Published Oct 5, 2025 • 19
FLUX.1 Collection A collection of our FLUX.1 models and LoRAs. • 13 items • Updated 29 days ago • 270
view article Article We’re open-sourcing our text-to-image model and the process behind it Nov 12, 2025 • 76
StructVisuals Collection StructBench and StructVisuals (Training Set) • 4 items • Updated Oct 9, 2025 • 5
Factuality Matters: When Image Generation and Editing Meet Structured Visuals Paper • 2510.05091 • Published Oct 6, 2025 • 20
SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer Paper • 2509.24695 • Published Sep 29, 2025 • 45
Modular Diffusers Custom Blocks Collection Custom blocks for Modular Diffusers • 10 items • Updated 2 days ago • 2
view article Article Welcome EmbeddingGemma, Google's new efficient embedding model +4 Sep 4, 2025 • 272
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels Aug 18, 2025 • 90