Nano-World-Model Collection 🌍 A minimalist repository for training video world models based on diffusion-forcing. • 18 items • Updated 9 days ago • 5
view article Article You could have designed state of the art positional encoding FL33TW00D-HF • Nov 25, 2024 • 478
Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models Paper • 2604.08545 • Published Apr 9 • 41
GEMS: Agent-Native Multimodal Generation with Memory and Skills Paper • 2603.28088 • Published Mar 30 • 85
view article Article We’re open-sourcing our text-to-image model and the process behind it Photoroom • Nov 12, 2025 • 99