Scale RAE Collection Collection for "Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders" • 9 items • Updated 6 days ago • 3
Solaris: Building a Multiplayer Video World Model in Minecraft Paper • 2602.22208 • Published 23 days ago • 28
Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published 17 days ago • 97