Emu3.5 Collection Native Multimodal Models are World Learners π β’ 4 items β’ Updated 4 days ago β’ 72
Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models Paper β’ 2508.00819 β’ Published Aug 1 β’ 62
Cohere Labs Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. β’ 5 items β’ Updated Jul 31 β’ 70
AIMv2 Collection A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. β’ 19 items β’ Updated Aug 25 β’ 82
Foundation AI Papers Collection Curated List of Must-Reads on LLM reasoning at Temus AI team β’ 135 items β’ Updated Jun 15, 2024 β’ 35
π MINT-1T Collection Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" β’ 14 items β’ Updated Oct 22 β’ 64
LLM Compiler Collection Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. β’ 4 items β’ Updated Jun 27, 2024 β’ 155