Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training Paper • 2510.12586 • Published Oct 14, 2025 • 112
TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models Paper • 2506.03099 • Published Jun 3, 2025 • 19
V-JEPA 2 Collection A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 187
Time Blindness: Why Video-Language Models Can't See What Humans Can? Paper • 2505.24867 • Published May 30, 2025 • 81
EXAONE 3.5: Series of Large Language Models for Real-world Use Cases Paper • 2412.04862 • Published Dec 6, 2024 • 50
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models Paper • 2309.03883 • Published Sep 7, 2023 • 35
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 3 days ago • 162
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 882
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 627
Flacuna: Unleashing the Problem Solving Power of Vicuna using FLAN Fine-Tuning Paper • 2307.02053 • Published Jul 5, 2023 • 22
LongNet: Scaling Transformers to 1,000,000,000 Tokens Paper • 2307.02486 • Published Jul 5, 2023 • 81
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis Paper • 2307.01952 • Published Jul 4, 2023 • 90