Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers Paper • 2601.04890 • Published 18 days ago • 41
LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation Paper • 2412.15188 • Published Dec 19, 2024 • 1
MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation Paper • 2506.07999 • Published Jun 9, 2025 • 2
TV2TV: A Unified Framework for Interleaved Language and Video Generation Paper • 2512.05103 • Published Dec 4, 2025 • 19
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research Paper • 2511.19399 • Published Nov 24, 2025 • 61
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research Paper • 2511.19399 • Published Nov 24, 2025 • 61
NeurIPS 2025 E2LM Competition : Early Training Evaluation of Language Models Paper • 2506.07731 • Published Jun 9, 2025 • 2
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance Paper • 2507.22448 • Published Jul 30, 2025 • 70
BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models Paper • 2310.01329 • Published Oct 2, 2023
UnifiedQA: Crossing Format Boundaries With a Single QA System Paper • 2005.00700 • Published May 2, 2020
Dense Passage Retrieval for Open-Domain Question Answering Paper • 2004.04906 • Published Apr 10, 2020 • 2
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation Paper • 2305.14251 • Published May 23, 2023 • 2
Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts Paper • 2112.08348 • Published Dec 15, 2021
Do Membership Inference Attacks Work on Large Language Models? Paper • 2402.07841 • Published Feb 12, 2024
Rethinking the Role of Demonstrations: What Makes In-Context Learning Work? Paper • 2202.12837 • Published Feb 25, 2022 • 2
Measuring and Narrowing the Compositionality Gap in Language Models Paper • 2210.03350 • Published Oct 7, 2022
Exploring The Landscape of Distributional Robustness for Question Answering Models Paper • 2210.12517 • Published Oct 22, 2022
CREPE: Open-Domain Question Answering with False Presuppositions Paper • 2211.17257 • Published Nov 30, 2022