ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models Paper • 2512.07843 • Published Nov 24, 2025 • 22
A PINN Approach to Symbolic Differential Operator Discovery with Sparse Data Paper • 2212.04630 • Published Dec 9, 2022
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published Jan 8, 2025 • 96
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models Paper • 2411.04996 • Published Nov 7, 2024 • 51
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation Paper • 2410.03960 • Published Oct 4, 2024 • 2
MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts Paper • 2407.21770 • Published Jul 31, 2024 • 22
CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs Paper • 2406.18521 • Published Jun 26, 2024 • 31
From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline Paper • 2406.11939 • Published Jun 17, 2024 • 8
Nearest Neighbor Speculative Decoding for LLM Generation and Attribution Paper • 2405.19325 • Published May 29, 2024 • 14
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM Paper • 2403.07816 • Published Mar 12, 2024 • 45
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference Paper • 2403.04132 • Published Mar 7, 2024 • 41
Instruction-tuned Language Models are Better Knowledge Learners Paper • 2402.12847 • Published Feb 20, 2024 • 26
LEVER: Learning to Verify Language-to-Code Generation with Execution Paper • 2302.08468 • Published Feb 16, 2023 • 1
Efficient Large Scale Language Modeling with Mixtures of Experts Paper • 2112.10684 • Published Dec 20, 2021 • 2