The Coverage Principle: A Framework for Understanding Compositional Generalization Paper • 2505.20278 • Published May 26, 2025 • 7
The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think Paper • 2505.10185 • Published May 15, 2025 • 26
Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model Paper • 2406.15275 • Published Jun 21, 2024 • 12
How Do Large Language Models Acquire Factual Knowledge During Pretraining? Paper • 2406.11813 • Published Jun 17, 2024 • 31
Gradient Ascent Post-training Enhances Language Model Generalization Paper • 2306.07052 • Published Jun 12, 2023
EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records Paper • 2301.07695 • Published Jan 16, 2023 • 2
Exploring the Benefits of Training Expert Language Models over Instruction Tuning Paper • 2302.03202 • Published Feb 7, 2023 • 1
The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning Paper • 2305.14045 • Published May 23, 2023 • 5