Indications of Belief-Guided Agency and Meta-Cognitive Monitoring in Large Language Models Paper • 2602.02467 • Published Feb 2
From Directions to Regions: Decomposing Activations in Language Models via Local Geometry Paper • 2602.02464 • Published Feb 2 • 3
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs Paper • 2603.09906 • Published 3 days ago • 57
Mixing Mechanisms: How Language Models Retrieve Bound Entities In-Context Paper • 2510.06182 • Published Oct 7, 2025 • 9
LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations Paper • 2509.03405 • Published Sep 3, 2025 • 24
Jump to Conclusions: Short-Cutting Transformers With Linear Transformations Paper • 2303.09435 • Published Mar 16, 2023
DiscoFuse: A Large-Scale Dataset for Discourse-Based Sentence Fusion Paper • 1902.10526 • Published Feb 27, 2019
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies Paper • 2101.02235 • Published Jan 6, 2021
What's in your Head? Emergent Behaviour in Multi-Task Transformer Models Paper • 2104.06129 • Published Apr 13, 2021
A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains Paper • 2402.00559 • Published Feb 1, 2024 • 3
SCROLLS: Standardized CompaRison Over Long Language Sequences Paper • 2201.03533 • Published Jan 10, 2022 • 1
Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space Paper • 2203.14680 • Published Mar 28, 2022