TabPFN-2.5: Advancing the State of the Art in Tabular Foundation Models Paper • 2511.08667 • Published Nov 11, 2025 • 6
LLM Explainability with Counterfactual Chains and Causal Graphs Paper • 2606.05972 • Published 22 days ago • 18
A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks Paper • 2605.28556 • Published about 1 month ago • 71
STRABLE: Benchmarking Tabular Machine Learning with Strings Paper • 2605.12292 • Published May 12 • 4
MulTaBench: Benchmarking Multimodal Tabular Learning with Text and Image Paper • 2605.10616 • Published May 11 • 142
Jamba-1.5: Hybrid Transformer-Mamba Models at Scale Paper • 2408.12570 • Published Aug 22, 2024 • 33
Predicting Decisions of AI Agents from Limited Interaction through Text-Tabular Modeling Paper • 2605.12411 • Published May 12 • 49
Alignment Makes Language Models Normative, Not Descriptive Paper • 2603.17218 • Published Mar 17 • 46
Empty Shelves or Lost Keys? Recall Is the Bottleneck for Parametric Factuality Paper • 2602.14080 • Published Feb 15 • 23
STATe-of-Thoughts: Structured Action Templates for Tree-of-Thoughts Paper • 2602.14265 • Published Feb 15 • 21
LIBERTy: A Causal Framework for Benchmarking Concept-Based Explanations of LLMs with Structural Counterfactuals Paper • 2601.10700 • Published Jan 15 • 18
The Poisoned Apple Effect: Strategic Manipulation of Mediated Markets via Technology Expansion of AI Agents Paper • 2601.11496 • Published Jan 16 • 47
Fine-Grained Detection of Context-Grounded Hallucinations Using LLMs Paper • 2509.22582 • Published Sep 26, 2025 • 12