Beyond IID: How General Are Tabular Foundation Models, Really? Paper • 2606.30410 • Published 2 days ago • 36
LLM Explainability with Counterfactual Chains and Causal Graphs Paper • 2606.05972 • Published 27 days ago • 18
A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks Paper • 2605.28556 • Published May 27 • 73
A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks Paper • 2605.28556 • Published May 27 • 73
Efficient Video Sampling: Pruning Temporally Redundant Tokens for Faster VLM Inference Paper • 2510.14624 • Published Oct 16, 2025 • 2
A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks Paper • 2605.28556 • Published May 27 • 73
Predicting Decisions of AI Agents from Limited Interaction through Text-Tabular Modeling Paper • 2605.12411 • Published May 12 • 49
MulTaBench: Benchmarking Multimodal Tabular Learning with Text and Image Paper • 2605.10616 • Published May 11 • 142
Running on CPU Upgrade Featured 3.22k The Smol Training Playbook 📚 3.22k The secrets to building world-class LLMs
TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations Paper • 2505.18125 • Published May 23, 2025 • 113
Running on CPU Upgrade 14k Open LLM Leaderboard 🏆 14k Track, rank and evaluate open LLMs and chatbots
Running on CPU Upgrade Agents 1.02k Open VLM Leaderboard 🌎 1.02k VLMEvalKit Evaluation Results Collection