Provable Benefits of In-Tool Learning for Large Language Models Paper • 2508.20755 • Published Aug 28, 2025 • 11
CauKer: classification time series foundation models can be pretrained on synthetic data only Paper • 2508.02879 • Published Aug 4, 2025
Vision Transformer Finetuning Benefits from Non-Smooth Components Paper • 2602.06883 • Published 8 days ago • 4
Optimal Self-Consistency for Efficient Reasoning with Large Language Models Paper • 2511.12309 • Published Nov 15, 2025
Vision Transformer Finetuning Benefits from Non-Smooth Components Paper • 2602.06883 • Published 8 days ago • 4
Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods Paper • 2502.01384 • Published Feb 3, 2025 • 1
LLMs as In-Context Meta-Learners for Model and Hyperparameter Selection Paper • 2510.26510 • Published Oct 30, 2025 • 2
LLMs as In-Context Meta-Learners for Model and Hyperparameter Selection Paper • 2510.26510 • Published Oct 30, 2025 • 2
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level Paper • 2411.03562 • Published Nov 5, 2024 • 69
From Data to Rewards: a Bilevel Optimization Perspective on Maximum Likelihood Estimation Paper • 2510.07624 • Published Oct 8, 2025 • 8
From Data to Rewards: a Bilevel Optimization Perspective on Maximum Likelihood Estimation Paper • 2510.07624 • Published Oct 8, 2025 • 8
Analysing Multi-Task Regression via Random Matrix Theory with Application to Time Series Forecasting Paper • 2406.10327 • Published Jun 14, 2024