Growing Through Experience: Scaling Episodic Grounding in Language Models Paper • 2506.01312 • Published Jun 2, 2025
AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment Paper • 2411.10606 • Published Nov 15, 2024 • 1
LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement Paper • 2504.16053 • Published Apr 22, 2025
SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs Paper • 2510.05069 • Published Oct 6, 2025 • 13
LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models Paper • 2507.14204 • Published Jul 14, 2025
Superficial Self-Improved Reasoners Benefit from Model Merging Paper • 2503.02103 • Published Mar 3, 2025
Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners Paper • 2510.04454 • Published Oct 6, 2025
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published Mar 10, 2025 • 101
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders Paper • 2412.09586 • Published Dec 12, 2024 • 6
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks Paper • 2310.19909 • Published Oct 30, 2023 • 21
LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images Paper • 2305.19164 • Published May 30, 2023 • 2