Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published Aug 5, 2025 • 136
A Disease-Centric Vision-Language Foundation Model for Precision Oncology in Kidney Cancer Paper • 2508.16569 • Published Aug 22, 2025 • 1
Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views Paper • 2510.18632 • Published Oct 21, 2025 • 22
Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization Paper • 2602.23008 • Published 3 days ago • 29
Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization Paper • 2602.23008 • Published 3 days ago • 29
Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views Paper • 2510.18632 • Published Oct 21, 2025 • 22
LeanK: Learnable K Cache Channel Pruning for Efficient Decoding Paper • 2508.02215 • Published Aug 4, 2025 • 12
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published Aug 5, 2025 • 136
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published Aug 5, 2025 • 136 • 7
LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression Paper • 2310.06839 • Published Oct 10, 2023 • 4
LLM-ABR: Designing Adaptive Bitrate Algorithms via Large Language Models Paper • 2404.01617 • Published Apr 2, 2024 • 8
Mitigate Position Bias in Large Language Models via Scaling a Single Dimension Paper • 2406.02536 • Published Jun 4, 2024
LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation Paper • 2411.04997 • Published Nov 7, 2024 • 39
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key Paper • 2501.09695 • Published Jan 16, 2025 • 1
On Memory Construction and Retrieval for Personalized Conversational Agents Paper • 2502.05589 • Published Feb 8, 2025
VisRL: Intention-Driven Visual Perception via Reinforced Reasoning Paper • 2503.07523 • Published Mar 10, 2025 • 1
Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs Paper • 2505.12929 • Published May 19, 2025 • 3