From Soliloquy to Agora: Memory-Enhanced LLM Agents with Decentralized Debate for Optimization Modeling Paper • 2604.25847 • Published Apr 28
OR-Space: A Full-Lifecycle Workspace Benchmark for Industrial Optimization Agents Paper • 2605.28158 • Published May 27 • 6
Skills on the Fly: Test-Time Adaptive Skill Synthesis for LLM Agents Paper • 2605.16986 • Published May 16
LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents Paper • 2606.06087 • Published 23 days ago • 66
LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents Paper • 2606.06087 • Published 23 days ago • 66
OR-Space: A Full-Lifecycle Workspace Benchmark for Industrial Optimization Agents Paper • 2605.28158 • Published May 27 • 6
OR-Space: A Full-Lifecycle Workspace Benchmark for Industrial Optimization Agents Paper • 2605.28158 • Published May 27 • 6
Auto-Formulating Dynamic Programming Problems with Large Language Models Paper • 2507.11737 • Published Apr 1 • 1
Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering Paper • 2604.08224 • Published Apr 9 • 53
Auto-Formulating Dynamic Programming Problems with Large Language Models Paper • 2507.11737 • Published Apr 1 • 1
Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization Paper • 2604.09574 • Published Feb 24 • 30
Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering Paper • 2604.08224 • Published Apr 9 • 53
Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering Paper • 2604.08224 • Published Apr 9 • 53
StepORLM: A Self-Evolving Framework With Generative Process Supervision For Operations Research Language Models Paper • 2509.22558 • Published Sep 26, 2025 • 4
StepORLM: A Self-Evolving Framework With Generative Process Supervision For Operations Research Language Models Paper • 2509.22558 • Published Sep 26, 2025 • 4