neutrino12 's Collections Agent
updated
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper
• 2508.03680
• Published • 138
Training Long-Context, Multi-Turn Software Engineering Agents with
Reinforcement Learning
Paper
• 2508.03501
• Published • 59
SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from
Experience
Paper
• 2508.04700
• Published • 52
RoboMemory: A Brain-inspired Multi-memory Agentic Framework for Lifelong
Learning in Physical Embodied Systems
Paper
• 2508.01415
• Published • 8
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Paper
• 2508.06471
• Published • 210
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm
Bridging Foundation Models and Lifelong Agentic Systems
Paper
• 2508.07407
• Published • 99
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with
Long-Term Memory
Paper
• 2508.09736
• Published • 58
Memp: Exploring Agent Procedural Memory
Paper
• 2508.06433
• Published • 36
OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks
Paper
• 2508.05614
• Published • 20
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent
Foundation Models Training
Paper
• 2508.00414
• Published • 94
Tool-integrated Reinforcement Learning for Repo Deep Search
Paper
• 2508.03012
• Published • 20
SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution
Paper
• 2507.23348
• Published • 12
Think in Games: Learning to Reason in Games via Reinforcement Learning
with Large Language Models
Paper
• 2508.21365
• Published • 29
How Can Input Reformulation Improve Tool Usage Accuracy in a Complex
Dynamic Environment? A Study on τ-bench
Paper
• 2508.20931
• Published • 16
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Paper
• 2508.16153
• Published • 162
Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance
for Text-to-Image Generation
Paper
• 2508.18032
• Published • 41
AWorld: Orchestrating the Training Recipe for Agentic AI
Paper
• 2508.20404
• Published • 38
Understanding Tool-Integrated Reasoning
Paper
• 2508.19201
• Published • 32
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
Paper
• 2509.01055
• Published • 79
EPO: Entropy-regularized Policy Optimization for LLM Agents
Reinforcement Learning
Paper
• 2509.22576
• Published • 137