Live-Evo: Online Evolution of Agentic Memory from Continuous Feedback Paper • 2602.02369 • Published Feb 2
EVA: Efficient Reinforcement Learning for End-to-End Video Agent Paper • 2603.22918 • Published 8 days ago • 41
PyBench: Evaluating LLM Agent on various real-world coding tasks Paper • 2407.16732 • Published Jul 23, 2024 • 1
ELV-Halluc: Benchmarking Semantic Aggregation Hallucinations in Long Video Understanding Paper • 2508.21496 • Published Aug 29, 2025 • 55