TRIP-Bench: A Benchmark for Long-Horizon Interactive Agents in Real-World Scenarios • Paper • 2602.01675 • Published 2 days ago • 8
Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction • Paper • 2601.05107 • Published 27 days ago • 23
The Ultra-Scale Playbook 🌌: The ultimate guide to training LLMs on large GPU clusters • Running • 3.67k