RealICU: Do LLM Agents Understand Long-Context ICU Data? A Benchmark Beyond Behavior Imitation Paper • 2605.13542 • Published 3 days ago • 6