view article Article Build real agentic apps using CUGA: two dozen working examples on a lightweight harness ibm-research • 1 day ago • 31
A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks Paper • 2605.28556 • Published 29 days ago • 70
view article Article Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic ibm-research • 23 days ago • 88
view article Article Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents ibm-research • Apr 15 • 28
view article Article CUGA on Hugging Face: Democratizing Configurable AI Agents ibm-research • Dec 15, 2025 • 67