Rohan Arora's picture

Rohan Arora

rohan-arora

·

AI & ML interests

None yet

Recent Activity

published an article about 1 month ago

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

upvoted a paper about 2 months ago

MCP-Cosmos: World Model-Augmented Agents for Complex Task Execution in MCP Environments

updated a dataset 2 months ago

ibm-research/ITBench-Lite

View all activity

Organizations

published an article about 1 month ago

Article

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

ibm-research

•

May 27

• 17

published an article 4 months ago

Article

IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST

ibm-research

•

Feb 18

• 19