Saurabh Jha

saurabhjha1

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

ScarfBench: Benchmarking AI Agents for Enterprise Java Framework Migration

liked a dataset about 1 month ago

ibm-research/ITBench-Trajectories

liked a dataset about 1 month ago

ibm-research/ITBench-Lite

Articles 2

Article

18

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

Article

19

IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST

Models 1

saurabhjha1/ppo-LunarLander-v2

Reinforcement Learning • Updated Jan 16, 2023

Datasets 0

None public yet