arxiv:2407.03651
Amanda Dsouza
andsouzasnorkelai
AI & ML interests
None yet
Recent Activity
upvoted a paper about 9 hours ago
SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks
upvoted a paper 29 days ago
SkillOrchestra: Learning to Route Agents via Skill Transfer
liked a dataset 4 months ago
snorkelai/Tau2-Bench-Airline-With-Code-Agents