arxiv:2410.06703
Segev Shlomov
segevshlomov
AI & ML interests
None yet
Recent Activity
upvoted a paper 7 months ago
ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web Agents
liked a dataset 7 months ago
dolev31/st-webagentbench