Wenbo Chen
wenbochen111
AI & ML interests
LLM
Recent Activity
authored a paper about 15 hours ago
ClawsBench: Evaluating Capability and Safety of LLM Productivity Agents in Simulated Workspaces upvoted a paper about 18 hours ago
ClawsBench: Evaluating Capability and Safety of LLM Productivity Agents in Simulated Workspaces authored a paper about 2 months ago
SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse TasksOrganizations
None yet