UCSB AI Group

university

https://nlp.cs.ucsb.edu

AI & ML interests

NLP, ML, CV, etc.

Recent Activity

Gurusha updated a dataset 7 days ago

ucsbai/enact_tom_300_public

Gurusha published a dataset 7 days ago

ucsbai/enact_tom_300_public

ChuhanLi updated a dataset 26 days ago

ucsbai/SAW-Bench

View all activity

Papers

Auditing Agent Harness Safety

Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling

View all Papers

updated a dataset 7 days ago

ucsbai/enact_tom_300_public

Preview • Updated 7 days ago • 55

published a dataset 7 days ago

ucsbai/enact_tom_300_public

Preview • Updated 7 days ago • 55

updated a dataset 26 days ago

ucsbai/SAW-Bench

Viewer • Updated 26 days ago • 4.14k • 941 • 3

updated a collection 2 months ago

SAW-Bench

2 items • Updated May 23

published a dataset 2 months ago

ucsbai/SAW-Bench

Viewer • Updated 26 days ago • 4.14k • 941 • 3

updated a collection 2 months ago

SAW-Bench

2 items • Updated May 23

authored a paper 2 months ago

Auditing Agent Harness Safety

Paper • 2605.14271 • Published May 14 • 55

submitted a paper to Daily Papers 2 months ago

Auditing Agent Harness Safety

Paper • 2605.14271 • Published May 14 • 55

authored a paper 2 months ago

Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling

Paper • 2604.27039 • Published Apr 29 • 26

authored 2 papers 3 months ago

SafePro: Evaluating the Safety of Professional-Level AI Agents

Paper • 2601.06663 • Published Jan 13

Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling

Paper • 2604.27039 • Published Apr 29 • 26

submitted a paper to Daily Papers 3 months ago

Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling

Paper • 2604.27039 • Published Apr 29 • 26

authored 5 papers 3 months ago

Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space

Paper • 2512.12623 • Published Dec 14, 2025 • 4

Rethinking Memory Mechanisms of Foundation Agents in the Second Half: A Survey

Paper • 2602.06052 • Published Jan 14 • 7

Context Bootstrapped Reinforcement Learning

Paper • 2603.18953 • Published Mar 19

Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants

Paper • 2604.00842 • Published Apr 1 • 15

On the Reliability of Computer Use Agents

Paper • 2604.17849 • Published Apr 20 • 13

authored 3 papers 4 months ago

Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants

Paper • 2604.00842 • Published Apr 1 • 15

WildSci: Advancing Scientific Reasoning from In-the-Wild Literature

Paper • 2601.05567 • Published Jan 9

Group-Evolving Agents: Open-Ended Self-Improvement via Experience Sharing

Paper • 2602.04837 • Published Feb 4 • 10