AI & ML interests
Natural language processing, language models, language agents
Recent Activity
Papers
AgentCL: Toward Rigorous Evaluation of Continual Learning in Language Agents
SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction
spaces 4
pinned
Running
Agents
26
Online-Mind2Web Leaderboard
🌐
View and explore Mind2Web agent evaluation leaderboards
Running
Agents
20
QUEST
🔎
Answer complex questions with web‑sourced research
Running
Agents
21
TravelPlannerLeaderboard
💻
Display and submit travel planner evaluation results
Paused
Agents
4
TravelPlannerEnvironment
👀
Plan a travel itinerary with cost tracking
models 80
osunlp/QUEST-35B-MT-Plus-SFT
Text Generation • 35B • Updated • 329 • 4
osunlp/QUEST-35B-SFT
Text Generation • 35B • Updated • 93 • 1
osunlp/QUEST-30B-SFT
Text Generation • 31B • Updated • 220
osunlp/QUEST-35B-RL
Text Generation • 35B • Updated • 709 • 19
osunlp/QUEST-35B-MT
Text Generation • 35B • Updated • 117
osunlp/QUEST-30B-MT-Plus-SFT
Text Generation • 31B • Updated • 247 • 1
osunlp/QUEST-30B-RL
Text Generation • 31B • Updated • 199
osunlp/QUEST-2B
Text Generation • 2B • Updated • 141
osunlp/QUEST-9B
Text Generation • 9B • Updated • 381 • 3
osunlp/QUEST-4B
Text Generation • 5B • Updated • 266
datasets 37
osunlp/QUEST-Mid-Training-Data
Updated • 56
osunlp/SkillHarm
Viewer • Updated • 879 • 3.84k • 1
osunlp/ACuRL
Viewer • Updated • 9.22k • 37
osunlp/QUEST-RL-Data
Viewer • Updated • 1.13k • 4.71k • 1
osunlp/QUEST-SFT-Data-Open-ended
Viewer • Updated • 11.9k • 863 • 1
osunlp/QUEST-SFT-Data-Objective
Viewer • Updated • 39.9k • 1.46k • 1
osunlp/Online-Mind2Web
Viewer • Updated • 300 • 1.59k • 25
osunlp/bioscan-traits
Viewer • Updated • 80.8k • 206 • 10
osunlp/AgentCL
Viewer • Updated • 300 • 184 • 1
osunlp/D3-Gym-Trajectories
Viewer • Updated • 6.37k • 88