Xiangyi Li PRO

xdotli

https://www.xiangyi.li

AI & ML interests

None yet

Recent Activity

updated a model about 10 hours ago

benchflow/benchflow-qwen35-9b

Updated about 5 hours ago • 13 • 1

updated a dataset 1 day ago

benchflow/env0-experiment-trajectories

Updated about 4 hours ago • 21

published a model 2 days ago

benchflow/benchflow-qwen35-9b

Updated about 5 hours ago • 13 • 1

New activity in benchflow/env0-experiment-trajectories 4 days ago

tasks-lite GPT-5.5 xhigh trajectories: all

#9 opened 4 days ago by

tasks-lite GPT-5.5 xhigh trajectories: wave4

#8 opened 4 days ago by

tasks-lite GPT-5.5 xhigh trajectories: wave3

#7 opened 4 days ago by

tasks-lite GPT-5.5 xhigh trajectories: wave2

#6 opened 4 days ago by

New activity in benchflow/env0-experiment-trajectories 5 days ago

Add full SkillsBench canary artifacts

#2 opened 5 days ago by

Add SkillsBench canary trajectories

#1 opened 5 days ago by

liked a dataset 9 days ago

benchflow/skillsbench

Updated 8 days ago • 4.15k • 6

liked 2 models 9 days ago

Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated Mar 25 • 3.32k • 1.14k

Qwen/Qwen3.5-9B

Image-Text-to-Text • 10B • Updated Mar 2 • 10M • 1.61k

upvoted an article about 1 month ago

Article

Exploring Environments Hub: Your Language Model needs better (open) environments to learn

anakin87 • Sep 4, 2025 • 31

New activity in benchflow/benchmarks about 2 months ago

harvey-lab: refresh adapter parity artifacts

#2 opened about 2 months ago by

updated a dataset about 2 months ago

benchflow/benchmarks

Updated May 7 • 51

New activity in benchflow/benchmarks about 2 months ago

harvey-lab: refresh adapter parity artifacts (3-run agent-side)

#1 opened about 2 months ago by

published a dataset about 2 months ago

benchflow/benchmarks

Updated May 7 • 51

updated a dataset about 2 months ago

benchflow/skillsbench-research-artifacts

Updated May 5 • 35

published a dataset about 2 months ago

benchflow/skillsbench-research-artifacts

Updated May 5 • 35

updated a dataset about 2 months ago

benchflow/skillsbench-leaderboard

Updated 8 days ago • 5.9k • 1