11 18 9

Yif Yang

Yif29

Yif-Yang

AI & ML interests

None yet

Recent Activity

authored a paper 9 days ago

WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

upvoted a paper 13 days ago

WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

authored a paper 15 days ago

Latent Spatial Memory for Video World Models

View all activity

Organizations

authored a paper 9 days ago

WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

Paper • 2606.09426 • Published 17 days ago • 102

upvoted a paper 13 days ago

WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

Paper • 2606.09426 • Published 17 days ago • 102

authored a paper 15 days ago

Latent Spatial Memory for Video World Models

Paper • 2606.09828 • Published 17 days ago • 69

upvoted a paper 16 days ago

Latent Spatial Memory for Video World Models

Paper • 2606.09828 • Published 17 days ago • 69

updated a dataset 29 days ago

microsoft/AVGen-Bench

Viewer • Updated 29 days ago • 3.01k • 4.33k • 4

authored 2 papers about 1 month ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published May 22 • 246

From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills

Paper • 2605.23899 • Published May 22 • 29

upvoted a paper about 1 month ago

From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills

Paper • 2605.23899 • Published May 22 • 29

commented 2 papers about 1 month ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published May 22 • 246 •

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published May 22 • 246 •

upvoted a paper about 1 month ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published May 22 • 246

authored a paper about 1 month ago

Covering Human Action Space for Computer Use: Data Synthesis and Benchmark

Paper • 2605.12501 • Published May 12 • 16

upvoted a paper about 1 month ago

Covering Human Action Space for Computer Use: Data Synthesis and Benchmark

Paper • 2605.12501 • Published May 12 • 16

updated a dataset about 2 months ago

microsoft/World-R1

Viewer • Updated Apr 29 • 6.48k • 157 • 8

published a dataset about 2 months ago

microsoft/World-R1

Viewer • Updated Apr 29 • 6.48k • 157 • 8

authored a paper about 2 months ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published Apr 27 • 119

upvoted a paper about 2 months ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published Apr 27 • 119

updated a Space 2 months ago

BizGenEval Leaderboard

🥇

Official BizGenEval leaderboard on Hugging Face.

authored a paper 2 months ago

AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation

Paper • 2604.08540 • Published Apr 9 • 5

updated a dataset 2 months ago

microsoft/MM-WebGen-Bench

Viewer • Updated Apr 17 • 120 • 39

Yif Yang

AI & ML interests

Recent Activity

Organizations

Yif29's activity

BizGenEval Leaderboard