- EgoActor: Grounding Task Planning into Spatial-aware Egocentric Actions for Humanoid Robots via Visual-Language Models — arXiv:2602.04515, published 2 days ago
- The Era of Agentic Organization: Learning to Organize with Language Models — arXiv:2510.26658, published Oct 30, 2025
- DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning — arXiv:2508.05405, published Aug 7, 2025
- Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills — arXiv:2503.12533, published Mar 16, 2025
- Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia — arXiv:2503.07920, published Mar 10, 2025
- Taking Notes Brings Focus? Towards Multi-Turn Multimodal Dialogue Learning — arXiv:2503.07002, published Mar 10, 2025
- MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents — arXiv:2410.03450, published Oct 4, 2024