π0 and π0-FAST: Vision-Language-Action Models for General Robot Control • Article • Published Feb 4
Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark • Paper • 2510.26802 • Published Oct 30
Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity • Paper • 2510.01171 • Published Oct 1
RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training • Paper • 2510.06710 • Published Oct 8
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey • Paper • 2509.02547 • Published Sep 2
ThinkGrasp: A Vision-Language System for Strategic Part Grasping in Clutter • Paper • 2407.11298 • Published Jul 16, 2024
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI • Paper • 2510.05684 • Published Oct 7
BEAR: Benchmarking and Enhancing Multimodal Language Models for Atomic Embodied Capabilities • Paper • 2510.08759 • Published Oct 9
VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning • Paper • 2506.09049 • Published Jun 10