10 36 59

Zekun Qi

qizekun

https://qizekun.github.io/

qizekun

AI & ML interests

Embodied Intelligence, Large Langugae Model, 3D Computer Vision

Recent Activity

upvoted a collection 4 days ago

SenseNova-U1

authored a paper 5 days ago

ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing?

upvoted a paper 8 days ago

ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing?

View all activity

Organizations

upvoted a collection 4 days ago

SenseNova-U1

Collection

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-Unify Architecture • 10 items • Updated 15 days ago • 74

authored a paper 5 days ago

ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing?

Paper • 2606.19531 • Published 11 days ago • 20

upvoted a paper 8 days ago

ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing?

Paper • 2606.19531 • Published 11 days ago • 20

authored a paper 19 days ago

LIMMT: Less is More for Motion Tracking

Paper • 2606.06953 • Published 23 days ago • 16

upvoted a paper 20 days ago

LIMMT: Less is More for Motion Tracking

Paper • 2606.06953 • Published 23 days ago • 16

submitted a paper to Daily Papers 20 days ago

LIMMT: Less is More for Motion Tracking

Paper • 2606.06953 • Published 23 days ago • 16

authored 5 papers 23 days ago

Hybrid-grained Feature Aggregation with Coarse-to-fine Language Guidance for Self-supervised Monocular Depth Estimation

Paper • 2510.09320 • Published Oct 10, 2025 • 3

VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model

Paper • 2602.10098 • Published Feb 10 • 22

ReWorld: Multi-Dimensional Reward Modeling for Embodied World Models

Paper • 2601.12428 • Published Jan 18

Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining

Paper • 2604.16391 • Published Mar 27 • 4

Learning Athletic Humanoid Tennis Skills from Imperfect Human Motion Data

Paper • 2603.12686 • Published Mar 13

authored a paper 24 days ago

Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking

Paper • 2606.03985 • Published 26 days ago • 41

upvoted a paper 25 days ago

Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking

Paper • 2606.03985 • Published 26 days ago • 41

upvoted a paper 29 days ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published about 1 month ago • 146

liked a model about 1 month ago

ginwind/VLA-JEPA

Updated Mar 25 • 13

liked 3 datasets 3 months ago

upvoted a paper 4 months ago

Learning Humanoid End-Effector Control for Open-Vocabulary Visual Loco-Manipulation

Paper • 2602.16705 • Published Feb 18 • 26

upvoted a paper 5 months ago

VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model

Paper • 2602.10098 • Published Feb 10 • 22

Zekun Qi

AI & ML interests

Recent Activity

Organizations

qizekun's activity