2 21 3

Ruowen Zhao

zzzrw

https://zhaorw02.github.io/

AI & ML interests

Multi-modal Learning & Embodied AI

Recent Activity

upvoted a paper 1 day ago

ViQ: Text-Aligned Visual Quantized Representations at Any Resolution

updated a dataset 20 days ago

zzzrw/GEM-250K

updated a model 24 days ago

zzzrw/qwen3vl2b_checkpoints

View all activity

Organizations

None yet

upvoted a paper 1 day ago

ViQ: Text-Aligned Visual Quantized Representations at Any Resolution

Paper • 2606.27313 • Published 2 days ago • 37

upvoted a paper 29 days ago

minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models

Paper • 2605.30263 • Published about 1 month ago • 59

upvoted 2 papers about 1 month ago

GEM: Generative Supervision Helps Embodied Intelligence

Paper • 2605.28548 • Published May 27 • 32

Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation

Paper • 2605.15141 • Published May 14 • 96

upvoted a paper 3 months ago

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published Apr 8 • 181

upvoted a paper 5 months ago

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Paper • 2602.08439 • Published Feb 9 • 28

upvoted a paper 11 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 276

upvoted 2 papers 12 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 161

LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion

Paper • 2507.02813 • Published Jul 3, 2025 • 60

upvoted 2 papers about 1 year ago

ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding

Paper • 2506.01853 • Published Jun 2, 2025 • 32

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7, 2025 • 110

upvoted 7 papers over 1 year ago

OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

Paper • 2501.03841 • Published Jan 7, 2025 • 56

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 454

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Paper • 2411.09595 • Published Nov 14, 2024 • 78

upvoted 2 papers about 2 years ago

CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model

Paper • 2403.05034 • Published Mar 8, 2024 • 21

FlexiDreamer: Single Image-to-3D Generation with FlexiCubes

Paper • 2404.00987 • Published Apr 1, 2024 • 23

Ruowen Zhao

AI & ML interests

Recent Activity

Organizations

zzzrw's activity