2 17 3

Tong He

tonghe90

http://tonghe90.github.io

AI & ML interests

SII is an institution dedicated to innovation in education and research in the field of AI

Recent Activity

upvoted a paper 25 days ago

VLM3: Vision Language Models Are Native 3D Learners

upvoted a paper about 1 month ago

Geo-Align: Video Generation Alignment via Metric Geometry Reward

upvoted a paper about 1 month ago

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

View all activity

Organizations

upvoted a paper 25 days ago

VLM3: Vision Language Models Are Native 3D Learners

Paper • 2605.30561 • Published 29 days ago • 26

upvoted 3 papers about 1 month ago

Geo-Align: Video Generation Alignment via Metric Geometry Reward

Paper • 2605.23903 • Published May 22 • 10

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published May 14 • 91

Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video

Paper • 2605.15182 • Published May 14 • 39

submitted a paper to Daily Papers about 1 month ago

Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video

Paper • 2605.15182 • Published May 14 • 39

upvoted 2 papers 3 months ago

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

Paper • 2603.25730 • Published Mar 26 • 53

WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

Paper • 2603.23497 • Published Mar 24 • 92

upvoted 2 papers 6 months ago

VINO: A Unified Visual Generator with Interleaved OmniModal Context

Paper • 2601.02358 • Published Jan 5 • 30

Yume-1.5: A Text-Controlled Interactive World Generation Model

Paper • 2512.22096 • Published Dec 26, 2025 • 61

upvoted 2 papers 9 months ago

BRIDGE - Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation

Paper • 2509.25077 • Published Sep 29, 2025 • 15

Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation

Paper • 2509.15185 • Published Sep 18, 2025 • 29

authored 5 papers 9 months ago

upvoted a paper 9 months ago

OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling

Paper • 2509.12201 • Published Sep 15, 2025 • 107

liked a dataset 9 months ago

InternRobotics/OmniWorld

Viewer • Updated Apr 17 • 7.09B • 46.8k • 94

liked a model 10 months ago

facebook/MobileLLM-R1-950M

Text Generation • 0.9B • Updated Sep 30, 2025 • 139 • 359

upvoted a paper 10 months ago

WinT3R: Window-Based Streaming Reconstruction with Camera Token Pool

Paper • 2509.05296 • Published Sep 5, 2025 • 8

Tong He

AI & ML interests

Recent Activity

Organizations

tonghe90's activity