13 9

JInatao rong

euminds

AI & ML interests

None yet

Recent Activity

upvoted a paper 15 days ago

WorldOlympiad: Can Your World Model Survive a Triathlon?

upvoted a paper 16 days ago

Latent Spatial Memory for Video World Models

upvoted a paper 21 days ago

Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching

View all activity

Organizations

None yet

upvoted a paper 15 days ago

WorldOlympiad: Can Your World Model Survive a Triathlon?

Paper • 2606.11129 • Published 16 days ago • 31

upvoted a paper 16 days ago

Latent Spatial Memory for Video World Models

Paper • 2606.09828 • Published 17 days ago • 69

upvoted a paper 21 days ago

Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching

Paper • 2606.03577 • Published 23 days ago • 16

upvoted 2 papers about 1 month ago

Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization

Paper • 2605.15980 • Published May 15 • 36

Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video

Paper • 2605.15182 • Published May 14 • 39

upvoted a paper about 2 months ago

MARBLE: Multi-Aspect Reward Balance for Diffusion RL

Paper • 2605.06507 • Published May 7 • 40

liked a dataset 3 months ago

ropedia-ai/xperience-10m

Updated Apr 21 • 85.5k • 207

liked 2 models 4 months ago

yetter-ai/Wan2.2-TI2V-5B-Turbo-Diffusers

Updated Nov 12, 2025 • 573 • 6

Kijai/WanVideo_comfy

Updated 11 days ago • 2.09M • 2.39k

upvoted a paper 4 months ago

Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO

Paper • 2602.06422 • Published Feb 6 • 47

upvoted a paper 7 months ago

Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality

Paper • 2512.07951 • Published Dec 8, 2025 • 51

upvoted an article 7 months ago

Article

Diffusers welcomes FLUX-2

YiYiXu, dg845, sayakpaul, OzzyGT, dn6, ariG23498, linoyts, multimodalart

•

Nov 25, 2025

• 190

upvoted a paper 7 months ago

Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation

Paper • 2511.20714 • Published Nov 25, 2025 • 51

liked a model 7 months ago

black-forest-labs/FLUX.2-dev

Image-to-Image • Updated Feb 17 • 307k • • 1.82k

upvoted a paper 8 months ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 117

liked a dataset 9 months ago

InternRobotics/OmniWorld

Viewer • Updated Apr 17 • 7.09B • 47.1k • 94

upvoted a paper 9 months ago

OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling

Paper • 2509.12201 • Published Sep 15, 2025 • 107

liked a Space over 1 year ago

VideoLLaMA2 AV

🚀

VideoLLaMA2-AV

upvoted a paper almost 2 years ago

MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence

Paper • 2407.16655 • Published Jul 23, 2024 • 30

liked a model almost 2 years ago

meta-llama/Meta-Llama-3-8B

Text Generation • 8B • Updated Sep 27, 2024 • 1.28M • • 6.59k