👋 Open to Work

Chengxuan Qian

Raymond-Qiancx

https://qiancx.com/

AI & ML interests

Vision-Language Models

Recent Activity

upvoted a paper about 6 hours ago

ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing?

upvoted a paper about 6 hours ago

World Action Models: A Survey

upvoted a paper about 6 hours ago

Holistic Data Scheduler for LLM Pre-training via Multi-Objective Reinforcement Learning

Organizations

None yet

upvoted 6 papers about 6 hours ago

ImageWAM: Do World Action Models Really Need Video Generation, or Just Image Editing?

Paper • 2606.19531 • Published 9 days ago • 19

World Action Models: A Survey

Paper • 2606.20781 • Published 8 days ago • 52

Holistic Data Scheduler for LLM Pre-training via Multi-Objective Reinforcement Learning

Paper • 2606.24133 • Published 3 days ago • 6

Autodata: An agentic data scientist to create high quality synthetic data

Paper • 2606.25996 • Published 2 days ago • 9

UnityShots: Memory-Driven Multi-Shot Audio-Video Generation with Boundary-Aware Gating

Paper • 2606.21661 • Published 7 days ago • 21

Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation

Paper • 2606.26907 • Published 1 day ago • 26

upvoted 4 papers 15 days ago

WorldMemArena: Evaluating Multimodal Agent Memory Through Action-World Interaction

Paper • 2605.29341 • Published 29 days ago • 18

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 25 days ago • 135

Benchmark Everything Everywhere All at Once

Paper • 2606.06462 • Published 22 days ago • 4

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

Paper • 2606.03988 • Published 23 days ago • 125

liked a dataset about 1 month ago

CSU-JPG/MIND

Preview • Updated Feb 10 • 1.77k • 3

upvoted a paper about 1 month ago

WorldAct: Activating Monolithic 3D Worlds into Interactive-Ready Object-Centric Scenes

Paper • 2605.15843 • Published May 15 • 6

upvoted a collection about 1 month ago

Qwen3.5

Collection

21 items • Updated Mar 9 • 1.69k

upvoted 2 papers about 1 month ago

TideGS: Scalable Training of Over One Billion 3D Gaussian Splatting Primitives via Out-of-Core Optimization

Paper • 2605.20150 • Published May 19 • 7

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

Paper • 2605.18739 • Published May 18 • 115

liked a model about 1 month ago

google/gemma-2-2b-it

Text Generation • 3B • Updated Aug 27, 2024 • 392k • 1.4k

upvoted 4 papers about 1 month ago

MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory

Paper • 2605.15128 • Published May 14 • 64

Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video

Paper • 2605.15182 • Published May 14 • 39

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published May 14 • 91

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

Paper • 2605.10434 • Published May 11 • 29
