yl-1993

https://yanglei.me

AI & ML interests

None yet

Recent Activity

liked a model about 22 hours ago

sensenova/SenseNova-U1-8B-MoT-Infographic-V2

liked a model 18 days ago

sensenova/SenseNova-U1-8B-MoT-Interleaved

authored a paper about 1 month ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

upvoted a paper about 1 month ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published May 27 • 75

upvoted 3 papers about 2 months ago

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published May 12 • 194

UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors

Paper • 2605.00658 • Published May 1 • 86

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published Apr 30 • 92

upvoted 4 papers 3 months ago

MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction

Paper • 2603.19231 • Published Mar 19 • 37

Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer

Paper • 2603.19227 • Published Mar 19 • 42

Kinema4D: Kinematic 4D World Modeling for Spatiotemporal Embodied Simulation

Paper • 2603.16669 • Published Mar 17 • 70

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 373

upvoted a paper 4 months ago

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

Paper • 2603.15612 • Published Mar 16 • 153

upvoted an article 4 months ago

NEO-unify: Building Native Multimodal Unified Models End to End

Article • sensenova • Published Mar 5 • 167

upvoted 4 papers 4 months ago

SenseNova-MARS: Empowering Multimodal Agentic Reasoning and Search via Reinforcement Learning

Paper • 2512.24330 • Published Dec 30, 2025 • 36

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published Feb 2 • 275

ConsistCompose: Unified Multimodal Layout Control for Image Composition

Paper • 2511.18333 • Published Nov 23, 2025 • 5

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 526

upvoted a collection 5 months ago

NEO1_0

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale • 7 items • Updated Jan 27 • 9

upvoted 5 collections 6 months ago

Encoders-Lightx2v

2 items • Updated May 28 • 4

Wan2.1-Lightx2v

4 items • Updated May 28 • 2

Wan2.2-Lightx2v

4 items • Updated May 28 • 10

Qwen-Image-Lightx2v

4 items • Updated May 28 • 10

NVFP4-Lightx2v

2 items • Updated May 28 • 10