Ray Yang

rayruiyang

5 16 6

Yangr116

AI & ML interests

None yet

Recent Activity

updated a collection 14 days ago

VST

upvoted a paper about 1 month ago

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

upvoted a paper about 2 months ago

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

Paper • 2605.18739 • Published May 18 • 116

upvoted 3 papers about 2 months ago

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

Paper • 2605.13779 • Published May 13 • 223

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

Paper • 2605.13831 • Published May 13 • 89

AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in UMMs via Decompositional Verifiable Reward

Paper • 2605.12495 • Published May 12 • 35

upvoted 4 papers 3 months ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published Apr 6 • 116

ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?

Paper • 2603.25823 • Published Mar 26 • 44

MolmoPoint: Better Pointing for VLMs with Grounding Tokens

Paper • 2603.28069 • Published Mar 30 • 9

ProAct: Agentic Lookahead in Interactive Environments

Paper • 2602.05327 • Published Feb 5 • 27

upvoted 2 papers 4 months ago

Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence

Paper • 2603.07660 • Published Mar 8 • 87

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published Mar 3 • 186

upvoted a paper 7 months ago

DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning

Paper • 2512.12799 • Published Dec 14, 2025 • 12

upvoted a paper 8 months ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 218

upvoted a collection 8 months ago

VST

Collection

A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities. • 7 items • Updated 14 days ago • 6

upvoted 2 papers 8 months ago

Visual Spatial Tuning

Paper • 2511.05491 • Published Nov 7, 2025 • 53

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published Oct 27, 2025 • 181

upvoted a paper over 1 year ago

GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction

Paper • 2305.18752 • Published May 30, 2023 • 5

Ray Yang

AI & ML interests

Recent Activity

Organizations

rayruiyang's activity