Shusheng Yang PRO

ShushengYang

https://shushengyang.com

AI & ML interests

computer vision, vision language model

Recent Activity

authored a paper 18 days ago

BLIP3o-NEXT: Next Frontier of Native Image Generation

authored a paper 18 days ago

VideoNSA: Native Sparse Attention Scales Video Understanding

authored a paper 18 days ago

Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts

upvoted a paper 26 days ago

Benchmarking Visual State Tracking in Multimodal Video Understanding

Paper • 2606.03920 • Published 27 days ago • 52

upvoted a paper 4 months ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published Mar 3 • 107

upvoted a paper 5 months ago

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

Paper • 2601.16208 • Published Jan 22 • 55

upvoted 2 collections 6 months ago

Cambrian-S-Data

Data used during Cambrian-S's 4-stage training • 4 items • Updated 3 days ago • 5

Cambrian-S Models

18 items • Updated Mar 2 • 8

upvoted 2 papers 8 months ago

Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts

Paper • 2511.04655 • Published Nov 6, 2025 • 10

Cambrian-S: Towards Spatial Supersensing in Video

Paper • 2511.04670 • Published Nov 6, 2025 • 40

upvoted 2 papers 9 months ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 171

Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training

Paper • 2509.26625 • Published Sep 30, 2025 • 44

upvoted a paper over 1 year ago

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

Paper • 2412.14171 • Published Dec 18, 2024 • 24

upvoted a collection about 2 years ago

Cambrian-1 Models

6 items • Updated Feb 27 • 21

upvoted a paper about 2 years ago

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

Paper • 2406.16860 • Published Jun 24, 2024 • 63

upvoted a collection about 2 years ago

Cambrian Data

3 items • Updated Jun 25, 2024 • 12