2 34 6

Tian Shulin

shulin16

https://shulin16.github.io/

AI & ML interests

None yet

Recent Activity

updated a dataset 3 days ago

egotools-dev/egotools_v4_backfilled_sft_v5_902_20260623

published a dataset 3 days ago

egotools-dev/egotools_v4_backfilled_sft_v5_902_20260623

authored a paper 7 days ago

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

View all activity

Organizations

upvoted a paper 7 days ago

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Paper • 2606.20515 • Published 9 days ago • 39

upvoted a paper 9 days ago

Show the Signal, Hide the Noise: Spectral Forcing for Pixel-Space Diffusion

Paper • 2606.15236 • Published 11 days ago • 21

upvoted 2 papers 15 days ago

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Paper • 2606.02437 • Published 26 days ago • 233

Agents' Last Exam

Paper • 2606.05405 • Published 24 days ago • 364

upvoted 2 papers 25 days ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published 30 days ago • 146

Function2Scene: 3D Indoor Scene Layout from Functional Specifications

Paper • 2605.30819 • Published 29 days ago • 42

upvoted 3 papers 29 days ago

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published May 12 • 194

MolmoAct2: Action Reasoning Models for Real-world Deployment

Paper • 2605.02881 • Published May 4 • 355

SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

Paper • 2605.27367 • Published May 26 • 72

upvoted a paper about 1 month ago

PhysX-Omni: Unified Simulation-Ready Physical 3D Generation for Rigid, Deformable, and Articulated Objects

Paper • 2605.21572 • Published May 20 • 55

upvoted a paper 2 months ago

VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images

Paper • 2604.09531 • Published Apr 10 • 10

upvoted a paper 3 months ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published Apr 6 • 237

upvoted an article 3 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 909

upvoted 5 papers 3 months ago

Insight-V++: Towards Advanced Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2603.18118 • Published Mar 18 • 12

upvoted 2 papers 4 months ago

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

Paper • 2603.03269 • Published Mar 3 • 63

Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence

Paper • 2603.07660 • Published Mar 8 • 87

Tian Shulin

AI & ML interests

Recent Activity

Organizations

shulin16's activity

Welcome Gemma 4: Frontier multimodal intelligence on device