Tian Shulin's picture

Tian Shulin

shulin16

·

https://shulin16.github.io/

AI & ML interests

None yet

Recent Activity

updated a dataset 2 days ago

egotools-dev/egotools_v4_backfilled_sft_v5_902_20260623

published a dataset 2 days ago

egotools-dev/egotools_v4_backfilled_sft_v5_902_20260623

authored a paper 6 days ago

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

View all activity

Organizations

updated a dataset 2 days ago

egotools-dev/egotools_v4_backfilled_sft_v5_902_20260623

Preview • Updated 2 days ago • 9

published a dataset 2 days ago

egotools-dev/egotools_v4_backfilled_sft_v5_902_20260623

Preview • Updated 2 days ago • 9

authored 8 papers 6 days ago

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

Paper • 2510.13759 • Published Oct 15, 2025 • 11

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Paper • 2602.08439 • Published Feb 9 • 28

Insight-V++: Towards Advanced Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2603.18118 • Published Mar 18 • 12

PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning

Paper • 2603.26653 • Published Mar 27 • 18

HippoCamp: Benchmarking Contextual Agents on Personal Computers

Paper • 2604.01221 • Published Apr 1 • 30

A Simple Baseline for Streaming Video Understanding

Paper • 2604.02317 • Published Apr 2 • 74

FileGram: Grounding Agent Personalization in File-System Behavioral Traces

Paper • 2604.04901 • Published Apr 6 • 40

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Paper • 2606.20515 • Published 8 days ago • 39

upvoted a paper 6 days ago

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Paper • 2606.20515 • Published 8 days ago • 39

upvoted a paper 8 days ago

Show the Signal, Hide the Noise: Spectral Forcing for Pixel-Space Diffusion

Paper • 2606.15236 • Published 10 days ago • 21

published 2 models 10 days ago

rmedev/dreamzero-droid14b-robomme-lora-ckpt1000

Updated 10 days ago

rmedev/dreamzero-droid14b-robomme-lora-80g8-260615-095102

Updated 10 days ago

upvoted 2 papers 14 days ago

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Paper • 2606.02437 • Published 25 days ago • 232

Agents' Last Exam

Paper • 2606.05405 • Published 23 days ago • 363

upvoted a paper 24 days ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published 29 days ago • 146

upvoted a paper 25 days ago

Function2Scene: 3D Indoor Scene Layout from Functional Specifications

Paper • 2605.30819 • Published 28 days ago • 42

upvoted 2 papers 29 days ago

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published May 12 • 194

MolmoAct2: Action Reasoning Models for Real-world Deployment

Paper • 2605.02881 • Published May 4 • 355