LMMs-Lab-SI

community

https://www.lmms-lab.com/

EvolvingLMMs-Lab

AI & ML interests

Feeling and building the multimodal intelligence.

Recent Activity

oscarqjh updated a dataset about 10 hours ago

lmms-lab-si/EASI-Leaderboard-Requests

yl-1993 authored a paper 4 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

oscarqjh authored a paper 19 days ago

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

updated a dataset about 10 hours ago

lmms-lab-si/EASI-Leaderboard-Requests

Preview • Updated about 10 hours ago • 18

authored a paper 4 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published 6 days ago • 68

authored a paper 19 days ago

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published 21 days ago • 191

authored a paper 20 days ago

EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents

Paper • 2602.23205 • Published Feb 26 • 11

authored a paper 2 months ago

Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer

Paper • 2603.19227 • Published Mar 19 • 42

authored a paper 2 months ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 372

authored 3 papers 3 months ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 372

The Quest for Generalizable Motion Generation: Data, Model, and Evaluation

Paper • 2510.26794 • Published Oct 30, 2025 • 27

ConsistCompose: Unified Multimodal Layout Control for Image Composition

Paper • 2511.18333 • Published Nov 23, 2025 • 5

authored a paper 3 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 525

updated a dataset 4 months ago

lmms-lab-si/EASI-Leaderboard-Data

Preview • Updated Feb 12 • 795 • 1

published a model 6 months ago

lmms-lab-si/third-party-models

Updated Dec 4, 2025