MMLab@NTU

university

https://www.mmlab-ntu.com/

AI & ML interests

Computer Vision and Deep Learning

Recent Activity

ldkong authored a paper about 14 hours ago

Towards Unified World Models for Visual Navigation via Memory-Augmented Planning and Foresight

ldkong authored a paper about 14 hours ago

Language-Conditioned World Modeling for Visual Navigation

ldkong authored a paper about 14 hours ago

AI for Auto-Research: Roadmap & User Guide

Papers

MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction

Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer

ldkong authored 3 papers about 14 hours ago

Towards Unified World Models for Visual Navigation via Memory-Augmented Planning and Foresight

Paper • 2510.08713 • Published Mar 22 • 1

Language-Conditioned World Modeling for Visual Navigation

Paper • 2603.26741 • Published Mar 23

AI for Auto-Research: Roadmap & User Guide

Paper • 2605.18661 • Published 3 days ago • 58

submitted a paper to Daily Papers 2 days ago

AI for Auto-Research: Roadmap & User Guide

Paper • 2605.18661 • Published 3 days ago • 58

authored 3 papers 7 days ago

GRNet: Gridding Residual Network for Dense Point Cloud Completion

Paper • 2006.03761 • Published Jun 6, 2020

InfiniteDance: Scalable 3D Dance Generation Towards in-the-wild Generalization

Paper • 2603.13375 • Published Mar 10 • 3

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published 9 days ago • 184

authored 13 papers 17 days ago

BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment

Paper • 2104.13371 • Published Apr 27, 2021 • 2

SparseNeRF: Distilling Depth Ranking for Few-shot Novel View Synthesis

Paper • 2303.16196 • Published Mar 28, 2023

Correlational Image Modeling for Self-Supervised Visual Pre-Training

Paper • 2303.12670 • Published Mar 22, 2023

CelebV-Text: A Large-Scale Facial Text-Video Dataset

Paper • 2303.14717 • Published Mar 26, 2023

Iterative Prompt Learning for Unsupervised Backlit Image Enhancement

Paper • 2303.17569 • Published Mar 30, 2023

Self-Supervised Learning via Conditional Motion Propagation

Paper • 1903.11412 • Published Mar 27, 2019

ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks

Paper • 1809.00219 • Published Sep 1, 2018

Interpret Vision Transformers as ConvNets with Dynamic Convolutions

Paper • 2309.10713 • Published Sep 19, 2023 • 1

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation

Paper • 2309.13042 • Published Sep 22, 2023 • 9

Deep Geometrized Cartoon Line Inbetweening

Paper • 2309.16643 • Published Sep 28, 2023 • 26

LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models

Paper • 2309.15103 • Published Sep 26, 2023 • 43

Text2Performer: Text-Driven Human Video Generation

Paper • 2304.08483 • Published Apr 17, 2023

SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation

Paper • 2309.17448 • Published Sep 29, 2023 • 1