Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Sihan XU's picture
5 12 20

Sihan XU

sihanxu
shengyi-qian's profile picture yifan-Eva's profile picture ExplorerFreda's profile picture
·
https://sihanxu.github.io/
  • 6SihanXu
  • SihanXU

AI & ML interests

None yet

Recent Activity

upvoted an article 6 days ago
NEO-unify: Building Native Multimodal Unified Models End to End
upvoted a paper 10 days ago
Beyond Language Modeling: An Exploration of Multimodal Pretraining
upvoted a paper about 2 months ago
WildRayZer: Self-supervised Large View Synthesis in Dynamic Environments
View all activity

Organizations

University of Michigan's profile picture Situated Language and Embodied Dialogue Lab's profile picture SixAILab's profile picture 2Infinity Lab's profile picture Forty-Two AI Lab's profile picture

authored 3 papers 3 months ago

Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation

Paper • 2504.16060 • Published Apr 22, 2025

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Paper • 2506.18890 • Published Jun 23, 2025 • 6

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 87
authored a paper over 1 year ago

Multi-Object Hallucination in Vision-Language Models

Paper • 2407.06192 • Published Jul 8, 2024 • 12
authored 2 papers over 2 years ago

CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation

Paper • 2310.13165 • Published Oct 19, 2023

Inversion-Free Image Editing with Natural Language

Paper • 2312.04965 • Published Dec 7, 2023 • 2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs