Hugging Face

Sihan XU

sihanxu
https://sihanxu.github.io/
  • 6SihanXu
  • SihanXU

Recent Activity

upvoted a paper 4 days ago
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
liked a model 28 days ago
SixAILab/nepa-base-patch14-224-sft
updated a model about 1 month ago
SixAILab/nepa-large-patch14-224-sft

Organizations

  • University of Michigan
  • Situated Language and Embodied Dialogue Lab
  • SixAILab
  • 2Infinity Lab
  • Forty-Two AI Lab

authored 3 papers about 1 month ago

Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation

Paper • 2504.16060 • Published Apr 22, 2025

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Paper • 2506.18890 • Published Jun 23, 2025 • 6

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 84
authored a paper over 1 year ago

Multi-Object Hallucination in Vision-Language Models

Paper • 2407.06192 • Published Jul 8, 2024 • 12
authored 2 papers about 2 years ago

CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation

Paper • 2310.13165 • Published Oct 19, 2023

Inversion-Free Image Editing with Natural Language

Paper • 2312.04965 • Published Dec 7, 2023 • 2