Sihan XU's picture

Sihan XU

sihanxu

·

https://sihanxu.github.io/

AI & ML interests

None yet

Recent Activity

updated a model 16 days ago

sihanxu/optimal-gemini-8b-NPO-Llama3-8B-L7-gate_proj

published a model 16 days ago

sihanxu/optimal-gemini-8b-NPO-Llama3-8B-L7-gate_proj

upvoted a paper about 1 month ago

Steerable Visual Representations

View all activity

Organizations

authored 3 papers 5 months ago

Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation

Paper • 2504.16060 • Published Apr 22, 2025

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Paper • 2506.18890 • Published Jun 23, 2025 • 6

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 89

authored a paper almost 2 years ago

Multi-Object Hallucination in Vision-Language Models

Paper • 2407.06192 • Published Jul 8, 2024 • 12

authored 2 papers over 2 years ago

CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation

Paper • 2310.13165 • Published Oct 19, 2023

Inversion-Free Image Editing with Natural Language

Paper • 2312.04965 • Published Dec 7, 2023 • 2