1 8 2

Yikai Wang

yikaiwang

https://yikai-wang.github.io

Yikai-Wang

AI & ML interests

Disclaimer: Some of listed papers are not mine, but Hugging Face does not allow me to remove them.

Recent Activity

authored a paper 17 days ago

Direct 3D-Aware Object Insertion via Decomposed Visual Proxies

upvoted a paper 18 days ago

Direct 3D-Aware Object Insertion via Decomposed Visual Proxies

upvoted a paper 29 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

View all activity

Organizations

None yet

authored a paper 17 days ago

Direct 3D-Aware Object Insertion via Decomposed Visual Proxies

Paper • 2606.06601 • Published 22 days ago • 26

upvoted a paper 18 days ago

Direct 3D-Aware Object Insertion via Decomposed Visual Proxies

Paper • 2606.06601 • Published 22 days ago • 26

upvoted a paper 29 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published about 1 month ago • 75

upvoted a paper about 2 months ago

Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

Paper • 2604.24763 • Published Apr 27 • 71

upvoted a paper 8 months ago

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published Oct 16, 2025 • 70

authored a paper 9 months ago

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Paper • 2510.08673 • Published Oct 9, 2025 • 128

upvoted a paper 9 months ago

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Paper • 2510.08673 • Published Oct 9, 2025 • 128

published a model 9 months ago

yikaiwang/NVG

Updated Oct 7, 2025 • 1

updated a model 9 months ago

yikaiwang/NVG

Updated Oct 7, 2025 • 1

authored a paper 10 months ago

Next Visual Granularity Generation

Paper • 2508.12811 • Published Aug 18, 2025 • 49

upvoted a paper 10 months ago

Next Visual Granularity Generation

Paper • 2508.12811 • Published Aug 18, 2025 • 49

commented a paper 10 months ago

Next Visual Granularity Generation

Paper • 2508.12811 • Published Aug 18, 2025 • 49 •

upvoted a paper 11 months ago

Reconstructing 4D Spatial Intelligence: A Survey

Paper • 2507.21045 • Published Jul 28, 2025 • 38

updated a dataset about 1 year ago

yikaiwang/MISATO

Updated Jun 24, 2025 • 7

updated a model about 1 year ago

yikaiwang/ASUKA-FLUX.1-Fill

Updated Jun 11, 2025 • 4

published a model about 1 year ago

yikaiwang/ASUKA-FLUX.1-Fill

Updated Jun 11, 2025 • 4

published a dataset about 1 year ago

yikaiwang/MISATO

Updated Jun 24, 2025 • 7

liked a model over 1 year ago

jingwei-xu-00/pretrained_backup_for_streetunveiler

Image-to-3D • Updated Mar 4, 2025 • 1

authored 2 papers over 1 year ago

Unified Lexical Representation for Interpretable Visual-Language Alignment

Paper • 2407.17827 • Published Jul 25, 2024 • 1

3D StreetUnveiler with Semantic-Aware 2DGS

Paper • 2405.18416 • Published May 28, 2024

Yikai Wang

AI & ML interests

Recent Activity

Organizations

yikaiwang's activity