Zhaochong An

ZhaochongAn

·

https://zhaochongan.github.io/

ZhaochongAn

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

FLAT: Feedforward Latent Triangle Splatting for Geometrically Accurate Scene Generation

upvoted a paper 15 days ago

Track2View: 4D-Consistent Camera-Controlled Video Generation via Paired 3D Point Tracks

upvoted a paper 19 days ago

Echo-Memory: A Controlled Study of Memory in Action World Models

View all activity

Organizations

upvoted a paper 8 days ago

FLAT: Feedforward Latent Triangle Splatting for Geometrically Accurate Scene Generation

Paper • 2606.24876 • Published 9 days ago • 22

upvoted a paper 15 days ago

Track2View: 4D-Consistent Camera-Controlled Video Generation via Paired 3D Point Tracks

Paper • 2606.15534 • Published 18 days ago • 12

upvoted a paper 19 days ago

Echo-Memory: A Controlled Study of Memory in Action World Models

Paper • 2606.09803 • Published 24 days ago • 32

upvoted a paper about 1 month ago

Stitched Value Model for Diffusion Alignment

Paper • 2605.19804 • Published May 19 • 12

upvoted 2 papers 2 months ago

Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

Paper • 2604.24763 • Published Apr 27 • 71

(1D) Ordered Tokens Enable Efficient Test-Time Search

Paper • 2604.15453 • Published Apr 16 • 18

upvoted a paper 3 months ago

VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward

Paper • 2603.26599 • Published Mar 27 • 67

upvoted a paper 6 months ago

HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming

Paper • 2512.21338 • Published Dec 24, 2025 • 23

upvoted 3 papers 7 months ago

OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory

Paper • 2512.07802 • Published Dec 8, 2025 • 47

Scaling Zero-Shot Reference-to-Video Generation

Paper • 2512.06905 • Published Dec 7, 2025 • 29

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published Dec 1, 2025 • 78

upvoted 2 papers over 1 year ago

Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation

Paper • 2410.22489 • Published Oct 29, 2024 • 1

Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model

Paper • 2503.16282 • Published Mar 20, 2025 • 6