zhiyuanyou

7 11 11

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

We’re open-sourcing our text-to-image model and the process behind it

upvoted an article about 1 month ago

Text-to-image Architectural Experiments

upvoted an article about 1 month ago

PRX Part 3 — Training a Text-to-Image Model in 24h!

View all activity

Organizations

None yet

upvoted 4 articles about 1 month ago

Article

We’re open-sourcing our text-to-image model and the process behind it

Photoroom

•

Nov 12, 2025

• 100

Article

Text-to-image Architectural Experiments

Photoroom

•

Nov 13, 2025

• 60

Article

PRX Part 3 — Training a Text-to-Image Model in 24h!

Photoroom

•

Mar 3

• 67

Article

Training Design for Text-to-Image Models: Lessons from Ablations

Photoroom

•

Feb 3

• 77

updated a model about 1 month ago

zhiyuanyou/PhotoFramer-preview

Image-Text-to-Image • Updated May 26

published a model about 1 month ago

zhiyuanyou/PhotoFramer-preview

Image-Text-to-Image • Updated May 26

upvoted a paper 2 months ago

AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model

Paper • 2604.19747 • Published Apr 21 • 40

updated a dataset 3 months ago

zhiyuanyou/Datasets-PhotoFramer-Assessment

Preview • Updated Apr 18 • 46

published a dataset 3 months ago

zhiyuanyou/Datasets-PhotoFramer-Assessment

Preview • Updated Apr 18 • 46

updated a model 3 months ago

zhiyuanyou/Qwen2.5-VL-7B-GRPO-Composition-Score-Class

Image-Text-to-Text • 8B • Updated Apr 17 • 622

published a model 3 months ago

zhiyuanyou/Qwen2.5-VL-7B-GRPO-Composition-Score-Class

Image-Text-to-Text • 8B • Updated Apr 17 • 622

upvoted a paper 5 months ago

Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis

Paper • 2602.03139 • Published Feb 3 • 45

liked a model 5 months ago

OmniGen2/OmniGen2-EditScore7B-v1.1

Updated Oct 27, 2025 • 7 • 6

authored a paper 5 months ago

RouteMoA: Dynamic Routing without Pre-Inference Boosts Efficient Mixture-of-Agents

Paper • 2601.18130 • Published Jan 26 • 2

upvoted a paper 6 months ago

UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

Paper • 2512.21675 • Published Dec 25, 2025 • 28

New activity in zhiyuanyou/DeQA-Score-Mix3 6 months ago

Adding `safetensors` variant of this model

#6 opened 6 months ago by

SFconvertbot

New activity in zhiyuanyou/DeQA-Score-Mix3 7 months ago

Fix llama prompt

#5 opened 7 months ago by

vniclas

updated a model 7 months ago

zhiyuanyou/DeQA-Score-Mix3

Image-to-Text • 8B • Updated Dec 24, 2025 • 9.61k • 6

liked a Space 9 months ago

vggt

🏆

472

VGGT (CVPR 2025)

liked a model 9 months ago

nyu-visionx/RAE-collections

Unconditional Image Generation • Updated Mar 1 • 47

zhiyuanyou

AI & ML interests

Recent Activity

Organizations

zhiyuanyou's activity

We’re open-sourcing our text-to-image model and the process behind it

Text-to-image Architectural Experiments

PRX Part 3 — Training a Text-to-Image Model in 24h!

Training Design for Text-to-Image Models: Lessons from Ablations

Adding `safetensors` variant of this model

Fix llama prompt

vggt