Zhiqiu Lin PRO

zhiqiulin

·

https://linzhiqiu.github.io

linzhiqiu

AI & ML interests

None yet

Recent Activity

posted an update 26 days ago

🚀 VQAScore now supports text-to-video evaluation! VQAScore scores how well a generated image or video matches a prompt by asking a VLM "does this show {prompt}?" and using P(Yes). It became a go-to evaluation metric and reward model for image generation (2M+ downloads), and we just added text-to-video support across 20+ VLMs (GPT, Gemini, Qwen). Free and open-source, and it keeps improving as VLMs improve. 💻 Code: https://github.com/linzhiqiu/t2v_metrics 📄 Paper: https://arxiv.org/abs/2404.01291 🧵 Launch thread + demo video: https://x.com/ZhiqiuLin/status/2064316582461841499

updated a dataset about 1 month ago

zhiqiulin/caption_export

liked a dataset 2 months ago

chancharikm/CHAI_testset

View all activity

Organizations

upvoted a paper 2 months ago

Building a Precise Video Language with Human-AI Oversight

Paper • 2604.21718 • Published Apr 22 • 17

upvoted 3 papers about 1 year ago

Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers

Paper • 2412.00142 • Published Nov 28, 2024 • 5

Towards Understanding Camera Motions in Any Video

Paper • 2504.15376 • Published Apr 21, 2025 • 157

AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis

Paper • 2504.13157 • Published Apr 17, 2025 • 20

upvoted 2 papers over 1 year ago

Motion Prompting: Controlling Video Generation with Motion Trajectories

Paper • 2412.02700 • Published Dec 3, 2024 • 16

NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples

Paper • 2410.14669 • Published Oct 18, 2024 • 39

upvoted a paper almost 2 years ago

GenAI-Bench: Evaluating and Improving Compositional Text-to-Visual Generation

Paper • 2406.13743 • Published Jun 19, 2024 • 2

upvoted a paper about 2 years ago

Evaluating Text-to-Visual Generation with Image-to-Text Generation

Paper • 2404.01291 • Published Apr 1, 2024 • 6

upvoted 2 papers over 2 years ago

The Neglected Tails of Vision-Language Models

Paper • 2401.12425 • Published Jan 23, 2024 • 3

Language Models as Black-Box Optimizers for Vision-Language Models

Paper • 2309.05950 • Published Sep 12, 2023 • 4