AI & ML interests

None defined yet.

Recent Activity

zhiqiulinĀ 
posted an update 17 days ago
view post
Post
116
šŸš€ VQAScore now supports text-to-video evaluation!

VQAScore scores how well a generated image or video matches a prompt by asking a VLM "does this show {prompt}?" and using P(Yes). It became a go-to evaluation metric and reward model for image generation (2M+ downloads), and we just added text-to-video support across 20+ VLMs (GPT, Gemini, Qwen). Free and open-source, and it keeps improving as VLMs improve.

šŸ’» Code: https://github.com/linzhiqiu/t2v_metrics
šŸ“„ Paper: https://arxiv.org/abs/2404.01291
🧵 Launch thread + demo video: https://x.com/ZhiqiuLin/status/2064316582461841499
  • 1 reply
Ā·