Yiwei Guo's picture

Yiwei Guo

cantabile-kwok

·

cantabile-kwok

AI & ML interests

Text to Speech

Organizations

upvoted a paper 9 months ago

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 171

upvoted a paper about 1 year ago

SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

Paper • 2503.08625 • Published Mar 11, 2025 • 27

upvoted a paper over 1 year ago

MobA: A Two-Level Agent System for Efficient Mobile Task Automation

Paper • 2410.13757 • Published Oct 17, 2024 • 32