Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yiwei Guo's picture
2 3 4

Yiwei Guo

cantabile-kwok
Gatozu35's profile picture
·
  • cantabile-kwok

AI & ML interests

Text to Speech

Recent Activity

liked a model 12 days ago
microsoft/VibeVoice-ASR
new activity about 2 months ago
HKUSTAudio/Audio-FLAN-Dataset:"audio_files/speech/188_HQ-Conversations/" and "118_HQ-Conversations" seem to be the same
updated a model 5 months ago
cantabile-kwok/lscodec_25hz
View all activity

Organizations

Shanghai Jiao Tong University's profile picture SJTU Cross Media Language Intelligence Lab's profile picture

upvoted a paper 5 months ago

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 143
upvoted a paper 11 months ago

SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

Paper • 2503.08625 • Published Mar 11, 2025 • 27
upvoted a paper over 1 year ago

MobA: A Two-Level Agent System for Efficient Mobile Task Automation

Paper • 2410.13757 • Published Oct 17, 2024 • 32
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs