Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
wenxiang guo's picture
4 6 2

wenxiang guo

verstar
nanless's profile picture 21world's profile picture EricsXt's profile picture
·

AI & ML interests

None yet

Recent Activity

published a dataset 10 days ago
verstar/SO-FOA
upvoted a paper 24 days ago
Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios
upvoted a paper 24 days ago
Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer
View all activity

Organizations

Zhejiang University's profile picture Generative AI For Audio's profile picture MRSAudio's profile picture

upvoted 3 papers 24 days ago

Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios

Paper • 2605.28618 • Published 29 days ago • 32

Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer

Paper • 2605.30940 • Published 27 days ago • 38

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

Paper • 2605.30993 • Published 27 days ago • 59
upvoted a collection 5 months ago

Qwen3-TTS

Collection
7 items • Updated Jan 22 • 367
upvoted 2 papers about 1 year ago

ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting

Paper • 2504.20630 • Published Apr 29, 2025 • 9

Versatile Framework for Song Generation with Prompt-based Control

Paper • 2504.19062 • Published Apr 27, 2025 • 6
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs