wenxiang guo's picture

wenxiang guo

verstar

·

AI & ML interests

None yet

Recent Activity

published a dataset 10 days ago

upvoted a paper 24 days ago

Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios

upvoted a paper 24 days ago

Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer

View all activity

Organizations

upvoted 3 papers 24 days ago

Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios

Paper • 2605.28618 • Published 30 days ago • 32

Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer

Paper • 2605.30940 • Published 28 days ago • 38

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

Paper • 2605.30993 • Published 28 days ago • 59

upvoted a collection 5 months ago

Qwen3-TTS

7 items • Updated Jan 22 • 367

upvoted 2 papers about 1 year ago

ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting

Paper • 2504.20630 • Published Apr 29, 2025 • 9

Versatile Framework for Song Generation with Prompt-based Control

Paper • 2504.19062 • Published Apr 27, 2025 • 6