PanChanghao's picture

PanChanghao

DavidPigeon

·

https://david-pigeon.github.io/

DavidPigeon

AI & ML interests

audio synthesis

Recent Activity

upvoted a paper 24 days ago

Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer

upvoted a paper 24 days ago

Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios

upvoted a paper 24 days ago

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

View all activity

Organizations

upvoted 3 papers 24 days ago

Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer

Paper • 2605.30940 • Published 28 days ago • 38

Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios

Paper • 2605.28618 • Published 30 days ago • 32

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

Paper • 2605.30993 • Published 28 days ago • 59

upvoted a paper about 1 month ago

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

Paper • 2605.19833 • Published May 19 • 137

upvoted a paper 2 months ago

WavAlign: Enhancing Intelligence and Expressiveness in Spoken Dialogue Models via Adaptive Hybrid Post-Training

Paper • 2604.14932 • Published Apr 16 • 11

upvoted 2 papers about 1 year ago

ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting

Paper • 2504.20630 • Published Apr 29, 2025 • 9

Versatile Framework for Song Generation with Prompt-based Control

Paper • 2504.19062 • Published Apr 27, 2025 • 6