Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
PanChanghao's picture
3 7 6

PanChanghao

DavidPigeon
·
https://david-pigeon.github.io/
  • DavidPigeon

AI & ML interests

audio synthesis

Recent Activity

upvoted a paper 24 days ago
Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer
upvoted a paper 24 days ago
Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios
upvoted a paper 24 days ago
SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue
View all activity

Organizations

Zhejiang University's profile picture

liked a Space about 1 month ago
Running
85

ACL Pubcheck

📝
85

Check your PDF for ACL guidelines

liked a Space 5 months ago
Running on Zero
Agents
Featured
1.99k

Qwen3-TTS Demo

🎙
1.99k

Generate speech from text using voice design, cloning or presets

liked a model 5 months ago

stepfun-ai/Step-Audio-R1.1

Audio-Text-to-Text • 33B • Updated Feb 14 • 299 • 182
liked a Space 5 months ago
Running
Agents
22

Fun-ASR-Nano

🚀
22

LLM-powered ASR: 31 languages, Chinese dialects, timestamps

liked a model 5 months ago

nvidia/bigvgan_v2_24khz_100band_256x

Audio-to-Audio • Updated Sep 5, 2024 • 17.7k • 22
liked a dataset 10 months ago

OpenSound/CapSpeech

Viewer • Updated Jun 4, 2025 • 20.8M • 195 • 25
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs