Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Zhiyuan Zhu's picture
5 2

Zhiyuan Zhu

dieKarotte
Gargaz's profile picture asiiiiir0105's profile picture 21world's profile picture
·

AI & ML interests

None yet

Recent Activity

updated a dataset 7 days ago
dieKarotte/SO-Dataset
updated a model 8 days ago
dieKarotte/Spatial-Omni
published a model 11 days ago
dieKarotte/Spatial-BEATs
View all activity

Organizations

MRSAudio's profile picture

upvoted a paper 14 days ago

Spatial-Omni: Spatial Audio Understanding Integration in Multimodal LLMs via FOA Encoding

Paper • 2606.10738 • Published 16 days ago • 2
upvoted a paper 16 days ago

ASAudio: A Survey of Advanced Spatial Audio Research

Paper • 2508.10924 • Published Aug 8, 2025 • 2
upvoted 3 papers 24 days ago

Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios

Paper • 2605.28618 • Published 29 days ago • 32

Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer

Paper • 2605.30940 • Published 27 days ago • 38

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

Paper • 2605.30993 • Published 27 days ago • 59
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs