Zhiyuan Zhu's picture

5 2

Zhiyuan Zhu

dieKarotte

·

AI & ML interests

None yet

Recent Activity

updated a dataset 7 days ago

dieKarotte/SO-Dataset

updated a model 8 days ago

dieKarotte/Spatial-Omni

published a model 11 days ago

dieKarotte/Spatial-BEATs

View all activity

Organizations

upvoted a paper 14 days ago

Spatial-Omni: Spatial Audio Understanding Integration in Multimodal LLMs via FOA Encoding

Paper • 2606.10738 • Published 16 days ago • 2

upvoted a paper 16 days ago

ASAudio: A Survey of Advanced Spatial Audio Research

Paper • 2508.10924 • Published Aug 8, 2025 • 2

upvoted 3 papers 24 days ago

Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios

Paper • 2605.28618 • Published 29 days ago • 32

Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer

Paper • 2605.30940 • Published 27 days ago • 38

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

Paper • 2605.30993 • Published 27 days ago • 59