arxiv:2510.10396
Yu Zhang
AaronZ345
AI & ML interests
Multi-Modal Generative AI (Spatial Audio/Music/Singing/Speech).
Recent Activity
new activity 3 days ago
GTSinger/GTSinger: Annotation quality is very low, not usable for training
new activity about 1 month ago
GTSinger/GTSinger: Annotation quality is very low, not usable for training
authored a paper 7 months ago
MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations