-
PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings
Paper • 2403.02288 • Published -
MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations
Paper • 2510.10396 • Published -
BUT-FIT/DiCoW_v3_3_large
Automatic Speech Recognition • 2B • Updated • 398 • 1 -
dieKarotte/Spatial-BEATs
Updated • 1
adili tuheti
asiiiiir0105
·
AI & ML interests
AI, ML
Recent Activity
updated a collection 3 days ago
Speech new activity 13 days ago
BUT-FIT/DiCoW_v3_3_large:Update generation_config.json new activity 13 days ago
BUT-FIT/DiCoW_v3_3_large:Update generation_config.jsonOrganizations
None yet