Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
he-shuwei
/
MS2KU-VTTS
like
0
Text-to-Speech
soundspaces-speech
English
visual-tts
spatial-audio
speech-synthesis
icassp2025
License:
mit
Model card
Files
Files and versions
xet
Community
main
MS2KU-VTTS
/
data
67.9 MB
Ctrl+K
Ctrl+K
1 contributor
History:
5 commits
he-shuwei
Upload data/processed_data/mfa/mfa_outputs.tar.gz with huggingface_hub
ff84c40
verified
about 1 month ago
processed_data
Upload data/processed_data/mfa/mfa_outputs.tar.gz with huggingface_hub
about 1 month ago
raw_data
Upload data/raw_data/captions/test-unseen.jsonl with huggingface_hub
about 1 month ago