Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
he-shuwei
/
MS2KU-VTTS
like
0
Text-to-Speech
soundspaces-speech
English
visual-tts
spatial-audio
speech-synthesis
icassp2025
License:
mit
Model card
Files
Files and versions
xet
Community
main
MS2KU-VTTS
/
data
/
raw_data
/
captions
18.1 MB
Ctrl+K
Ctrl+K
1 contributor
History:
4 commits
he-shuwei
Upload data/raw_data/captions/test-unseen.jsonl with huggingface_hub
9fd0c9a
verified
about 1 month ago
test-seen.jsonl
Safe
858 kB
Upload data/raw_data/captions/test-seen.jsonl with huggingface_hub
about 1 month ago
test-unseen.jsonl
Safe
873 kB
Upload data/raw_data/captions/test-unseen.jsonl with huggingface_hub
about 1 month ago
train.jsonl
Safe
16.2 MB
xet
Upload data/raw_data/captions/train.jsonl with huggingface_hub
about 1 month ago
val-mini.jsonl
Safe
167 kB
Upload data/raw_data/captions/val-mini.jsonl with huggingface_hub
about 1 month ago