MIT-SLS/USAD-Large
Organization: Spoken Language Systems
Tags: Feature Extraction, Transformers, Safetensors, 6 datasets, English, usad, automatic-speech-recognition, audio-classification, audio, speech, music, custom_code
arXiv: 2506.18843
License: cc-by-nc-sa-4.0
USAD-Large at revision 7b6b150, 1.34 GB
1 contributor
History: 3 commits
Latest commit: 7b6b150 (verified) by vectominist, "Update README.md", 10 months ago
File                    Scan   Size            Last commit message     Updated
.gitattributes          Safe   1.52 kB         initial commit          10 months ago
README.md               Safe   3.02 kB         Update README.md        10 months ago
config.json             Safe   991 Bytes       upload model and code   10 months ago
configuration_usad.py   Safe   2.59 kB         upload model and code   10 months ago
model.safetensors       Safe   1.34 GB (xet)   upload model and code   10 months ago
modeling_usad.py        Safe   498 Bytes       upload model and code   10 months ago
usad_model.py           Safe   6.77 kB         upload model and code   10 months ago
usad_modules.py         Safe   26.7 kB         upload model and code   10 months ago